Difference between revisions of "Sorani"
Jump to navigation
Jump to search
(Created page with "Details on building a Sorani transducer and related language pairs") |
|||
Line 1: | Line 1: | ||
Details on building a Sorani transducer and related language pairs |
Details on building a Sorani transducer and related language pairs |
||
==Grammar and Paradigms== |
|||
We are largely taking the paradigms from [https://www.fas.harvard.edu/~iranian/Sorani/sorani_1_grammar.pdf Thackston's grammar], Amuzesh-e Zeban-e Kordi by Sayyed Mohammad Sina Ahmadi may be consulted in the future. |
|||
==ZWNJ and ه == |
|||
As a rule of thumb we are not adding or leaving zero-width non joiner characters at the end of word entries in the dictionaries. Two variants of the same character, ه, are used and these can stand for each other as defined in the ACX file. In "standard, proper" written Sorani it seems that the variant that breaks the abjad is used in typing. Both variants can be used at the end of words and paradigms, while throughout words and paradigms we will use the usual, non-abjad breaking ه together with a ZWNJ character. |
|||
{| class="wikitable" |
|||
|- |
|||
! Character |
|||
! Connection |
|||
! Hexdump |
|||
! Note |
|||
|- |
|||
| ه |
|||
| هت |
|||
| '''87d9''' aad8 000a |
|||
| Connects the usual way for an abjad. |
|||
|- |
|||
| ە |
|||
| ەت |
|||
| '''95db''' aad8 000a |
|||
| Does not connect. |
|||
|} |
Revision as of 10:34, 19 September 2016
Details on building a Sorani transducer and related language pairs
Grammar and Paradigms
We are largely taking the paradigms from Thackston's grammar, Amuzesh-e Zeban-e Kordi by Sayyed Mohammad Sina Ahmadi may be consulted in the future.
ZWNJ and ه
As a rule of thumb we are not adding or leaving zero-width non joiner characters at the end of word entries in the dictionaries. Two variants of the same character, ه, are used and these can stand for each other as defined in the ACX file. In "standard, proper" written Sorani it seems that the variant that breaks the abjad is used in typing. Both variants can be used at the end of words and paradigms, while throughout words and paradigms we will use the usual, non-abjad breaking ه together with a ZWNJ character.
Character | Connection | Hexdump | Note |
---|---|---|---|
ه | هت | 87d9 aad8 000a | Connects the usual way for an abjad. |
ە | ەت | 95db aad8 000a | Does not connect. |