Difference between revisions of "Indo-Iranian languages"
Jump to navigation
Jump to search
Line 82: | Line 82: | ||
|- |
|- |
||
|} |
|} |
||
==Classification== |
|||
* Southwestern: [[Iranian Persian]], [[Tajik]] |
|||
* Northwestern: [[Kurdish]] ([[Kurmanji]], [[Sorani]]) |
|||
* Southeastern: [[Pashto]] |
|||
* Northeastern: [[Ossetian]] |
Revision as of 12:52, 22 August 2016
Status
The ultimate goal is to have multi-purposable transducers for a variety of Indo-Iranian languages. These can then be paired for X→Y translation with the addition of a CG for language X and transfer rules / dictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.
Transducers
Once a transducer has ~80% coverage on a range of medium-large corpora we can say it is "working". Over 90% and it can be considered to be "production".
name | Language | ISO 639 | formalism | state | stems | paradigms | coverage | location | primary authors | |
---|---|---|---|---|---|---|---|---|---|---|
-2 | -3 | |||||||||
apertium-kmr
|
Kurdish (Kurmanji) | ku
|
kmr
|
lttoolbox | production | 17,771 | 157 | [[Apertium-kmr#Current_State|~Apertium-kmr/stats/average%]] | apertium-kmr (languages) | Fran, Memduh |
apertium-pes
|
Iranian Persian |
|
pes
|
lttoolbox | production | 13,167 | 113 | [[Apertium-pes#Current_State|~Apertium-pes/stats/average%]] | apertium-pes (languages) | Fran, ... |
apertium-tgk
|
Tajik | tg
|
tgk
|
lttoolbox | production | 2,784 | 79 | [[Apertium-tgk#Current_State|~Apertium-tgk/stats/average%]] | apertium-tgk (languages) | Fran, ... |
apertium-oss
|
Ossetian | os
|
oss
|
lttoolbox | production | 111 | ~17% | apertium-oss (languages) | Fran, ... | |
apertium-glk
|
Gilaki |
|
glk
|
lttoolbox | production | 4 | 28 | [[Apertium-oss#Current_State|~Apertium-glk/stats/average%]] | apertium-glk (languages) | ... |