Difference between revisions of "Indo-Iranian languages"
Jump to navigation
Jump to search
(Created page with "==Status== The ultimate goal is to have multi-purposable transducers for a variety of Indo-Iranian languages. These can then be paired for X→Y translation with the addition ...") |
|||
Line 32: | Line 32: | ||
|| [[apertium-kmr]] ([[languages]]) |
|| [[apertium-kmr]] ([[languages]]) |
||
|| [[User:Francis Tyers|Fran]], [[User:Memduh|Memduh]] |
|| [[User:Francis Tyers|Fran]], [[User:Memduh|Memduh]] |
||
|- |
|||
|| <code>[[apertium-pes]]</code> |
|||
|| [[Iranian Persian]] |
|||
|| <code></code> |
|||
|| <code>pes</code> |
|||
|| [[lttoolbox]] |
|||
|| production |
|||
|align="right"| {{#lst:Apertium-pes/stats|stems}} |
|||
|align="right"| {{#lst:Apertium-pes/stats|paradigms}} |
|||
|align="center"| [[Apertium-pes#Current_State|~{{:Apertium-pes/stats/average}}%]] |
|||
|| [[apertium-pes]] ([[languages]]) |
|||
|| [[User:Francis Tyers|Fran]], ... |
|||
|- |
|||
|| <code>[[apertium-tgk]]</code> |
|||
|| [[Tajik]] |
|||
|| <code>tg</code> |
|||
|| <code>tgk</code> |
|||
|| [[lttoolbox]] |
|||
|| production |
|||
|align="right"| {{#lst:Apertium-tgk/stats|stems}} |
|||
|align="right"| {{#lst:Apertium-tgk/stats|paradigms}} |
|||
|align="center"| [[Apertium-tgk#Current_State|~{{:Apertium-tgk/stats/average}}%]] |
|||
|| [[apertium-tgk]] ([[languages]]) |
|||
|| [[User:Francis Tyers|Fran]], ... |
|||
|- |
|||
|| <code>[[apertium-oss]]</code> |
|||
|| [[Ossetian]] |
|||
|| <code>os</code> |
|||
|| <code>oss</code> |
|||
|| [[lttoolbox]] |
|||
|| production |
|||
|align="right"| {{#lst:Apertium-oss/stats|stems}} |
|||
|align="right"| {{#lst:Apertium-oss/stats|paradigms}} |
|||
|align="center"| [[Apertium-oss#Current_State|~{{:Apertium-oss/stats/average}}%]] |
|||
|| [[apertium-oss]] ([[languages]]) |
|||
|| [[User:Francis Tyers|Fran]], ... |
|||
|- |
|||
|| <code>[[apertium-glk]]</code> |
|||
|| [[Gilaki]] |
|||
|| <code></code> |
|||
|| <code>glk</code> |
|||
|| [[lttoolbox]] |
|||
|| production |
|||
|align="right"| {{#lst:Apertium-glk/stats|stems}} |
|||
|align="right"| {{#lst:Apertium-glk/stats|paradigms}} |
|||
|align="center"| [[Apertium-oss#Current_State|~{{:Apertium-glk/stats/average}}%]] |
|||
|| [[apertium-glk]] ([[languages]]) |
|||
|| ... |
|||
|- |
|- |
||
|} |
|} |
Revision as of 12:48, 22 August 2016
Status
The ultimate goal is to have multi-purposable transducers for a variety of Indo-Iranian languages. These can then be paired for X→Y translation with the addition of a CG for language X and transfer rules / dictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.
Transducers
Once a transducer has ~80% coverage on a range of medium-large corpora we can say it is "working". Over 90% and it can be considered to be "production".
name | Language | ISO 639 | formalism | state | stems | paradigms | coverage | location | primary authors | |
---|---|---|---|---|---|---|---|---|---|---|
-2 | -3 | |||||||||
apertium-kmr
|
Kurdish (Kurmanji) | ku
|
kmr
|
lttoolbox | production | 17,771 | 157 | [[Apertium-kmr#Current_State|~Apertium-kmr/stats/average%]] | apertium-kmr (languages) | Fran, Memduh |
apertium-pes
|
Iranian Persian |
|
pes
|
lttoolbox | production | 13,167 | 113 | [[Apertium-pes#Current_State|~Apertium-pes/stats/average%]] | apertium-pes (languages) | Fran, ... |
apertium-tgk
|
Tajik | tg
|
tgk
|
lttoolbox | production | 2,784 | 79 | [[Apertium-tgk#Current_State|~Apertium-tgk/stats/average%]] | apertium-tgk (languages) | Fran, ... |
apertium-oss
|
Ossetian | os
|
oss
|
lttoolbox | production | 111 | ~17% | apertium-oss (languages) | Fran, ... | |
apertium-glk
|
Gilaki |
|
glk
|
lttoolbox | production | 4 | 28 | [[Apertium-oss#Current_State|~Apertium-glk/stats/average%]] | apertium-glk (languages) | ... |