Difference between revisions of "Indo-Iranian languages"

From Apertium
Jump to navigation Jump to search
Line 35: Line 35:
|| <code>[[apertium-pes]]</code>
|| <code>[[apertium-pes]]</code>
|| [[Iranian Persian]]
|| [[Iranian Persian]]
|| <code></code>
|| <code>fa</code>
|| <code>pes</code>
|| <code>pes</code>
|| [[lttoolbox]]
|| [[lttoolbox]]
Line 89: Line 89:
* Southeastern: [[Pashto]]
* Southeastern: [[Pashto]]
* Northeastern: [[Ossetian]]
* Northeastern: [[Ossetian]]

[[Category:Indo-Iranian languages|*]]

Revision as of 12:54, 22 August 2016

Status

The ultimate goal is to have multi-purposable transducers for a variety of Indo-Iranian languages. These can then be paired for X→Y translation with the addition of a CG for language X and transfer rules / dictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.

Transducers

Once a transducer has ~80% coverage on a range of medium-large corpora we can say it is "working". Over 90% and it can be considered to be "production".

name Language ISO 639 formalism state stems paradigms coverage location primary authors
-2 -3
apertium-kmr Kurdish (Kurmanji) ku kmr lttoolbox production 17,771 157 [[Apertium-kmr#Current_State|~Apertium-kmr/stats/average%]] apertium-kmr (languages) Fran, Memduh
apertium-pes Iranian Persian fa pes lttoolbox production 13,167 113 [[Apertium-pes#Current_State|~Apertium-pes/stats/average%]] apertium-pes (languages) Fran, ...
apertium-tgk Tajik tg tgk lttoolbox production 2,784 79 [[Apertium-tgk#Current_State|~Apertium-tgk/stats/average%]] apertium-tgk (languages) Fran, ...
apertium-oss Ossetian os oss lttoolbox production 111 ~17% apertium-oss (languages) Fran, ...
apertium-glk Gilaki glk lttoolbox production 4 28 [[Apertium-oss#Current_State|~Apertium-glk/stats/average%]] apertium-glk (languages) ...

Classification