Languages of Central Asia

From Apertium
Revision as of 09:12, 9 January 2014 by Firespeaker (talk | contribs) (Created page with "The languages of Central Asia include several Turkic and Iranian languages spoken in Kazakhstan, Uzbekistan, Kyrgyzstan, Turkmenista...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

The languages of Central Asia include several Turkic and Iranian languages spoken in Kazakhstan, Uzbekistan, Kyrgyzstan, Turkmenistan, Tajikistan, and Afghanistan. These include Kazakh, Kyrgyz, Uzbek, Turkmen, Tajik, Dari, Pashto, Uyghur, and Karakalpak.

The master plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair. The current status of these goals is listed below.

Status

Transducers

Existing language pairs

kaz kir tuk uzb uig tgk kaa prs
kaz - kaz-kir
kaz-kaa
kir kaz-kir
- kir-uzb
tuk -
uzb kir-uzb
-
uig -
tgk -
kaa kaz-kaa
-
prs -
eng eng-kaz
ky-en
fas tg-fa
khk khk-kaz
nog nog-kaz
tat 'kaz-tat
'
tat-kir
tur tur-kir
tuk-tur
tur-uzb