Difference between revisions of "Apertium Turkic"
Firespeaker (talk | contribs) |
Firespeaker (talk | contribs) |
||
Line 53: | Line 53: | ||
* [[apertium-eng-kaz|English-Kazakh]] (occasional consultation) |
* [[apertium-eng-kaz|English-Kazakh]] (occasional consultation) |
||
Transducers: |
Transducers: |
||
* [[apertium- |
* [[apertium-kaz|Kazakh]] (developed much of morphotactics and morphophonology) |
||
* [[apertium- |
* [[apertium-kir|Kyrgyz]] (developed almost entirety of morphotactics and morphophonology) |
||
* [[apertium- |
* [[apertium-tat|Tatar]] (helped develop morphotactics and morphophonology) |
||
* [[apertium-bak|Bashqort]] (helped develop morphotactics and morphophonology) |
* [[apertium-bak|Bashqort]] (helped develop morphotactics and morphophonology) |
||
* [[apertium-chv|Chuvash]] (helped develop morphotactics and morphophonology) |
* [[apertium-chv|Chuvash]] (helped develop morphotactics and morphophonology) |
||
* [[apertium-tkm|Turkmen]] (helped develop morphotactics and morphophonology) |
* [[apertium-tkm|Turkmen]] (helped develop morphotactics and morphophonology) |
||
* [[apertium- |
* [[apertium-kum|Kumyk]] (helped develop morphotactics and morphophonology) |
||
* [[apertium- |
* [[apertium-nog|Nogay]] (helped develop morphotactics and morphophonology) |
||
| |
| |
||
* Uzbek-Kyrgyz |
* Uzbek-Kyrgyz |
||
Line 76: | Line 76: | ||
| |
| |
||
|- |
|- |
||
| [[Image:mlf-photo.jpg|100px|center]] || Mikel Forcada || |
| [[Image:mlf-photo.jpg|100px|center]] || Mikel Forcada || mlforcada || |
||
* [[apertium-eng-kaz|English-Kazakh]] |
|||
| |
|||
|- |
|- |
||
| || Aida Sundetova |
| || Aida Sundetova || || |
||
* [[apertium-eng-kaz|English-Kazakh]] |
|||
| |
|||
|} |
|} |
||
Revision as of 19:25, 4 January 2014
The Apertium Turkic working group includes everyone who works on Turkic-language resources as part of the Apertium project. Resources we develop include not just Machine Translation systems, but their underlying components which can be repurposed, including morphological transducers, disambiguators, and dictionaries.
Translation pairs
We have done quite a bit of work on Machine Translation systems involving Turkic languages. This section provides a short overview of some of them, roughly in order of how well they work.
Released
- Our Kazakh-Tatar system was developed largely by Ilnar, who did the majority of work on it as his GSoC 2012 project. The project was overseen by Jonathan, who did a lot of work on the transducers (especially Kazakh), and Fran. The system was deemed production-ready and released during summer of 2013, and work is ongoing to increase its accuracy.
Near production quality
The following pairs are all near production quality, but have suffered from stalled development.
- The Turkish-Kyrgyz pair was developed in the summer of 2011 by Mirlan Ipasov under the supervision of Jonathan, and was our first Turkic-Turkic pair using HFST. Mirlan and Jonathan's work on the Kyrgyz transducer paved the way for other Turkic pairs. The pair needs some work to be brought up to date to work with newer transducers.
- The Kazakh-Kyrgyz pair was largely developed by Qantörö under the supervision of Jonathan, but is not yet production-ready.
- The Uzbek-Turkish pair was largely developed by Akın under the supervision of Gianluca, but is not yet production-ready.
Under development
The following pairs are under active development, but are a ways from being production-ready:
- The English-Kazakh pair is being worked on by Aida Sundetova under the supervision of Mikel Forcada.
Prototypes
The following pairs are prototypes that could blossom if given proper attention.
- Tatar-Bashqort
- Chuvash-Turkish
- Khalkha-Kazakh
- Chuvash-Tatar
- Tatar-Turkish
- Qaraqalpaq-Kazakh
- The Azeri-Turkish pair was originally developed by Gianluca, but azmorph is obsolete.
- The Turkmen-Turkish pair needs some attention.
Planned for the future
There are pairs that Apertium Turkic developers would like to see exist at some point.
- Uzbek-Kyrgyz
- Qaraqalpaq-Uzbek
- Kazakh-Kumyk
- Kazakh-Nogay
People
Active contributors
Photo | Name | IRC nick | Turkic projects involved in (role) | Other Turkic projects interested in |
---|---|---|---|---|
Francis Morton Tyers (wiki · email) |
spectie, spectei, spectre | |||
Jonathan North Washington (wiki · email) |
firespeaker, jonorthwash, kd5cfx |
Pairs:
Transducers:
|
| |
Ilnar Salimzyanov (wiki · email) |
selimcan |
|
| |
Gianluca Grossi (wiki) |
zfe |
|
||
Mikel Forcada | mlforcada | |||
Aida Sundetova |
Contributors emeritus
Photo | Name | IRC nick | Turkic projects involved in (role) | Other Turkic projects interested in |
---|---|---|---|---|
Mirlan Ipasov | Turkish-Kyrgyz | |||
Hèctor Alòs i Font | Chuvash-Turkish | |||
Röstäm Batalov | Tatar-Bashqort | |||
Akın Dalkı | Uzbek-Turkish |
Contact
To contact the Apertium Turkic team, you can find us on apertium's IRC channel, send one of us a message through the wiki, or send a message to ___PRaddresshere___.
We maintain a low-traffic mailing list (apertium-turkic@lists.sourceforge.net) where occasional discussion and announcements occur. See our archives or subscribe to join in on the fun!