Difference between revisions of "Kazakh and Tatar"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) |
Firespeaker (talk | contribs) |
||
Line 15: | Line 15: | ||
== Developers == |
== Developers == |
||
Information on what remains to be done for this pair can be found at the [[/TODO]] list. |
Information on what remains to be done for this pair can be found at the [[/TODO]] list. |
||
=== Development workflow === |
|||
We work on the transducers ([[apertium-kaz]] and [[apertium-tat]]) individually, and use a special process to import to the pair transducers that contain only the words found in the bidix. The following documents this process: |
|||
* … |
|||
[[Category:Kazakh and Tatar|*]] |
[[Category:Kazakh and Tatar|*]] |
Revision as of 19:55, 21 January 2013
This is a language pair translating between Kazakh and Tatar. The pair is currently located in incubator, but it is expected that it will soon be moved to staging.
General information
- The Kazakh transducer has 36,595 stems and ~94.5% coverage over random corpora
- The Tatar transducer has 55,702 stems and ~91% coverage over random corpora
Installation
You will need:
- hfst (svn ≥r1916)
- foma
- flex
- foma
- apertium
- lttoolbox
Developers
Information on what remains to be done for this pair can be found at the /TODO list.
Development workflow
We work on the transducers (apertium-kaz and apertium-tat) individually, and use a special process to import to the pair transducers that contain only the words found in the bidix. The following documents this process:
- …