Kazakh and Tatar
- The Kazakh transducer has 36,595 stems and ~94.5% coverage over random corpora
- The Tatar transducer has 55,702 stems and ~91% coverage over random corpora
You will need:
- hfst (svn ≥r1916)
Information on what remains to be done for this pair can be found at the /TODO list.
We work on the transducers (apertium-kaz and apertium-tat) individually, and use a special process to import to the pair transducers that contain only the words found in the bidix. The following documents this process: