Apertium-kaz

From Apertium
Revision as of 01:04, 31 December 2011 by Francis Tyers (talk | contribs)
Jump to navigation Jump to search

Kazmorph is a morphological analyser/generator for Kazakh, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.

Installation

kazmorph is currently located in ky-kk.

Dependency tree

  • hfst (svn ≥r1916)
    • foma

Current State

  • Number of stems: 9,306
  • Coverage: 94.1
corpus words coverage
Әуезов 155K ~83.2%
bible 577K ~85.5%
azattyq 2010 3.2M ~85.4%
wp 2011-11 0.84M ~79.6%

To-do

Improve coverage

  • Causitives
  • collective numbers
  • fix demonstratives
  • vowel harmony of single-syllable words with у and и

Future

  • run tests on morphophonology