Difference between revisions of "Apertium-kaz"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) (→Current State: quran) |
Firespeaker (talk | contribs) |
||
Line 15: | Line 15: | ||
== Current State == |
== Current State == |
||
* Number of stems: {{: |
* Number of stems: {{:Apertium-kaz/stems}} |
||
* Coverage: {{: |
* Coverage: {{:Apertium-kaz/coverage/average}} |
||
{| class="wikitable" |
{| class="wikitable" |
||
Line 24: | Line 24: | ||
|[[Әуезов corpus|Әуезов]] |
|[[Әуезов corpus|Әуезов]] |
||
|align="right"|155K |
|align="right"|155K |
||
| ~{{: |
| ~{{:Apertium-kaz/coverage/Әуезов}}% |
||
|- |
|- |
||
| bible |
| bible |
||
|align="right"| {{:bible corpora/kk/stems}} |
|align="right"| {{:bible corpora/kk/stems}} |
||
| ~{{: |
| ~{{:Apertium-kaz/coverage/bible}}% |
||
|- |
|- |
||
| [[RFERL corpora|azattyq]] 2010 |
| [[RFERL corpora|azattyq]] 2010 |
||
|align="right"| {{:RFERL corpus/kk/2010/stems}} |
|align="right"| {{:RFERL corpus/kk/2010/stems}} |
||
| ~{{: |
| ~{{:Apertium-kaz/coverage/rferl2010}}% |
||
|- |
|- |
||
|wp 2011-11 |
|wp 2011-11 |
||
|align="right"| 0.84M |
|align="right"| 0.84M |
||
| ~{{: |
| ~{{:Apertium-kaz/coverage/wp}}% |
||
|- |
|- |
||
| quran |
| quran |
||
|align="right"| 107K |
|align="right"| 107K |
||
| ~{{: |
| ~{{:Apertium-kaz/coverage/quran}}% |
||
|- |
|- |
||
|} |
|} |
Revision as of 06:36, 20 August 2012
Kazmorph is a morphological analyser/generator for Kazakh, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.
Installation
kazmorph is currently located in incubator/apertium-kaz.
Dependency tree
- hfst (svn ≥r1916)
- foma
- flex
- foma
- apertium
- lttoolbox
Current State
- Number of stems: 9,306
- Coverage: 94.1
corpus | words | coverage |
---|---|---|
Әуезов | 155K | ~83.2% |
bible | 577K | ~85.5% |
azattyq 2010 | 3.2M | ~85.4% |
wp 2011-11 | 0.84M | ~79.6% |
quran | 107K | ~87.3% |
To-do
Improve coverage
- Causitives
- collective numbers
- fix demonstratives
- vowel harmony of single-syllable words with у and и
Future
- run tests on morphophonology