Apertium-kaz/stats
Revision as of 18:35, 19 August 2015 by StemCounterBot
The language
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
Overall stats
- average: ~92.1%
- authors: Ilnar, Jonathan, Fran, Aida, Beknazar, Nathan
- location: apertium-kaz (languages)
- stems: 21,404 as of r61478 by aida27 ~ StemCounterBot (talk) 20:35, 19 August 2015 (CEST), run by firespeaker
- rlx rules: 125 as of r61315 by aida27 ~ StemCounterBot (talk) 20:35, 19 August 2015 (CEST), run by firespeaker
- vanilla stems: 14,216 as of r61478 by aida27 ~ StemCounterBot (talk) 20:35, 19 August 2015 (CEST), run by firespeaker
Corpora
Әуезов
- words: 155K
- coverage: ~90.62%
- as of: r55794
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~93.02%
- as of: r55794
azattyq2010
- words: 3.2M
- coverage: ~93.20%
- as of: r55794
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~86.93%
- as of: r55794
wp2013
- words: 18.2M
- coverage: ~87.11%
- as of: r55794
quran
- words: 107K
- coverage: ~95.40%
- as of: r55794
UDHR
- words: 1.5K
- coverage: ~93.07%
- as of: r55794
- wikipage: UDHR
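Aggregating the coverage figures
The page does not say how the overall average is derived from the per-corpus figures. As a minimal sketch, the snippet below shows two plausible ways to aggregate the numbers listed under Corpora: a plain mean over corpora and a mean weighted by corpus size. The names `CORPORA`, `unweighted_mean` and `weighted_mean` are illustrative and not part of any Apertium tool, and the word counts are the rounded values shown above, so the results are approximate. Neither result is guaranteed to reproduce the quoted average, which may have been computed from a different revision or a different set of corpora.

```python
# Per-corpus figures copied from the Corpora section (all as of r55794).
# Each entry is (approximate word count, coverage in percent).
# Word counts are the rounded values from the page, so results are approximate.
CORPORA = {
    "Әуезов": (155_000, 90.62),
    "bible": (577_000, 93.02),
    "azattyq2010": (3_200_000, 93.20),
    "wp2011": (850_000, 86.93),
    "wp2013": (18_200_000, 87.11),
    "quran": (107_000, 95.40),
    "UDHR": (1_500, 93.07),
}


def unweighted_mean(corpora):
    """Plain mean of the coverage percentages, one vote per corpus."""
    return sum(cov for _, cov in corpora.values()) / len(corpora)


def weighted_mean(corpora):
    """Mean of the coverage percentages weighted by corpus word count."""
    total_words = sum(words for words, _ in corpora.values())
    return sum(words * cov for words, cov in corpora.values()) / total_words


if __name__ == "__main__":
    print(f"unweighted mean coverage: {unweighted_mean(CORPORA):.2f}%")
    print(f"word-weighted mean coverage: {weighted_mean(CORPORA):.2f}%")
```

The weighted figure is dominated by wp2013, which holds most of the words, so it sits close to that corpus's coverage; the unweighted figure treats the 1.5K-word UDHR text the same as the 18.2M-word Wikipedia dump.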