Apertium-kaz/stats
< Apertium-kaz
Jump to navigation
Jump to search
Revision as of 12:09, 13 November 2015 by StemCounterBot (talk | contribs)
The language
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
Over-all stats
- average: ~92.1%
- authors: Ilnar, Jonathan, Fran, Aida, Beknazar, Nathan
- location: apertium-kaz (languages)
- stems: 23,568 as of r62812 by jonorthwash ~ StemCounterBot (talk) 13:09, 13 November 2015 (CET), run by Unhammer
- rlx rules: 127 as of r62644 by aidana1 ~ StemCounterBot (talk) 13:09, 13 November 2015 (CET), run by Unhammer
- vanilla stems: 16,371 as of r62812 by jonorthwash ~ StemCounterBot (talk) 13:09, 13 November 2015 (CET), run by Unhammer
Corpora
Әуезов
- words: 155K
- coverage: ~90.62%
- as of: r55794
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~93.02%
- as of: r55794
azattyq2010
- words: 3.2M
- coverage: ~93.20%
- as of: r55794
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~86.93%
- as of: r55794
wp2013
- words: 18.2M
- coverage: ~87.11%
- as of: r55794
quran
- words: 107K
- coverage: ~95.40%
- as of: r55794
UDHR
- words: 1.5K
- coverage: ~93.07%
- as of: r55794
- wikipage: UDHR