Apertium-kaz/stats
< Apertium-kaz
Jump to navigation
Jump to search
Revision as of 20:29, 18 December 2014 by StemCounterBot (talk | contribs)
Over-all stats
- native name: қазақ тілі
- average: ~92.1%
- authors: Ilnar, Jonathan, Fran, Aida, Beknazar, Nathan
- location: apertium-kaz (languages)
- stems: 18,264 as of r58005 by selimcan ~ StemCounterBot (talk) 21:29, 18 December 2014 (CET), run by sushain
- vanilla stems: 11,743 as of r58005 by selimcan ~ StemCounterBot (talk) 21:29, 18 December 2014 (CET), run by sushain
- rlx rules: 106 as of r57846 by aida27 ~ StemCounterBot (talk) 07:04, 16 December 2014 (CET), run by sushain
Corpora
Әуезов
- words: 155K
- coverage: ~90.62%
- as of: r55794
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~93.02%
- as of: r55794
azattyq2010
- words: 3.2M
- coverage: ~93.20%
- as of: r55794
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~86.93%
- as of: r55794
wp2013
- words: 18.2M
- coverage: ~87.11%
- as of: r55794
quran
- words: 107K
- coverage: ~95.40%
- as of: r55794
UDHR
- words: 1.5K
- coverage: ~93.07%
- as of: r55794
- wikipage: UDHR