Apertium-kaz/stats
< Apertium-kaz
Jump to navigation
Jump to search
Revision as of 02:06, 28 February 2020 by StemCounterBot (talk | contribs)
The language[edit]
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
In Apertium[edit]
Over-all stats[edit]
- average: ~94.5%
- stems: 36,595 as of 51fc4e by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
- rlx rules: 150 as of aab480 by jonorthwash ~ StemCounterBot (talk) 06:22, 3 June 2019 (CEST), run by firespeaker
- vanilla stems: 27,433 as of 51fc4e by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
- rules: 150 as of aab480 by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
Corpora[edit]
Әуезов
- words: 155K
- coverage: ~92.89%
- as of: r65751
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~95.29%
- as of: r65751
azattyq2010
- words: 3.2M
- coverage: ~95.07%
- as of: r65817
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~90.72%
- as of: r65751
wp2013
- words: 18.2M
- coverage: ~90.10%
- as of: r65751
quran
- words: 107K
- coverage: ~96.71%
- as of: r65751
UDHR
- words: 1.5K
- coverage: ~96.86%
- as of: r65817
- wikipage: UDHR