Difference between revisions of "Apertium-test/teststats/"
Jump to navigation
Jump to search
Line 57: | Line 57: | ||
* words: <section begin=wp2017-words />850K<section end=wp2017-words /> |
* words: <section begin=wp2017-words />850K<section end=wp2017-words /> |
||
* coverage: ~<section begin=wp2017-coverage />54.7<section end=wp2017-coverage />% |
* coverage: ~<section begin=wp2017-coverage />54.7<section end=wp2017-coverage />% |
||
* as of: r65751 |
|||
wp2017 |
|||
* words: <section begin=wp2017-words />4184206<section end=wp2017-words /> |
|||
* coverage: ~<section begin=wp2017-coverage />88.7<section end=wp2017-coverage />% |
|||
* as of: r65751 |
* as of: r65751 |
||
[[Category:Datastats]] |
[[Category:Datastats]] |
Revision as of 13:18, 26 December 2017
The language
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
In Apertium
Over-all stats
- average: ~94.5%
- stems: 35,892 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
- rlx rules: 140 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
- vanilla stems: 26,748 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
Corpora
Әуезов
- words: 155K
- coverage: ~92.89%
- as of: r65751
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~95.29%
- as of: r65751
azattyq2010
- words: 3.2M
- coverage: ~95.07%
- as of: r65817
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~90.72%
- as of: r65751
wp2013
- words: 18.2M
- coverage: ~90.10%
- as of: r65751
quran
- words: 107K
- coverage: ~96.71%
- as of: r65751
UDHR
- words: 1.5K
- coverage: ~96.86%
- as of: r65817
- wikipage: UDHR
wp2017
- words: 850K
- coverage: ~54.7%
- as of: r65751
wp2017
- words: 4184206
- coverage: ~88.7%
- as of: r65751