Difference between revisions of "Apertium-test/teststats/"
Jump to navigation
Jump to search
(4 intermediate revisions by the same user not shown) | |||
Line 17: | Line 17: | ||
== Corpora == |
== Corpora == |
||
[https://google.com google] |
|||
wp2012 |
|||
* words: <section begin=wp2017-words />4.8M<section end=wp2017-words /> |
|||
* coverage: ~<section begin=wp2017-coverage />92.3<section end=wp2017-coverage />% |
|||
* as of: r76449 |
|||
wp2018 |
|||
* words: <section begin=wp2017-words />4.8M<section end=wp2017-words /> |
|||
* coverage: ~<section begin=wp2017-coverage />92.3<section end=wp2017-coverage />% |
|||
* as of: r76449 |
|||
wp2017 |
wp2017 |
Latest revision as of 19:49, 3 January 2018
The language[edit]
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
In Apertium[edit]
Over-all stats[edit]
- average: ~Apertium-test/stats/average/%
- stems: 35,892 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
- rlx rules: 140 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
- vanilla stems: 26,748 as of r83396 by spectre360 ~ StemCounterBot (talk) 09:57, 24 December 2017 (CET), run by grzegorzs_
Corpora[edit]
wp2017
- words: 3.8M
- coverage: ~93.3%
- as of: r76449