Difference between revisions of "Apertium-kaz/stats"
Jump to navigation
Jump to search
Line 11: | Line 11: | ||
== Over-all stats == |
== Over-all stats == |
||
* '''average''': ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />% |
* '''average''': ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />% |
||
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc stems]''': <section begin=stems /> |
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc stems]''': <section begin=stems />35,273<section end=stems /> as of r81604 by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 21:46, 15 August 2017 (CEST), run by firespeaker |
||
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.rlx rlx rules]''': <section begin=rlx_rules /> |
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.rlx rlx rules]''': <section begin=rlx_rules />136<section end=rlx_rules /> as of r81435 by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 21:46, 15 August 2017 (CEST), run by firespeaker |
||
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc vanilla stems]''': <section begin=vanilla_stems /> |
*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc vanilla stems]''': <section begin=vanilla_stems />26,257<section end=vanilla_stems /> as of r81604 by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 21:46, 15 August 2017 (CEST), run by firespeaker |
||
== Corpora == |
== Corpora == |
Revision as of 19:46, 15 August 2017
The language
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
In Apertium
Over-all stats
- average: ~94.5%
- stems: 35,273 as of r81604 by jonorthwash ~ StemCounterBot (talk) 21:46, 15 August 2017 (CEST), run by firespeaker
- rlx rules: 136 as of r81435 by jonorthwash ~ StemCounterBot (talk) 21:46, 15 August 2017 (CEST), run by firespeaker
- vanilla stems: 26,257 as of r81604 by jonorthwash ~ StemCounterBot (talk) 21:46, 15 August 2017 (CEST), run by firespeaker
Corpora
Әуезов
- words: 155K
- coverage: ~92.89%
- as of: r65751
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~95.29%
- as of: r65751
azattyq2010
- words: 3.2M
- coverage: ~95.07%
- as of: r65817
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~90.72%
- as of: r65751
wp2013
- words: 18.2M
- coverage: ~90.10%
- as of: r65751
quran
- words: 107K
- coverage: ~96.71%
- as of: r65751
UDHR
- words: 1.5K
- coverage: ~96.86%
- as of: r65817
- wikipage: UDHR