Difference between revisions of "Apertium-kaz/stats"
Jump to navigation
Jump to search
(36 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
== |
== The language == |
||
* '''native name''': <section begin=nativename />қазақ тілі<section end=nativename /> |
* '''native name''': <section begin=nativename />қазақ тілі<section end=nativename /> |
||
* ''' |
* '''families''': <section begin=families />[[Turkic languages]]<section end=families /> |
||
* '''areas''': <section begin=areas />[[Languages of Central Asia]], [[Languages of the former Soviet Union]]<section end=areas /> |
|||
== In Apertium == |
|||
* '''authors''': <section begin=authors />[[User:Ilnar.salimzyan|Ilnar]], [[User:Firespeaker|Jonathan]], [[User:Francis Tyers|Fran]], Aida, [[User:Beknazar|Beknazar]], [[User:nathan0n5ire|Nathan]]<section end=authors /> |
* '''authors''': <section begin=authors />[[User:Ilnar.salimzyan|Ilnar]], [[User:Firespeaker|Jonathan]], [[User:Francis Tyers|Fran]], Aida, [[User:Beknazar|Beknazar]], [[User:nathan0n5ire|Nathan]]<section end=authors /> |
||
* '''location''': <section begin=location />[[apertium-kaz]] ([[languages]])<section end=location /> |
* '''location''': <section begin=location />[[apertium-kaz]] ([[languages]])<section end=location /> |
||
⚫ | |||
⚫ | *'''[https:// |
||
== Over-all stats == |
|||
⚫ | *'''[https:// |
||
* '''average''': ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />% |
|||
⚫ | *'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.lexc stems]''': <section begin=stems />36,595<section end=stems /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/51fc4e532eb61e589a796116ab973069f26b7145/apertium-kaz.kaz.lexc 51fc4e] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie |
||
⚫ | *'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.rlx rlx rules]''': <section begin=rlx_rules />150<section end=rlx_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/aab4808acc1dd3625747d2722bf78fee959afd83/apertium-kaz.kaz.rlx aab480] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 06:22, 3 June 2019 (CEST), run by firespeaker |
||
⚫ | *'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.lexc vanilla stems]''': <section begin=vanilla_stems />27,433<section end=vanilla_stems /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/51fc4e532eb61e589a796116ab973069f26b7145/apertium-kaz.kaz.lexc 51fc4e] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie |
||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.rlx rules]''': <section begin=rules />150<section end=rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/aab4808acc1dd3625747d2722bf78fee959afd83/apertium-kaz.kaz.rlx aab480] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie |
|||
== Corpora == |
== Corpora == |
||
Әуезов |
Әуезов |
||
* words: <section begin=Әуезов-words />155K<section end=Әуезов-words /> |
* words: <section begin=Әуезов-words />155K<section end=Әуезов-words /> |
||
* coverage: ~<section begin=Әуезов-coverage /> |
* coverage: ~<section begin=Әуезов-coverage />92.89<section end=Әуезов-coverage />% |
||
* as of: |
* as of: r65751 |
||
* wikipage: <section begin=Әуезов-wikipage />Әуезов corpus<section end=Әуезов-wikipage /> |
* wikipage: <section begin=Әуезов-wikipage />Әуезов corpus<section end=Әуезов-wikipage /> |
||
bible |
bible |
||
* words: <section begin=bible-words />577K<section end=bible-words /> |
* words: <section begin=bible-words />577K<section end=bible-words /> |
||
* coverage: ~<section begin=bible-coverage /> |
* coverage: ~<section begin=bible-coverage />95.29<section end=bible-coverage />% |
||
* as of: |
* as of: r65751 |
||
azattyq2010 |
azattyq2010 |
||
* words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words /> |
* words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words /> |
||
* coverage: ~<section begin=azattyq2010-coverage /> |
* coverage: ~<section begin=azattyq2010-coverage />95.07<section end=azattyq2010-coverage />% |
||
* as of: |
* as of: r65817 |
||
* wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage /> |
* wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage /> |
||
wp2011 |
wp2011 |
||
* words: <section begin=wp2011-words />850K<section end=wp2011-words /> |
* words: <section begin=wp2011-words />850K<section end=wp2011-words /> |
||
* coverage: ~<section begin=wp2011-coverage /> |
* coverage: ~<section begin=wp2011-coverage />90.72<section end=wp2011-coverage />% |
||
* as of: |
* as of: r65751 |
||
wp2013 |
wp2013 |
||
* words: <section begin=wp2013-words />18.2M<section end=wp2013-words /> |
* words: <section begin=wp2013-words />18.2M<section end=wp2013-words /> |
||
* coverage: ~<section begin=wp2013-coverage /> |
* coverage: ~<section begin=wp2013-coverage />90.10<section end=wp2013-coverage />% |
||
* as of: |
* as of: r65751 |
||
quran |
quran |
||
* words: <section begin=quran-words />107K<section end=quran-words /> |
* words: <section begin=quran-words />107K<section end=quran-words /> |
||
* coverage: ~<section begin=quran-coverage /> |
* coverage: ~<section begin=quran-coverage />96.71<section end=quran-coverage />% |
||
* as of: |
* as of: r65751 |
||
UDHR |
UDHR |
||
* words: <section begin=udhr-words />1.5K<section end=udhr-words /> |
* words: <section begin=udhr-words />1.5K<section end=udhr-words /> |
||
* coverage: ~<section begin=udhr-coverage /> |
* coverage: ~<section begin=udhr-coverage />96.86<section end=udhr-coverage />% |
||
* as of: |
* as of: r65817 |
||
* wikipage: <section begin=udhr-wikipage />UDHR<section end=udhr-wikipage /> |
* wikipage: <section begin=udhr-wikipage />UDHR<section end=udhr-wikipage /> |
||
[[Category:Datastats]] |
[[Category:Datastats]] |
Latest revision as of 02:06, 28 February 2020
The language[edit]
- native name: қазақ тілі
- families: Turkic languages
- areas: Languages of Central Asia, Languages of the former Soviet Union
In Apertium[edit]
Over-all stats[edit]
- average: ~94.5%
- stems: 36,595 as of 51fc4e by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
- rlx rules: 150 as of aab480 by jonorthwash ~ StemCounterBot (talk) 06:22, 3 June 2019 (CEST), run by firespeaker
- vanilla stems: 27,433 as of 51fc4e by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
- rules: 150 as of aab480 by jonorthwash ~ StemCounterBot (talk) 03:06, 28 February 2020 (CET), run by scoopgracie
Corpora[edit]
Әуезов
- words: 155K
- coverage: ~92.89%
- as of: r65751
- wikipage: Әуезов corpus
bible
- words: 577K
- coverage: ~95.29%
- as of: r65751
azattyq2010
- words: 3.2M
- coverage: ~95.07%
- as of: r65817
- wikipage: RFERL_corpora
wp2011
- words: 850K
- coverage: ~90.72%
- as of: r65751
wp2013
- words: 18.2M
- coverage: ~90.10%
- as of: r65751
quran
- words: 107K
- coverage: ~96.71%
- as of: r65751
UDHR
- words: 1.5K
- coverage: ~96.86%
- as of: r65817
- wikipage: UDHR