Difference between revisions of "Apertium-kaz/stats"

From Apertium
 
(38 intermediate revisions by 4 users not shown)
Line 1:
== Over-all stats ==
== The language ==
* '''native name''': <section begin=nativename />қазақ тілі<section end=nativename />
* '''average''': ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />%
* '''families''': <section begin=families />[[Turkic languages]]<section end=families />
* '''areas''': <section begin=areas />[[Languages of Central Asia]], [[Languages of the former Soviet Union]]<section end=areas />

== In Apertium ==
* '''authors''': <section begin=authors />[[User:Ilnar.salimzyan|Ilnar]], [[User:Firespeaker|Jonathan]], [[User:Francis Tyers|Fran]], Aida, [[User:Beknazar|Beknazar]], [[User:nathan0n5ire|Nathan]]<section end=authors />
* '''location''': <section begin=location />[[apertium-kaz]]&nbsp;([[languages]])<section end=location />

*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc stems]''': <section begin=stems />18,264<section end=stems /> as of r58005 by selimcan ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 04:07, 14 December 2014 (CET), run by sushain

*'''[https://svn.code.sf.net/p/apertium/svn/languages/apertium-kaz/apertium-kaz.kaz.lexc vanilla stems]''': <section begin=vanilla_stems />11,743<section end=vanilla_stems /> as of r58005 by selimcan ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 04:07, 14 December 2014 (CET), run by sushain
== Over-all stats ==
* '''average''': ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />%
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.lexc stems]''': <section begin=stems />36,595<section end=stems /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/51fc4e532eb61e589a796116ab973069f26b7145/apertium-kaz.kaz.lexc 51fc4e] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.rlx rlx rules]''': <section begin=rlx_rules />150<section end=rlx_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/aab4808acc1dd3625747d2722bf78fee959afd83/apertium-kaz.kaz.rlx aab480] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 06:22, 3 June 2019 (CEST), run by firespeaker
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.lexc vanilla stems]''': <section begin=vanilla_stems />27,433<section end=vanilla_stems /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/51fc4e532eb61e589a796116ab973069f26b7145/apertium-kaz.kaz.lexc 51fc4e] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz/master/apertium-kaz.kaz.rlx rules]''': <section begin=rules />150<section end=rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz/aab4808acc1dd3625747d2722bf78fee959afd83/apertium-kaz.kaz.rlx aab480] by jonorthwash ~ [[User:StemCounterBot|StemCounterBot]] ([[User talk:StemCounterBot|talk]]) 03:06, 28 February 2020 (CET), run by scoopgracie


== Corpora ==
Әуезов
* words: <section begin=Әуезов-words />155K<section end=Әуезов-words />
* coverage: ~<section begin=Әуезов-coverage />90.62<section end=Әуезов-coverage />%
* coverage: ~<section begin=Әуезов-coverage />92.89<section end=Әуезов-coverage />%
* as of: r55794
* as of: r65751
* wikipage: <section begin=Әуезов-wikipage />Әуезов corpus<section end=Әуезов-wikipage />


bible
* words: <section begin=bible-words />577K<section end=bible-words />
* coverage: ~<section begin=bible-coverage />93.02<section end=bible-coverage />%
* coverage: ~<section begin=bible-coverage />95.29<section end=bible-coverage />%
* as of: r55794
* as of: r65751


azattyq2010
* words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words />
* coverage: ~<section begin=azattyq2010-coverage />93.20<section end=azattyq2010-coverage />%
* coverage: ~<section begin=azattyq2010-coverage />95.07<section end=azattyq2010-coverage />%
* as of: r55794
* as of: r65817
* wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage />


wp2011
* words: <section begin=wp2011-words />850K<section end=wp2011-words />
* coverage: ~<section begin=wp2011-coverage />86.93<section end=wp2011-coverage />%
* coverage: ~<section begin=wp2011-coverage />90.72<section end=wp2011-coverage />%
* as of: r55794
* as of: r65751


wp2013
* words: <section begin=wp2013-words />18.2M<section end=wp2013-words />
* coverage: ~<section begin=wp2013-coverage />87.11<section end=wp2013-coverage />%
* coverage: ~<section begin=wp2013-coverage />90.10<section end=wp2013-coverage />%
* as of: r55794
* as of: r65751


quran
* words: <section begin=quran-words />107K<section end=quran-words />
* coverage: ~<section begin=quran-coverage />95.40<section end=quran-coverage />%
* coverage: ~<section begin=quran-coverage />96.71<section end=quran-coverage />%
* as of: r55794
* as of: r65751


UDHR
* words: <section begin=udhr-words />1.5K<section end=udhr-words />
* coverage: ~<section begin=udhr-coverage />93.07<section end=udhr-coverage />%
* coverage: ~<section begin=udhr-coverage />96.86<section end=udhr-coverage />%
* as of: r55794
* as of: r65817
* wikipage: <section begin=udhr-wikipage />UDHR<section end=udhr-wikipage />
[[Category:Datastats]]

Latest revision as of 02:06, 28 February 2020

The language

In Apertium


Over-all stats

Corpora

Әуезов

  • words: 155K
  • coverage: ~92.89%
  • as of: r65751
  • wikipage: Әуезов corpus

bible

  • words: 577K
  • coverage: ~95.29%
  • as of: r65751

azattyq2010

  • words: 3.2M
  • coverage: ~95.07%
  • as of: r65817
  • wikipage: RFERL_corpora

wp2011

  • words: 850K
  • coverage: ~90.72%
  • as of: r65751

wp2013

  • words: 18.2M
  • coverage: ~90.10%
  • as of: r65751

quran

  • words: 107K
  • coverage: ~96.71%
  • as of: r65751

UDHR

  • words: 1.5K
  • coverage: ~96.86%
  • as of: r65817
  • wikipage: UDHR