Difference between revisions of "Apertium-kaz/stats"

From Apertium
Jump to navigation Jump to search
(oops)
Line 1: Line 1:
== The language ==
== The language ==
* '''native name''': <section begin=nativename />қазақ тілі<section end=nativename />
* '''native name''': <section begin=nativename />қазақ тілі<section end=nativename />
* '''families''': <section begin=families />[[Turkic languages]]<section end=location />
* '''families''': <section begin=families />[[Turkic languages]]<section end=families />
* '''areas''': <section begin=families />[[Languages of Central Asia]], [[Languages of the former Soviet Union]]<section end=location />
* '''areas''': <section begin=areas />[[Languages of Central Asia]], [[Languages of the former Soviet Union]]<section end=areas />





Revision as of 15:38, 11 April 2015

The language


Over-all stats


Corpora

Әуезов

  • words: 155K
  • coverage: ~90.62%
  • as of: r55794
  • wikipage: Әуезов corpus

bible

  • words: 577K
  • coverage: ~93.02%
  • as of: r55794

azattyq2010

  • words: 3.2M
  • coverage: ~93.20%
  • as of: r55794
  • wikipage: RFERL_corpora

wp2011

  • words: 850K
  • coverage: ~86.93%
  • as of: r55794

wp2013

  • words: 18.2M
  • coverage: ~87.11%
  • as of: r55794

quran

  • words: 107K
  • coverage: ~95.40%
  • as of: r55794

UDHR

  • words: 1.5K
  • coverage: ~93.07%
  • as of: r55794
  • wikipage: UDHR