Difference between revisions of "Apertium-kaz/stats"

Line 39:
  UDHR
  * words: <section begin=udhr-words />1.5K<section end=udhr-words />
- * coverage: ~<section begin=quran-coverage />91.2<section end=quran-coverage />%
+ * coverage: ~<section begin=udhr-coverage />91.2<section end=udhr-coverage />%
  * as of: r48275
  * wikipage: <section begin=udhr-wikipage />UDHR<section end=udhr-wikipage />

Revision as of 01:43, 3 November 2013

Overall stats

  • stems: 10,942
  • as of: r48112
  • average: ~90.8%
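
The page does not say how its coverage figures were measured. As a rough illustration only, the sketch below computes naive coverage from text in the Apertium stream format, where each token appears as `^surface/analyses$` and an unanalysed token carries a `*` in place of an analysis. The function name `coverage`, the sample string, and its tags are hypothetical, and escaped characters in the stream format are ignored for brevity.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis[/analysis...]$
# Simplification (assumption): escaped characters such as \^ or \$ are not handled.
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def coverage(analysed_text: str) -> float:
    """Return the percentage of tokens that received at least one analysis."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    # Unknown tokens are marked with a leading '*' in the analysis slot.
    known = sum(1 for _, analyses in units if not analyses.startswith("*"))
    return 100.0 * known / len(units)


if __name__ == "__main__":
    # Hypothetical sample; the tags are illustrative, not taken from apertium-kaz.
    sample = "^үй/үй<n><nom>$ ^xyzzy/*xyzzy$ ^мен/мен<prn><nom>$ ^qwerty/*qwerty$"
    print(f"coverage: {coverage(sample):.1f}%")
```

Input of this shape would normally come from running a corpus through the morphological analyser; the exact command used for the figures on this page is not stated here.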

Corpora

Әуезов

  • words: 155K
  • coverage: ~89.7%
  • as of: r48112
  • wikipage: Әуезов corpus

bible

  • words: 577K
  • coverage: ~92.2%
  • as of: r48112

azattyq2010

  • words: 3.2M
  • coverage: ~92.0%
  • as of: r48112
  • wikipage: RFERL_corpora

wp2011

  • words: 850K
  • coverage: ~84.5%
  • as of: r48112

wp2013

  • words: 18.2M
  • coverage: ~85.1%
  • as of: r48186

quran

  • words: 107K
  • coverage: ~94.7%
  • as of: r48112

UDHR

  • words: 1.5K
  • coverage: ~91.2%
  • as of: r48275
  • wikipage: UDHR
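
The page gives an overall average without saying how it is aggregated. The sketch below compares two plausible aggregations of the per-corpus figures listed above: a plain mean over corpora and a mean weighted by corpus size. Neither is claimed to be the method behind the page's average, the word counts are the rounded values shown on the page, and the corpora were measured at different revisions, so the results are only indicative.

```python
# Per-corpus figures copied from the list above: (approximate words, coverage %).
CORPORA = {
    "Әуезов": (155_000, 89.7),
    "bible": (577_000, 92.2),
    "azattyq2010": (3_200_000, 92.0),
    "wp2011": (850_000, 84.5),
    "wp2013": (18_200_000, 85.1),
    "quran": (107_000, 94.7),
    "UDHR": (1_500, 91.2),
}


def unweighted_mean(corpora: dict) -> float:
    """Plain mean of the coverage percentages, one vote per corpus."""
    values = [cov for _, cov in corpora.values()]
    return sum(values) / len(values)


def weighted_mean(corpora: dict) -> float:
    """Mean of the coverage percentages weighted by corpus word count."""
    total_words = sum(words for words, _ in corpora.values())
    return sum(words * cov for words, cov in corpora.values()) / total_words


if __name__ == "__main__":
    print(f"unweighted mean: {unweighted_mean(CORPORA):.1f}%")
    print(f"word-weighted mean: {weighted_mean(CORPORA):.1f}%")
```

The weighted figure is dominated by wp2013, which holds most of the words, so it sits closer to that corpus's coverage than the plain mean does.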