Difference between revisions of "Apertium-kaz/stats"

(→‎Corpora: r54942)
(r55310 first round)
Line 1:
  == Over-all stats ==
- * stems: <section begin=stems />11,690<section end=stems />
+ * stems: <section begin=stems />12,619<section end=stems />
- * as of: r54942
+ * as of: r55310
  * average: ~<section begin=average />{{:Apertium-kaz/stats/average}}<section end=average />%

Line 7:
  Әуезов
  * words: <section begin=Әуезов-words />155K<section end=Әуезов-words />
- * coverage: ~<section begin=Әуезов-coverage />89.9<section end=Әуезов-coverage />%
+ * coverage: ~<section begin=Әуезов-coverage />90.2<section end=Әуезов-coverage />%
- * as of: r54942
+ * as of: r55310
  * wikipage: <section begin=Әуезов-wikipage />Әуезов corpus<section end=Әуезов-wikipage />

  bible
  * words: <section begin=bible-words />577K<section end=bible-words />
- * coverage: ~<section begin=bible-coverage />92.4<section end=bible-coverage />%
+ * coverage: ~<section begin=bible-coverage />92.7<section end=bible-coverage />%
- * as of: r54942
+ * as of: r55310

  azattyq2010

Line 24:
  wp2011
  * words: <section begin=wp2011-words />850K<section end=wp2011-words />
- * coverage: ~<section begin=wp2011-coverage />85.4<section end=wp2011-coverage />%
+ * coverage: ~<section begin=wp2011-coverage />86.0<section end=wp2011-coverage />%
- * as of: r54942
+ * as of: r55310

  wp2013

Line 34:
  quran
  * words: <section begin=quran-words />107K<section end=quran-words />
- * coverage: ~<section begin=quran-coverage />95.0<section end=quran-coverage />%
+ * coverage: ~<section begin=quran-coverage />95.2<section end=quran-coverage />%
- * as of: r54942
+ * as of: r55310

  UDHR
  * words: <section begin=udhr-words />1.5K<section end=udhr-words />
- * coverage: ~<section begin=udhr-coverage />91.9<section end=udhr-coverage />%
+ * coverage: ~<section begin=udhr-coverage />92.7<section end=udhr-coverage />%
- * as of: r54942
+ * as of: r55310
  * wikipage: <section begin=udhr-wikipage />UDHR<section end=udhr-wikipage />
Revision as of 22:16, 6 July 2014

Over-all stats

  • stems: 12,619
  • as of: r55310
  • average: ~91.5%

Corpora

Әуезов

  • words: 155K
  • coverage: ~90.2%
  • as of: r55310
  • wikipage: Әуезов corpus

bible

  • words: 577K
  • coverage: ~92.7%
  • as of: r55310

azattyq2010

  • words: 3.2M
  • coverage: ~92.4%
  • as of: r54942
  • wikipage: RFERL_corpora

wp2011

  • words: 850K
  • coverage: ~86.0%
  • as of: r55310

wp2013

  • words: 18.2M
  • coverage: ~85.9%
  • as of: r54942

quran

  • words: 107K
  • coverage: ~95.2%
  • as of: r55310

UDHR

  • words: 1.5K
  • coverage: ~92.7%
  • as of: r55310
  • wikipage: UDHR
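
The page does not say how the Apertium-kaz/stats/average template arrives at its figure. As a rough cross-check only, the sketch below takes the seven coverage values listed above and computes two candidate aggregates: an unweighted mean and a mean weighted by corpus size. Both formulas are assumptions, the word counts are the rounded figures from this page, and neither result needs to match the average shown in the over-all stats.

```python
# Figures copied from the corpus list above: (corpus, words, coverage in %).
# Word counts are the rounded values shown on the page (155K -> 155_000, etc.).
CORPORA = [
    ("Әуезов", 155_000, 90.2),
    ("bible", 577_000, 92.7),
    ("azattyq2010", 3_200_000, 92.4),
    ("wp2011", 850_000, 86.0),
    ("wp2013", 18_200_000, 85.9),
    ("quran", 107_000, 95.2),
    ("UDHR", 1_500, 92.7),
]


def simple_mean(corpora):
    """Unweighted mean: every corpus counts equally, whatever its size."""
    return sum(coverage for _, _, coverage in corpora) / len(corpora)


def weighted_mean(corpora):
    """Mean weighted by word count: large corpora dominate the result."""
    total_words = sum(words for _, words, _ in corpora)
    covered = sum(words * coverage for _, words, coverage in corpora)
    return covered / total_words


# Assumption: these are illustrative aggregates, not the wiki template's formula.
simple = simple_mean(CORPORA)
weighted = weighted_mean(CORPORA)

print(f"unweighted mean coverage:    {simple:.1f}%")
print(f"word-weighted mean coverage: {weighted:.1f}%")
```

The weighted variant is pulled strongly toward wp2013, which holds most of the words, so the two numbers can differ by several points. Which one is more useful depends on whether each corpus should count equally or in proportion to its size.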