Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
Firespeaker (talk | contribs)  (→Over-all stats:  number of stems)  | 
				Firespeaker (talk | contribs)   (→Trimmed coverage:  r46152 numbers, all but azattyk)  | 
				||
| Line 44: | Line 44: | ||
* words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
||
* trimmed coverage: ~<section begin=Әуезов-coverage />  | 
  * trimmed coverage: ~<section begin=Әуезов-coverage />71.6<section end=Әуезов-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
* wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
  * wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
||
| Line 51: | Line 51: | ||
* words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
||
* trimmed coverage: ~<section begin=bible-coverage />  | 
  * trimmed coverage: ~<section begin=bible-coverage />72.6<section end=bible-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
azattyq2010  | 
  azattyq2010  | 
||
* words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words />  | 
  * words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|azattyq2010-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|azattyq2010-coverage}}%  | 
||
* trimmed coverage: ~<section begin=azattyq2010-coverage />  | 
  * trimmed coverage: ~<section begin=azattyq2010-coverage />70.4<section end=azattyq2010-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
* wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage />  | 
  * wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage />  | 
||
| Line 64: | Line 64: | ||
* words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
  * words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
||
* trimmed coverage: ~<section begin=wp2011-coverage />  | 
  * trimmed coverage: ~<section begin=wp2011-coverage />64.0<section end=wp2011-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
quran  | 
  quran  | 
||
* words: <section begin=quran-words />107K<section end=quran-words />  | 
  * words: <section begin=quran-words />107K<section end=quran-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
||
* trimmed coverage: ~<section begin=quran-coverage />81.  | 
  * trimmed coverage: ~<section begin=quran-coverage />81.4<section end=quran-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
=== kir ===  | 
  === kir ===  | 
||
| Line 89: | Line 89: | ||
* words: <section begin=bible-words />174K<section end=bible-words />  | 
  * words: <section begin=bible-words />174K<section end=bible-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
||
* trimmer coverage: ~<section begin=kirbible-coverage />  | 
  * trimmer coverage: ~<section begin=kirbible-coverage />68.5<section end=kirbible-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
wp2011/04  | 
  wp2011/04  | 
||
* words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
  * words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
||
* trimmer coverage: ~<section begin=wp2011/04-coverage />  | 
  * trimmer coverage: ~<section begin=wp2011/04-coverage />61.3<section end=wp2011/04-coverage />%  | 
||
* as of:   | 
  * as of: r46152  | 
||
== WER ==  | 
  == WER ==  | 
||
Revision as of 18:03, 1 August 2013
Over-all stats
apertium-kaz
- trimmed stems as of: r46152
 - stats:
 
| | full | trimmed |
|---|---|---|
| coverage | ~94.5% | ~72% |
| stems | 36,595 | 5656 |
apertium-kir
- trimmed stems as of: r46152
 - stats:
 
| | full | trimmed |
|---|---|---|
| coverage | ~90.4% | ~65.4% |
| stems | 14,424 | 5801 |
bidix
- stems: 5552
 - as of: r46152
 
Trimmed coverage
kaz
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~71.6%
 - as of: r46152
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~72.6%
 - as of: r46152
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~70.4%
 - as of: r46152
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~64.0%
 - as of: r46152
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~81.4%
 - as of: r46152
 
kir
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmed coverage: ~64.0%
 - as of: r45866
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmed coverage: ~63.5%
 - as of: r45866
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmed coverage: ~68.5%
 - as of: r46152
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmed coverage: ~61.3%
 - as of: r46152
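
The page does not say how the coverage figures were produced. As a rough illustration only, naive coverage can be read as the share of tokens that receive at least one analysis. The sketch below assumes analyser output in the Apertium stream format, where each lexical unit looks like `^surface/analysis$` and an unknown form is marked with a leading `*`. The function name `coverage` and the sample string are hypothetical, and the sample is not real analyser output for either language.

```python
import re

# One lexical unit in stream format: ^surface/analysis1/analysis2$
# (simplified: escaped characters inside units are not handled)
LEXICAL_UNIT = re.compile(r"\^([^/^$]+)/([^$]*)\$")

def coverage(analysed_text):
    """Return the percentage of tokens that have at least one analysis."""
    total = 0
    known = 0
    for _surface, analyses in LEXICAL_UNIT.findall(analysed_text):
        total += 1
        # Unknown forms are assumed to be marked with a leading asterisk.
        if not analyses.startswith("*"):
            known += 1
    return 100.0 * known / total if total else 0.0

# Made-up sample: two analysed tokens, two unknown tokens.
sample = "^house/house<n><sg>$ ^xyzzy/*xyzzy$ ^go/go<vblex><inf>$ ^qq/*qq$"
print(f"{coverage(sample):.1f}%")  # prints "50.0%"
```

Under this reading, the regular and trimmed figures for a corpus would differ only in which analyser the text was passed through: the full monolingual one, or the one trimmed to the stems present in the bidix.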
 
WER
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training
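
For readers unfamiliar with the two measures, here is a minimal sketch of how WER and PER are commonly defined. It is not the script used to produce the figures above, and the function names are hypothetical. WER is taken as the word-level edit distance between the reference and the system output, divided by the reference length. PER ignores word order and compares the two texts as bags of words.

```python
from collections import Counter

def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # word missing from hyp
                            curr[j - 1] + 1,      # extra word in hyp
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def wer(ref, hyp):
    """Word error rate in percent, relative to the reference length."""
    return 100.0 * edit_distance(ref, hyp) / len(ref)

def per(ref, hyp):
    """Position-independent error rate in percent (word order ignored)."""
    matches = sum((Counter(ref) & Counter(hyp)).values())
    return 100.0 * (max(len(ref), len(hyp)) - matches) / len(ref)

# Toy example: two words swapped. WER counts two errors, PER counts none.
ref = "a b c d".split()
hyp = "a c b d".split()
print(wer(ref, hyp), per(ref, hyp))  # prints "50.0 0.0"
```

Read this way, the trt_2013-04-05 figures are consistent with 15 word-level edits (15/245 ≈ 6.12%) and 12 position-independent errors (12/245 ≈ 4.90%). PER can never exceed WER under these definitions, which matches both entries above.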