Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
Older revision: Firespeaker (→kaz: r45866)
Newer revision: Firespeaker (→kir: r45866 coverage)
Line 77:
  * words: <section begin=azattyk2010-words />3.4M<section end=azattyk2010-words />
  * regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2010-coverage}}%
- * trimmer coverage: ~<section begin=azattyk2010-coverage />
+ * trimmer coverage: ~<section begin=azattyk2010-coverage />64.0<section end=azattyk2010-coverage />%
- * as of:
+ * as of: r45866
  azattyk2009
  * words: <section begin=azattyk2009-words />4.1M<section end=azattyk2009-words />
  * regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2009-coverage}}%
- * trimmer coverage: ~<section begin=azattyk2009-coverage />
+ * trimmer coverage: ~<section begin=azattyk2009-coverage />63.5<section end=azattyk2009-coverage />%
- * as of:
+ * as of: r45866
  bible
  * words: <section begin=bible-words />174K<section end=bible-words />
  * regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%
- * trimmer coverage: ~<section begin=kirbible-coverage />
+ * trimmer coverage: ~<section begin=kirbible-coverage />65.6<section end=kirbible-coverage />%
- * as of:
+ * as of: r45866
  wp2011/04
  * words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />
  * regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%
- * trimmer coverage: ~<section begin=wp2011/04-coverage />
+ * trimmer coverage: ~<section begin=wp2011/04-coverage />58.4<section end=wp2011/04-coverage />%
- * as of:
+ * as of: r45866
  == WER ==
Revision as of 22:07, 22 July 2013
Overall stats
apertium-kaz
- trimmed stems as of: r44710
 - stats:
 
| | full | trimmed |
|---|---|---|
| coverage | ~94.5% | ~69.3% |
| stems | 36,595 | 711 |
apertium-kir
- trimmed stems as of: r44710
 - stats:
 
| | full | trimmed |
|---|---|---|
| coverage | ~90.4% | ~63.8% |
| stems | 14,424 | 670 |
bidix
- stems: 508
 - as of: r45628
 
Trimmed coverage
kaz
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~66.3%
 - as of: r45866
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~69.4%
 - as of: r45866
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~67.3%
 - as of: r45866
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~61.9%
 - as of: r45866
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~81.5%
 - as of: r45866
 
kir
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmed coverage: ~64.0%
 - as of: r45866
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmed coverage: ~63.5%
 - as of: r45866
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmed coverage: ~65.6%
 - as of: r45866
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmed coverage: ~58.4%
 - as of: r45866
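
The coverage figures above are presumably naive coverage: the share of corpus tokens that receive at least one analysis from the (full or trimmed) analyser. This page does not say how the numbers were produced, so the following is only a minimal sketch under that assumption. It reads analyser output in Apertium stream format, where each lexical unit looks like `^surface/analysis$` and an unknown word is marked with a leading `*`. The function name `naive_coverage` and the sample string are hypothetical, and escaped characters in the stream are ignored for simplicity.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Return the fraction of lexical units with at least one analysis."""
    total = 0
    known = 0
    for match in LEXICAL_UNIT.finditer(analysed_text):
        total += 1
        # Unknown words carry a leading '*' in the analysis slot.
        if not match.group(2).startswith("*"):
            known += 1
    return known / total if total else 0.0


# Hypothetical analyser output: three known units and one unknown one.
sample = "^бала/бала<n><nom>$ ^xyz/*xyz$ ^кел/кел<v><iv><imp><p2><sg>$ ^./.<sent>$"
print(f"{naive_coverage(sample):.1%}")  # prints 75.0%
```

Running the same count once on the output of the full analyser and once on the output of the trimmed analyser would give a "regular" and a "trimmed" figure for a corpus, if the assumption above holds.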
 
WER
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training
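
For readers unfamiliar with the two metrics, the sketch below shows one common way to compute them. WER is taken as the word-level edit distance between the system output and a reference translation, divided by the reference length. PER is taken as the same ratio computed on bags of words, so word order is ignored. These are textbook definitions and an assumption here: the page does not state which tool or exact formula produced the figures above, and the function names and example sentences are hypothetical.

```python
from collections import Counter


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the reference prefix so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,                           # deletion
                curr[j - 1] + 1,                       # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = curr
    return prev[-1] / len(ref)


def position_independent_error_rate(reference: str, hypothesis: str) -> float:
    """Error rate on bags of words, ignoring word order."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Multiset intersection counts the words the two sides share.
    matches = sum((Counter(ref) & Counter(hyp)).values())
    return (max(len(ref), len(hyp)) - matches) / len(ref)


reference = "the cat sat on the mat"
hypothesis = "the cat on the mat sat"
print(f"WER: {word_error_rate(reference, hypothesis):.1%}")
print(f"PER: {position_independent_error_rate(reference, hypothesis):.1%}")
```

In this example the hypothesis contains the right words in a different order, so WER is non-zero while PER is zero. As a sense of scale under the same definition, 15 word-level edits against a 245-word reference would give 15/245 ≈ 6.12%; whether that is the actual edit count behind the trt_2013-04-05 figure is not stated on this page.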