Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
Firespeaker (talk | contribs)  (→kaz:  r46670 most)  | 
				Firespeaker (talk | contribs)   (→kir:  some r46670)  | 
				||
| Line 89: | Line 89: | ||
* words: <section begin=bible-words />174K<section end=bible-words />  | 
  * words: <section begin=bible-words />174K<section end=bible-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
||
* trimmer coverage: ~<section begin=kirbible-coverage />  | 
  * trimmer coverage: ~<section begin=kirbible-coverage />75.5<section end=kirbible-coverage />%  | 
||
* as of:   | 
  * as of: r46670  | 
||
wp2011/04  | 
  wp2011/04  | 
||
* words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
  * words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
||
* trimmer coverage: ~<section begin=wp2011/04-coverage />  | 
  * trimmer coverage: ~<section begin=wp2011/04-coverage />68.1<section end=wp2011/04-coverage />%  | 
||
* as of:   | 
  * as of: r46670  | 
||
== WER ==  | 
  == WER ==  | 
||
Revision as of 03:22, 21 August 2013
Contents
Over-all stats
apertium-kaz
- trimmed stems as of: r46670
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~94.5% | ~77.8% | 
| stems | 36,595 | 6512 | 
apertium-kir
- trimmed stems as of: r46670
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~90.4% | ~70.6% | 
| stems | 14,424 | 6532 | 
bidix
- stems: 6493
 - as of: r46670
 
Trimmed coverage
kaz
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~78.6%
 - as of: r46670
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~80.1%
 - as of: r46670
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~70.4%
 - as of: r46152
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~72.5%
 - as of: r46670
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~87.2%
 - as of: r46670
 
kir
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmer coverage: ~67.3%
 - as of: r46154
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmer coverage: ~66.8%
 - as of: r46154
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmer coverage: ~75.5%
 - as of: r46670
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmer coverage: ~68.1%
 - as of: r46670
 
WER
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training