Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
Firespeaker (talk | contribs)  (→Corpora:  r44810 kir-kaz trimmed coverage)  | 
				Firespeaker (talk | contribs)   | 
				||
| Line 74: | Line 74: | ||
=== kir ===  | 
  === kir ===  | 
||
| ⚫ | |||
azattyk2010  | 
  azattyk2010  | 
||
* words: <section begin=azattyk2010-words />3.4M<section end=azattyk2010-words />  | 
  * words: <section begin=azattyk2010-words />3.4M<section end=azattyk2010-words />  | 
||
| Line 98: | Line 97: | ||
* trimmer coverage: ~<section begin=wp2011/04-coverage />45.3<section end=wp2011/04-coverage />%  | 
  * trimmer coverage: ~<section begin=wp2011/04-coverage />45.3<section end=wp2011/04-coverage />%  | 
||
* as of: r44810  | 
  * as of: r44810  | 
||
| ⚫ | |||
trt_2013-04-05  | 
|||
* words: 245  | 
|||
* WER: 6.12%  | 
|||
* PER: 4.90%  | 
|||
* as of: r44975  | 
|||
* type: training  | 
|||
story  | 
|||
* words: 351  | 
|||
* WER: 4.84%  | 
|||
* PER: 3.99%  | 
|||
* as of: r44975  | 
|||
* type: training  | 
|||
Revision as of 06:14, 9 June 2013
Contents
Over-all stats
apertium-kaz
- trimmed stems as of: r44710
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~94.5% | ~53.8% | 
| stems | 36,595 | 711 | 
apertium-kir
- trimmed stems as of: r44710
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~90.4% | ~49.8% | 
| stems | 14,424 | 670 | 
bidix
- stems: 408
 - as of: r44710
 
Trimmed coverage
kaz
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~58.0%
 - as of: r44710
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~56.2%
 - as of: r44710
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~46.9%
 - as of: r44710
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~43.9%
 - as of: r44710
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~64.0%
 - as of: r44710
 
kir
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmer coverage: ~49.4%
 - as of: r44810
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmer coverage: ~48.2%
 - as of: r44810
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmer coverage: ~57.0%
 - as of: r44810
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmer coverage: ~45.3%
 - as of: r44810
 
WER
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training