Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
Firespeaker (talk | contribs)  (→bidix:  r45628)  | 
				Firespeaker (talk | contribs)   (→Trimmed coverage:  first set of r45628 numbers)  | 
				||
| Line 44: | Line 44: | ||
* words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
||
* trimmed coverage: ~<section begin=Әуезов-coverage />  | 
  * trimmed coverage: ~<section begin=Әуезов-coverage />60.0<section end=Әуезов-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
* wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
  * wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
||
| Line 51: | Line 51: | ||
* words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
||
* trimmed coverage: ~<section begin=bible-coverage />  | 
  * trimmed coverage: ~<section begin=bible-coverage />62.0<section end=bible-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
azattyq2010  | 
  azattyq2010  | 
||
| Line 64: | Line 64: | ||
* words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
  * words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
||
* trimmed coverage: ~<section begin=wp2011-coverage />  | 
  * trimmed coverage: ~<section begin=wp2011-coverage />46.6<section end=wp2011-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
quran  | 
  quran  | 
||
* words: <section begin=quran-words />107K<section end=quran-words />  | 
  * words: <section begin=quran-words />107K<section end=quran-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
||
* trimmed coverage: ~<section begin=quran-coverage />  | 
  * trimmed coverage: ~<section begin=quran-coverage />79.3<section end=quran-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
=== kir ===  | 
  === kir ===  | 
||
| Line 89: | Line 89: | ||
* words: <section begin=bible-words />174K<section end=bible-words />  | 
  * words: <section begin=bible-words />174K<section end=bible-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
||
* trimmer coverage: ~<section begin=kirbible-coverage />  | 
  * trimmer coverage: ~<section begin=kirbible-coverage />60.7<section end=kirbible-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
wp2011/04  | 
  wp2011/04  | 
||
* words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
  * words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
||
* trimmer coverage: ~<section begin=wp2011/04-coverage />  | 
  * trimmer coverage: ~<section begin=wp2011/04-coverage />47.5<section end=wp2011/04-coverage />%  | 
||
* as of:   | 
  * as of: r45628  | 
||
== WER ==  | 
  == WER ==  | 
||
Revision as of 19:51, 8 July 2013
Contents
Over-all stats
apertium-kaz
- trimmed stems as of: r44710
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~94.5% | ~59% | 
| stems | 36,595 | 711 | 
apertium-kir
- trimmed stems as of: r44710
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~90.4% | ~51.8% | 
| stems | 14,424 | 670 | 
bidix
- stems: 508
 - as of: r45628
 
Trimmed coverage
kaz
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~60.0%
 - as of: r45628
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~62.0%
 - as of: r45628
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~46.9%
 - as of: r44710
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~46.6%
 - as of: r45628
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~79.3%
 - as of: r45628
 
kir
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmer coverage: ~49.4%
 - as of: r44810
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmer coverage: ~48.2%
 - as of: r44810
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmer coverage: ~60.7%
 - as of: r45628
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmer coverage: ~47.5%
 - as of: r45628
 
WER
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training