Difference between revisions of "Apertium-kaz-kir/stats"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
Firespeaker (talk | contribs)  | 
				Firespeaker (talk | contribs)   | 
				||
| (61 intermediate revisions by 4 users not shown) | |||
| Line 1: | Line 1: | ||
== Over-all stats ==  | 
  == Over-all stats ==  | 
||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kaz-kir.t1x kaz-kir t1x rules]''': <section begin=kaz-kir_t1x_rules />27<section end=kaz-kir_t1x_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/968b74bbe15435bc4a14609bc8d58ac1154bdbe4/apertium-kaz-kir.kaz-kir.t1x 968b74] by jonorthwash ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kaz-kir.t2x kaz-kir t2x rules]''': <section begin=kaz-kir_t2x_rules />6<section end=kaz-kir_t2x_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/9e3144c0be2d63b45b52783a3202c4ab48359aa3/apertium-kaz-kir.kaz-kir.t2x 9e3144] by jonathan.north.washington ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kir-kaz.t1x kir-kaz t1x rules]''': <section begin=kir-kaz_t1x_rules />3<section end=kir-kaz_t1x_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/5902573ef18a665ed4512d2ce62bdb21c6507921/apertium-kaz-kir.kir-kaz.t1x 590257] by francis.m..tyers ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kir-kaz.t2x kir-kaz t2x rules]''': <section begin=kir-kaz_t2x_rules />4<section end=kir-kaz_t2x_rules /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/5902573ef18a665ed4512d2ce62bdb21c6507921/apertium-kaz-kir.kir-kaz.t2x 590257] by francis.m..tyers ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kaz-kir.t1x kaz-kir t1x macros]''': <section begin=kaz-kir_t1x_macros />6<section end=kaz-kir_t1x_macros /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/968b74bbe15435bc4a14609bc8d58ac1154bdbe4/apertium-kaz-kir.kaz-kir.t1x 968b74] by jonorthwash ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kaz-kir.t2x kaz-kir t2x macros]''': <section begin=kaz-kir_t2x_macros />0<section end=kaz-kir_t2x_macros /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/9e3144c0be2d63b45b52783a3202c4ab48359aa3/apertium-kaz-kir.kaz-kir.t2x 9e3144] by jonathan.north.washington ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kir-kaz.t1x kir-kaz t1x macros]''': <section begin=kir-kaz_t1x_macros />3<section end=kir-kaz_t1x_macros /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/5902573ef18a665ed4512d2ce62bdb21c6507921/apertium-kaz-kir.kir-kaz.t1x 590257] by francis.m..tyers ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kir-kaz.t2x kir-kaz t2x macros]''': <section begin=kir-kaz_t2x_macros />0<section end=kir-kaz_t2x_macros /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/5902573ef18a665ed4512d2ce62bdb21c6507921/apertium-kaz-kir.kir-kaz.t2x 590257] by francis.m..tyers ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''[https://raw.githubusercontent.com/apertium/apertium-kaz-kir/master/apertium-kaz-kir.kaz-kir.dix kaz-kir dix stems]''': <section begin=kaz-kir_dix_stems />8,174<section end=kaz-kir_dix_stems /> as of [https://raw.githubusercontent.com/apertium/apertium-kaz-kir/968b74bbe15435bc4a14609bc8d58ac1154bdbe4/apertium-kaz-kir.kaz-kir.dix 968b74] by jonorthwash ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 18:23, 3 April 2021 (UTC), run by firespeaker  | 
|||
*'''kaz-kir stems''': <section begin=kaz-kir_stems />?<section end=kaz-kir_stems /> ~ [[User:Firespeaker|Firespeaker]] ([[User talk:Firespeaker|talk]]) 15:01, 6 February 2021 (UTC), run by firespeaker  | 
|||
=== apertium-kaz ===  | 
  === apertium-kaz ===  | 
||
* trimmed stems as of:   | 
  * trimmed stems as of: r46670  | 
||
* stats:  | 
  * stats:  | 
||
{|class="wikitable"  | 
  {|class="wikitable"  | 
||
| Line 14: | Line 25: | ||
! stems  | 
  ! stems  | 
||
| {{#lst:apertium-kaz/stats|stems}}  | 
  | {{#lst:apertium-kaz/stats|stems}}  | 
||
| <section begin=kaz-trimmedstems />  | 
  | <section begin=kaz-trimmedstems />6512<section end=kaz-trimmedstems />  | 
||
|}  | 
  |}  | 
||
=== apertium-kir ===  | 
  === apertium-kir ===  | 
||
* trimmed stems as of:   | 
  * trimmed stems as of: r46670  | 
||
* stats:  | 
  * stats:  | 
||
{|class="wikitable"  | 
  {|class="wikitable"  | 
||
| Line 32: | Line 43: | ||
! stems  | 
  ! stems  | 
||
| {{#lst:apertium-kir/stats|stems}}  | 
  | {{#lst:apertium-kir/stats|stems}}  | 
||
| <section begin=kir-trimmedstems />  | 
  | <section begin=kir-trimmedstems />6532<section end=kir-trimmedstems />  | 
||
|}  | 
  |}  | 
||
=== bidix ===  | 
  === bidix ===  | 
||
| ⚫ | |||
| ⚫ | |||
| ⚫ | |||
== Trimmed coverage ==  | 
  == Trimmed coverage ==  | 
||
| Line 44: | Line 54: | ||
* words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|Әуезов-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|Әуезов-coverage}}%  | 
||
* trimmed coverage: ~<section begin=Әуезов-coverage />  | 
  * trimmed coverage: ~<section begin=Әуезов-coverage />85.51<section end=Әуезов-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
* wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
  * wikipage: {{#lst:Apertium-kaz/stats|Әуезов-wikipage}}  | 
||
| Line 51: | Line 61: | ||
* words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
  * words: {{#lst:Apertium-kaz/stats|bible-words}}  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|bible-coverage}}%  | 
||
* trimmed coverage: ~<section begin=bible-coverage />  | 
  * trimmed coverage: ~<section begin=bible-coverage />88.9<section end=bible-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
azattyq2010  | 
  azattyq2010  | 
||
* words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words />  | 
  * words: <section begin=azattyq2010-words />3.2M<section end=azattyq2010-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|azattyq2010-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|azattyq2010-coverage}}%  | 
||
* trimmed coverage: ~<section begin=azattyq2010-coverage />  | 
  * trimmed coverage: ~<section begin=azattyq2010-coverage />89.52<section end=azattyq2010-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
* wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage />  | 
  * wikipage: <section begin=azattyq2010-wikipage />RFERL_corpora<section end=azattyq2010-wikipage />  | 
||
azattyq2012  | 
|||
* words: <section begin=azattyq2012-words />3.9M<section end=azattyq2012-words />  | 
|||
* regular coverage: ~{{#lst:Apertium-kaz/stats|azattyq2012-coverage}}%  | 
|||
* trimmed coverage: ~<section begin=azattyq2012-coverage />89.20<section end=azattyq2012-coverage />%  | 
|||
* as of: r70278  | 
|||
* wikipage: <section begin=azattyq2012-wikipage />RFERL_corpora<section end=azattyq2012-wikipage />  | 
|||
wp2011  | 
  wp2011  | 
||
* words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
  * words: <section begin=wp2011-words />850K<section end=wp2011-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|wp2011-coverage}}%  | 
||
* trimmed coverage: ~<section begin=wp2011-coverage />  | 
  * trimmed coverage: ~<section begin=wp2011-coverage />80.1<section end=wp2011-coverage />%  | 
||
* as of:   | 
  * as of: r47474  | 
||
wp2013  | 
|||
* words: <section begin=wp2013-words />26.2M<section end=wp2013-words />  | 
|||
* regular coverage: ~{{#lst:Apertium-kaz/stats|wp2013-coverage}}%  | 
|||
* trimmed coverage: ~<section begin=wp2013-coverage />82.94<section end=wp2013-coverage />%  | 
|||
* as of: r70278  | 
|||
wp100K  | 
|||
| ⚫ | |||
* regular coverage: ~{{#lst:Apertium-kaz/stats|wp100K-coverage}}%  | 
|||
* trimmed coverage: ~<section begin=wp100K-coverage />82.61<section end=wp100K-coverage />%  | 
|||
* as of: r70274  | 
|||
quran  | 
  quran  | 
||
* words: <section begin=quran-words />107K<section end=quran-words />  | 
  * words: <section begin=quran-words />107K<section end=quran-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kaz/stats|quran-coverage}}%  | 
||
* trimmed coverage: ~<section begin=quran-coverage />  | 
  * trimmed coverage: ~<section begin=quran-coverage />92.49<section end=quran-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
=== kir ===  | 
  === kir ===  | 
||
| Line 77: | Line 106: | ||
* words: <section begin=azattyk2010-words />3.4M<section end=azattyk2010-words />  | 
  * words: <section begin=azattyk2010-words />3.4M<section end=azattyk2010-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2010-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2010-coverage}}%  | 
||
* trimmer coverage: ~<section begin=azattyk2010-coverage />  | 
  * trimmer coverage: ~<section begin=azattyk2010-coverage />89.2<section end=azattyk2010-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
azattyk2009  | 
  azattyk2009  | 
||
* words: <section begin=azattyk2009-words />4.1M<section end=azattyk2009-words />  | 
  * words: <section begin=azattyk2009-words />4.1M<section end=azattyk2009-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2009-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|azattyk2009-coverage}}%  | 
||
* trimmer coverage: ~<section begin=azattyk2009-coverage />  | 
  * trimmer coverage: ~<section begin=azattyk2009-coverage />86.25<section end=azattyk2009-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
bible  | 
  bible  | 
||
* words: <section begin=bible-words />174K<section end=bible-words />  | 
  * words: <section begin=bible-words />174K<section end=bible-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|bible-coverage}}%  | 
||
* trimmer coverage: ~<section begin=kirbible-coverage />  | 
  * trimmer coverage: ~<section begin=kirbible-coverage />86.01<section end=kirbible-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
wp2011/04  | 
  wp2011/04  | 
||
* words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
  * words: <section begin=wp2011/04-words />545K<section end=wp2011/04-words />  | 
||
* regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
  * regular coverage: ~{{#lst:Apertium-kir/stats|wp2011/04-coverage}}%  | 
||
* trimmer coverage: ~<section begin=wp2011/04-coverage />  | 
  * trimmer coverage: ~<section begin=wp2011/04-coverage />78.92<section end=wp2011/04-coverage />%  | 
||
* as of:   | 
  * as of: r70278  | 
||
== WER ==  | 
  == WER ==  | 
||
=== kaz→kir ===  | 
|||
trt_2013-04-05  | 
  trt_2013-04-05  | 
||
| Line 113: | Line 144: | ||
* as of: r44975  | 
  * as of: r44975  | 
||
* type: training  | 
  * type: training  | 
||
[[Category:Datastats]]  | 
|||
Latest revision as of 18:23, 3 April 2021
Contents
Over-all stats[edit]
- kaz-kir t1x rules: 27 as of 968b74 by jonorthwash ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kaz-kir t2x rules: 6 as of 9e3144 by jonathan.north.washington ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kir-kaz t1x rules: 3 as of 590257 by francis.m..tyers ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kir-kaz t2x rules: 4 as of 590257 by francis.m..tyers ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kaz-kir t1x macros: 6 as of 968b74 by jonorthwash ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kaz-kir t2x macros: 0 as of 9e3144 by jonathan.north.washington ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kir-kaz t1x macros: 3 as of 590257 by francis.m..tyers ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kir-kaz t2x macros: 0 as of 590257 by francis.m..tyers ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kaz-kir dix stems: 8,174 as of 968b74 by jonorthwash ~ Firespeaker (talk) 18:23, 3 April 2021 (UTC), run by firespeaker
 - kaz-kir stems: ? ~ Firespeaker (talk) 15:01, 6 February 2021 (UTC), run by firespeaker
 
apertium-kaz[edit]
- trimmed stems as of: r46670
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~94.5% | ~87.3% | 
| stems | 36,595 | 6512 | 
apertium-kir[edit]
- trimmed stems as of: r46670
 - stats:
 
| full | trimmed | |
|---|---|---|
| coverage | ~90.4% | ~85.8% | 
| stems | 14,424 | 6532 | 
bidix[edit]
- as of: r49573
 
Trimmed coverage[edit]
kaz[edit]
Әуезов
- words: 155K
 - regular coverage: ~92.89%
 - trimmed coverage: ~85.51%
 - as of: r70278
 - wikipage: Әуезов corpus
 
bible
- words: 577K
 - regular coverage: ~95.29%
 - trimmed coverage: ~88.9%
 - as of: r70278
 
azattyq2010
- words: 3.2M
 - regular coverage: ~95.07%
 - trimmed coverage: ~89.52%
 - as of: r70278
 - wikipage: RFERL_corpora
 
azattyq2012
- words: 3.9M
 - regular coverage: ~%
 - trimmed coverage: ~89.20%
 - as of: r70278
 - wikipage: RFERL_corpora
 
wp2011
- words: 850K
 - regular coverage: ~90.72%
 - trimmed coverage: ~80.1%
 - as of: r47474
 
wp2013
- words: 26.2M
 - regular coverage: ~90.10%
 - trimmed coverage: ~82.94%
 - as of: r70278
 
wp100K
- words: 1.7M
 - regular coverage: ~%
 - trimmed coverage: ~82.61%
 - as of: r70274
 
quran
- words: 107K
 - regular coverage: ~96.71%
 - trimmed coverage: ~92.49%
 - as of: r70278
 
kir[edit]
azattyk2010
- words: 3.4M
 - regular coverage: ~92.11%
 - trimmer coverage: ~89.2%
 - as of: r70278
 
azattyk2009
- words: 4.1M
 - regular coverage: ~92.04%
 - trimmer coverage: ~86.25%
 - as of: r70278
 
bible
- words: 174K
 - regular coverage: ~92.25%
 - trimmer coverage: ~86.01%
 - as of: r70278
 
wp2011/04
- words: 545K
 - regular coverage: ~85.37%
 - trimmer coverage: ~78.92%
 - as of: r70278
 
WER[edit]
kaz→kir[edit]
trt_2013-04-05
- words: 245
 - WER: 6.12%
 - PER: 4.90%
 - as of: r44975
 - type: training
 
story
- words: 351
 - WER: 4.84%
 - PER: 3.99%
 - as of: r44975
 - type: training