Difference between revisions of "Apertium-tat"
Line 5: | Line 5: | ||
== Current State == |
== Current State == |
||
+ | {{LangStats | lang = tat | corpus1 = quran | corpus2 = NewTestament | corpus3 = aytmatov | corpus4 = wp2011 | corpus5 = tatnews2005/11}} |
||
− | * Number of stems: {{#lst:Apertium-tat/stats|stems}} |
||
− | * Coverage: ~{{:Apertium-tat/stats/average}}% |
||
− | |||
− | {| class="wikitable" |
||
− | |- |
||
− | ! corpus !! words !! coverage |
||
− | |- |
||
− | |quran |
||
− | |align="right"| {{#lst:Apertium-tat/stats|quran-words}} |
||
− | | ~{{#lst:Apertium-tat/stats|quran-coverage}}% |
||
− | |- |
||
− | |new testament |
||
− | |align="right"| {{#lst:Apertium-tat/stats|NewTestament-words}} |
||
− | | ~{{#lst:Apertium-tat/stats|NewTestament-coverage}}% |
||
− | |- |
||
− | |aytmatov |
||
− | |align="right"| {{#lst:Apertium-tat/stats|aytmatov-words}} |
||
− | | ~{{#lst:Apertium-tat/stats|aytmatov-coverage}}% |
||
− | |- |
||
− | |wp 2011-12-15 |
||
− | |align="right"| {{#lst:Apertium-tat/stats|wp2011-words}} |
||
− | | ~{{#lst:Apertium-tat/stats|wp2011-coverage}}% |
||
− | |- |
||
− | | tat.news.2005-2011_300K |
||
− | |align="right"| {{#lst:Apertium-tat/stats|tatnews2005/11-words}} |
||
− | | ~{{#lst:Apertium-tat/stats|tatnews2005/11-coverage}}% |
||
− | |} |
||
=== See also === |
=== See also === |
Revision as of 03:24, 9 January 2013
apertium-tat (or tatmorph) is a morphological analyser/generator for Tatar, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.
Installation
apertium-tat is currently located in [1].
Current State
{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}
- Number of stems: 55,702 {{#ifneq | | | () }}
- Disambiguation rules: 123
- Coverage: ~91%
{{#ifneq | quran | None |
{{#ifneq | | | | }}}}
{{#ifneq | NewTestament | None |
{{#ifneq | | | | }}}}
{{#ifneq | aytmatov | None |
{{#ifneq | | | | }}}}
{{#ifneq | wp2011 | None |
{{#ifneq | | | | }}}}
{{#ifneq | tatnews2005/11 | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus6}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus7}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus8}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus9}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus10}}} | None |
{{#ifneq | | | | }}}}
corpus | words | coverage | |
---|---|---|---|
<nowinter>[[|quran]]</nowinter> | quran | 165K | ~89.2% |
<nowinter>[[|NewTestament]]</nowinter> | NewTestament | 137K | ~94.2% |
<nowinter>[[|aytmatov]]</nowinter> | aytmatov | 5K | ~93.4% |
<nowinter>[[|wp2011]]</nowinter> | wp2011 | ~% | |
<nowinter>[[|tatnews2005/11]]</nowinter> | tatnews2005/11 | 4.6M | ~90.7% |
<nowinter>[[|{{{corpus6}}}]]</nowinter> | {{{corpus6}}} | ~% | |
<nowinter>[[|{{{corpus7}}}]]</nowinter> | {{{corpus7}}} | ~% | |
<nowinter>[[|{{{corpus8}}}]]</nowinter> | {{{corpus8}}} | ~% | |
<nowinter>[[|{{{corpus9}}}]]</nowinter> | {{{corpus9}}} | ~% | |
<nowinter>[[|{{{corpus10}}}]]</nowinter> | {{{corpus10}}} | ~% |