Difference between revisions of "Apertium-tat"
Line 5: | Line 5: | ||
== Current State == |
== Current State == |
||
{{LangStats | lang = tat | corpus1 = quran | corpus2 = NewTestament | corpus3 = aytmatov | corpus4 = wp2011 | corpus5 = tatnews2005/11}} |
|||
* Number of stems: {{#lst:Apertium-tat/stats|stems}} |
|||
* Coverage: ~{{:Apertium-tat/stats/average}}% |
|||
{| class="wikitable" |
|||
|- |
|||
! corpus !! words !! coverage |
|||
|- |
|||
|quran |
|||
|align="right"| {{#lst:Apertium-tat/stats|quran-words}} |
|||
| ~{{#lst:Apertium-tat/stats|quran-coverage}}% |
|||
|- |
|||
|new testament |
|||
|align="right"| {{#lst:Apertium-tat/stats|NewTestament-words}} |
|||
| ~{{#lst:Apertium-tat/stats|NewTestament-coverage}}% |
|||
|- |
|||
|aytmatov |
|||
|align="right"| {{#lst:Apertium-tat/stats|aytmatov-words}} |
|||
| ~{{#lst:Apertium-tat/stats|aytmatov-coverage}}% |
|||
|- |
|||
|wp 2011-12-15 |
|||
|align="right"| {{#lst:Apertium-tat/stats|wp2011-words}} |
|||
| ~{{#lst:Apertium-tat/stats|wp2011-coverage}}% |
|||
|- |
|||
| tat.news.2005-2011_300K |
|||
|align="right"| {{#lst:Apertium-tat/stats|tatnews2005/11-words}} |
|||
| ~{{#lst:Apertium-tat/stats|tatnews2005/11-coverage}}% |
|||
|} |
|||
=== See also === |
=== See also === |
Revision as of 03:24, 9 January 2013
apertium-tat (or tatmorph) is a morphological analyser/generator for Tatar, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.
Installation
apertium-tat is currently located in [1].
Current State
{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}
- Number of stems: 55,702 {{#ifneq | | | () }}
- Disambiguation rules: 123
- Coverage: ~91%
{{#ifneq | quran | None |
{{#ifneq | | | | }}}}
{{#ifneq | NewTestament | None |
{{#ifneq | | | | }}}}
{{#ifneq | aytmatov | None |
{{#ifneq | | | | }}}}
{{#ifneq | wp2011 | None |
{{#ifneq | | | | }}}}
{{#ifneq | tatnews2005/11 | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus6}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus7}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus8}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus9}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus10}}} | None |
{{#ifneq | | | | }}}}
corpus | words | coverage | |
---|---|---|---|
<nowinter>[[|quran]]</nowinter> | quran | 165K | ~89.2% |
<nowinter>[[|NewTestament]]</nowinter> | NewTestament | 137K | ~94.2% |
<nowinter>[[|aytmatov]]</nowinter> | aytmatov | 5K | ~93.4% |
<nowinter>[[|wp2011]]</nowinter> | wp2011 | ~% | |
<nowinter>[[|tatnews2005/11]]</nowinter> | tatnews2005/11 | 4.6M | ~90.7% |
<nowinter>[[|{{{corpus6}}}]]</nowinter> | {{{corpus6}}} | ~% | |
<nowinter>[[|{{{corpus7}}}]]</nowinter> | {{{corpus7}}} | ~% | |
<nowinter>[[|{{{corpus8}}}]]</nowinter> | {{{corpus8}}} | ~% | |
<nowinter>[[|{{{corpus9}}}]]</nowinter> | {{{corpus9}}} | ~% | |
<nowinter>[[|{{{corpus10}}}]]</nowinter> | {{{corpus10}}} | ~% |