Difference between revisions of "Apertium-tat"
| Line 5: | Line 5: | ||
| == Current State == | == Current State == | ||
| {{LangStats | lang = tat | corpus1 = quran | corpus2 = NewTestament | corpus3 = aytmatov | corpus4 = wp2011 | corpus5 = tatnews2005/11}} | |||
| * Number of stems: {{#lst:Apertium-tat/stats|stems}} | |||
| * Coverage: ~{{:Apertium-tat/stats/average}}% | |||
| {| class="wikitable" | |||
| |- | |||
| ! corpus !! words !! coverage | |||
| |- | |||
| |quran | |||
| |align="right"| {{#lst:Apertium-tat/stats|quran-words}} | |||
| | ~{{#lst:Apertium-tat/stats|quran-coverage}}% | |||
| |- | |||
| |new testament | |||
| |align="right"| {{#lst:Apertium-tat/stats|NewTestament-words}} | |||
| | ~{{#lst:Apertium-tat/stats|NewTestament-coverage}}% | |||
| |- | |||
| |aytmatov | |||
| |align="right"| {{#lst:Apertium-tat/stats|aytmatov-words}} | |||
| | ~{{#lst:Apertium-tat/stats|aytmatov-coverage}}% | |||
| |- | |||
| |wp 2011-12-15 | |||
| |align="right"| {{#lst:Apertium-tat/stats|wp2011-words}} | |||
| | ~{{#lst:Apertium-tat/stats|wp2011-coverage}}% | |||
| |- | |||
| | tat.news.2005-2011_300K | |||
| |align="right"| {{#lst:Apertium-tat/stats|tatnews2005/11-words}} | |||
| | ~{{#lst:Apertium-tat/stats|tatnews2005/11-coverage}}% | |||
| |} | |||
| === See also === | === See also === | ||
Revision as of 03:24, 9 January 2013
apertium-tat (or tatmorph) is a morphological analyser/generator for Tatar, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.
Installation
apertium-tat is currently located in [1].
Current State
{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}
- Number of stems: 55,702 {{#ifneq | | | () }}
- Disambiguation rules: 123
- Coverage: ~91%
{{#ifneq | quran | None |
{{#ifneq | | | | }}}}
{{#ifneq | NewTestament | None |
{{#ifneq | | | | }}}}
{{#ifneq | aytmatov | None |
{{#ifneq | | | | }}}}
{{#ifneq | wp2011 | None |
{{#ifneq | | | | }}}}
{{#ifneq | tatnews2005/11 | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus6}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus7}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus8}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus9}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus10}}} | None |
{{#ifneq | | | | }}}}
| corpus | words | coverage | |
|---|---|---|---|
| <nowinter>[[|quran]]</nowinter> | quran | 165K | ~89.2% | 
| <nowinter>[[|NewTestament]]</nowinter> | NewTestament | 137K | ~94.2% | 
| <nowinter>[[|aytmatov]]</nowinter> | aytmatov | 5K | ~93.4% | 
| <nowinter>[[|wp2011]]</nowinter> | wp2011 | ~% | |
| <nowinter>[[|tatnews2005/11]]</nowinter> | tatnews2005/11 | 4.6M | ~90.7% | 
| <nowinter>[[|{{{corpus6}}}]]</nowinter> | {{{corpus6}}} | ~% | |
| <nowinter>[[|{{{corpus7}}}]]</nowinter> | {{{corpus7}}} | ~% | |
| <nowinter>[[|{{{corpus8}}}]]</nowinter> | {{{corpus8}}} | ~% | |
| <nowinter>[[|{{{corpus9}}}]]</nowinter> | {{{corpus9}}} | ~% | |
| <nowinter>[[|{{{corpus10}}}]]</nowinter> | {{{corpus10}}} | ~% | 

