Apertium-tat

From Apertium
Jump to navigation Jump to search

Apertium-tat is a morphological analyser/generator and CG tagger for Tatar, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between. It's used in the following language pairs:

Installation

apertium-tat is currently located in [1].

Current State

{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}

  • Number of stems: 55,702 {{#ifneq | | | () }}
  • Disambiguation rules: 123
  • Coverage: ~91%

{{#ifneq | quran | None |

{{#ifneq | | | | }}

}}

{{#ifneq | NewTestament | None |

{{#ifneq | | | | }}

}}

{{#ifneq | aytmatov | None |

{{#ifneq | | | | }}

}}

{{#ifneq | wp2013 | None |

{{#ifneq | | | | }}

}}

{{#ifneq | tatnews2005/11 | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus6}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus7}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus8}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus9}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus10}}} | None |

{{#ifneq | | | | }}

}}

corpuswordscoverage
<nowinter>[[|quran]]</nowinter>quran165K ~89.2%
<nowinter>[[|NewTestament]]</nowinter>NewTestament137K ~94.2%
<nowinter>[[|aytmatov]]</nowinter>aytmatov5K ~93.4%
<nowinter>[[|wp2013]]</nowinter>wp2013128K ~87.3%
<nowinter>[[|tatnews2005/11]]</nowinter>tatnews2005/114.6M ~90.7%
<nowinter>[[|{{{corpus6}}}]]</nowinter>{{{corpus6}}} ~%
<nowinter>[[|{{{corpus7}}}]]</nowinter>{{{corpus7}}} ~%
<nowinter>[[|{{{corpus8}}}]]</nowinter>{{{corpus8}}} ~%
<nowinter>[[|{{{corpus9}}}]]</nowinter>{{{corpus9}}} ~%
<nowinter>[[|{{{corpus10}}}]]</nowinter>{{{corpus10}}} ~%