Apertium-uig

From Apertium
Jump to navigation Jump to search

Apertium-uig is a morphological analyser/generator and CG tagger for Uyghur, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between. It's used in the following language pairs:

Installation[edit]

Apertium-uig is currently located in our git repository at languages/apertium-uig. A live version may be evaluated at turkic.apertium.org, though no guarantees are made as to whether it is the latest version.

Dependency tree[edit]

To use apertium-uig, you'll need the following software. See this guide for how to install these dependencies. If you use Windows (and not Linux or Mac OS), you'll probably want to use Apertium VirtualBox, which comes with all of the dependencies.

  • hfst
    • foma
      • flex
  • apertium
    • lttoolbox
  • VISL-CG

You can also try our linux binary repositories, e.g. for Debian/Ubuntu.

Contributing / collaborating[edit]

See our document on contributing.

Current State[edit]

{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}

  • Number of stems: 17,585 {{#ifneq | | | () }}
  • Disambiguation rules: 7
  • Coverage: ~54.2%

{{#ifneq | wikipedia | None |

{{#ifneq | | | | }}

}}

{{#ifneq | udhr | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus3}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus4}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus5}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus6}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus7}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus8}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus9}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus10}}} | None |

{{#ifneq | | | | }}

}}

corpuswordscoverage
<nowinter>[[|wikipedia]]</nowinter>wikipedia1.4M ~53.2%
<nowinter>[[|udhr]]</nowinter>udhr1.8K ~55.1%
<nowinter>[[|{{{corpus3}}}]]</nowinter>{{{corpus3}}} ~%
<nowinter>[[|{{{corpus4}}}]]</nowinter>{{{corpus4}}} ~%
<nowinter>[[|{{{corpus5}}}]]</nowinter>{{{corpus5}}} ~%
<nowinter>[[|{{{corpus6}}}]]</nowinter>{{{corpus6}}} ~%
<nowinter>[[|{{{corpus7}}}]]</nowinter>{{{corpus7}}} ~%
<nowinter>[[|{{{corpus8}}}]]</nowinter>{{{corpus8}}} ~%
<nowinter>[[|{{{corpus9}}}]]</nowinter>{{{corpus9}}} ~%
<nowinter>[[|{{{corpus10}}}]]</nowinter>{{{corpus10}}} ~%