Difference between revisions of "Apertium-uig"

From Apertium
Jump to navigation Jump to search
Line 18: Line 18:
 
** lttoolbox
 
** lttoolbox
 
* VISL-CG
 
* VISL-CG
  +
  +
You can also try our linux binary repositories, e.g. [[Prerequisites for Debian|for Debian/Ubuntu]].
  +
  +
== Contributing / collaborating ==
  +
  +
See [[Contributing|our document on contributing]].
   
 
== Current State ==
 
== Current State ==

Revision as of 07:11, 1 February 2016

Apertium-uig is a morphological analyser/generator and CG tagger for Uyghur, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between. It's used in the following language pairs:

Installation

Apertium-uig is currently located in our incubator in svn at incubator/apertium-uig. A live version may be evaluated at turkic.apertium.org, though no guarantees are made as to whether it is the latest version.

Dependency tree

To use apertium-uig, you'll need the following software. See this guide for how to install these dependencies. If you use Windows (and not Linux or Mac OS), you'll probably want to use Apertium VirtualBox, which comes with all of the dependencies.

  • hfst
    • foma
      • flex
  • apertium
    • lttoolbox
  • VISL-CG

You can also try our linux binary repositories, e.g. for Debian/Ubuntu.

Contributing / collaborating

See our document on contributing.

Current State

{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}

  • Number of stems: 17,585 {{#ifneq | | | () }}
  • Disambiguation rules: 7
  • Coverage: ~54.2%

{{#ifneq | wikipedia | None |

{{#ifneq | | | | }}

}}

{{#ifneq | udhr | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus3}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus4}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus5}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus6}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus7}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus8}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus9}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus10}}} | None |

{{#ifneq | | | | }}

}}

corpuswordscoverage
<nowinter>[[|wikipedia]]</nowinter>wikipedia1.4M ~53.2%
<nowinter>[[|udhr]]</nowinter>udhr1.8K ~55.1%
<nowinter>[[|{{{corpus3}}}]]</nowinter>{{{corpus3}}} ~%
<nowinter>[[|{{{corpus4}}}]]</nowinter>{{{corpus4}}} ~%
<nowinter>[[|{{{corpus5}}}]]</nowinter>{{{corpus5}}} ~%
<nowinter>[[|{{{corpus6}}}]]</nowinter>{{{corpus6}}} ~%
<nowinter>[[|{{{corpus7}}}]]</nowinter>{{{corpus7}}} ~%
<nowinter>[[|{{{corpus8}}}]]</nowinter>{{{corpus8}}} ~%
<nowinter>[[|{{{corpus9}}}]]</nowinter>{{{corpus9}}} ~%
<nowinter>[[|{{{corpus10}}}]]</nowinter>{{{corpus10}}} ~%