Difference between revisions of "Apertium-uig"
Firespeaker (talk | contribs)
Revision as of 19:51, 8 June 2014
Apertium-uig is a morphological analyser/generator and CG tagger for Uyghur, currently under development. It is intended to be compatible with transducers for other Turkic languages, so that text can be translated between them. It's used in the following language pairs:
Installation
Apertium-uig is currently located in [1].
Dependency tree
To use apertium-uig, you'll need the following software. See this guide for how to install these dependencies. If you use Windows (and not Linux or Mac OS), you'll probably want to use Apertium VirtualBox, which comes with all of the dependencies.
- hfst
- foma
- flex
- foma
- apertium
- lttoolbox
- VISL-CG
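Once the dependencies are installed, a quick check that they are reachable on your `PATH` can save some debugging. The following is a minimal sketch, not part of apertium-uig itself: the function name `missing_dependencies` is hypothetical, and the mapping from each dependency to a command name (`hfst-lookup`, `lt-proc`, `cg-proc`, and so on) is an assumption about what a typical installation provides, so adjust it to match your system.

```python
import shutil

# Assumed mapping from each dependency to one executable it usually installs.
# These command names are assumptions; adjust them for your system.
DEPENDENCY_COMMANDS = {
    "hfst": "hfst-lookup",
    "foma": "foma",
    "flex": "flex",
    "apertium": "apertium",
    "lttoolbox": "lt-proc",
    "VISL-CG": "cg-proc",
}


def missing_dependencies(which=shutil.which):
    """Return the dependencies whose assumed executable is not on PATH.

    `which` is injectable so the check can be exercised without the tools.
    """
    return [
        name
        for name, command in DEPENDENCY_COMMANDS.items()
        if which(command) is None
    ]
```

Calling `missing_dependencies()` with no arguments returns an empty list when every assumed command is found, and otherwise the names of the dependencies that still need installing.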
Current State
- Number of stems: 17,585
- Disambiguation rules: 7
- Coverage: ~54.2%
corpus | words | coverage |
---|---|---|
wikipedia | 1.4M | ~53.2% |
udhr | 1.8K | ~55.1% |
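For readers who want to reproduce a coverage figure of this kind, here is a minimal sketch of a naive coverage calculation. It assumes the analyser's output is in Apertium stream format, where each lexical unit is written `^surface/analysis$` and an unknown word has an analysis starting with `*`. Whether the figures above were computed exactly this way is an assumption. The function name `naive_coverage` is hypothetical and the sample words and tags are made up for illustration.

```python
import re

# Matches one lexical unit in Apertium stream format: ^surface/analysis1/...$
# Simplification: escaped characters inside a unit are not handled.
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text):
    """Return the fraction of lexical units that received a real analysis."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    # An analysis beginning with '*' marks a word the analyser does not know.
    known = sum(1 for _surface, analyses in units if not analyses.startswith("*"))
    return known / len(units)


# Made-up sample: two analysed units and two unknown ones.
sample = "^kitab/kitab<n><nom>$ ^xyz/*xyz$ ^we/we<prn>$ ^qq/*qq$"
print(f"{naive_coverage(sample):.1%}")
```

This counts every token equally, so frequent unknown words pull the figure down more than rare ones, which is usually what a corpus coverage number is meant to reflect.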