Difference between revisions of "Apertium-uig"
Firespeaker (talk | contribs) |
(GitHub migration) |
||
(5 intermediate revisions by one other user not shown) | |||
Line 7: | Line 7: | ||
== Installation == |
== Installation == |
||
'''Apertium-uig''' is currently located in our |
'''Apertium-uig''' is currently located in our [[Using git|git repository]] at [https://github.com/apertium/apertium-uig languages/apertium-uig]. A live version may be evaluated at [http://turkic.apertium.org/?choice=uig#analyzation turkic.apertium.org], though no guarantees are made as to whether it is the latest version. |
||
=== Dependency tree === |
=== Dependency tree === |
||
Line 18: | Line 18: | ||
** lttoolbox |
** lttoolbox |
||
* VISL-CG |
* VISL-CG |
||
You can also try our linux binary repositories, e.g. [[Prerequisites for Debian|for Debian/Ubuntu]]. |
|||
== Contributing / collaborating == |
|||
See [[Contributing|our document on contributing]]. |
|||
== Current State == |
== Current State == |
Latest revision as of 05:57, 8 March 2018
Apertium-uig is a morphological analyser/generator and CG tagger for Uyghur, currently under development. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between. It's used in the following language pairs:
Installation[edit]
Apertium-uig is currently located in our git repository at languages/apertium-uig. A live version may be evaluated at turkic.apertium.org, though no guarantees are made as to whether it is the latest version.
Dependency tree[edit]
To use apertium-uig, you'll need the following software. See this guide for how to install these dependencies. If you use Windows (and not Linux or Mac OS), you'll probably want to use Apertium VirtualBox, which comes with all of the dependencies.
- hfst
- foma
- flex
- foma
- apertium
- lttoolbox
- VISL-CG
You can also try our linux binary repositories, e.g. for Debian/Ubuntu.
Contributing / collaborating[edit]
See our document on contributing.
Current State[edit]
{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}
- Number of stems: 17,585 {{#ifneq | | | () }}
- Disambiguation rules: 7
- Coverage: ~54.2%
{{#ifneq | wikipedia | None |
{{#ifneq | | | | }}}}
{{#ifneq | udhr | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus3}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus4}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus5}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus6}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus7}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus8}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus9}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus10}}} | None |
{{#ifneq | | | | }}}}
corpus | words | coverage | |
---|---|---|---|
<nowinter>[[|wikipedia]]</nowinter> | wikipedia | 1.4M | ~53.2% |
<nowinter>[[|udhr]]</nowinter> | udhr | 1.8K | ~55.1% |
<nowinter>[[|{{{corpus3}}}]]</nowinter> | {{{corpus3}}} | ~% | |
<nowinter>[[|{{{corpus4}}}]]</nowinter> | {{{corpus4}}} | ~% | |
<nowinter>[[|{{{corpus5}}}]]</nowinter> | {{{corpus5}}} | ~% | |
<nowinter>[[|{{{corpus6}}}]]</nowinter> | {{{corpus6}}} | ~% | |
<nowinter>[[|{{{corpus7}}}]]</nowinter> | {{{corpus7}}} | ~% | |
<nowinter>[[|{{{corpus8}}}]]</nowinter> | {{{corpus8}}} | ~% | |
<nowinter>[[|{{{corpus9}}}]]</nowinter> | {{{corpus9}}} | ~% | |
<nowinter>[[|{{{corpus10}}}]]</nowinter> | {{{corpus10}}} | ~% |