Difference between revisions of "Apertium-kir"
Firespeaker (talk | contribs) |
(GitHub migration) |
||
(6 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
+ | {{TOCD}} |
||
'''Kymorph''' is a morphological analyser/generator for the [[Kyrgyz language]], currently working. It is intended to be compatible with transducers for other [[Turkic languages]] so that they can be translated between. |
'''Kymorph''' is a morphological analyser/generator for the [[Kyrgyz language]], currently working. It is intended to be compatible with transducers for other [[Turkic languages]] so that they can be translated between. |
||
== Installation == |
== Installation == |
||
− | '''kymorph''' is currently located in [[tr-ky]]. |
||
+ | <pre> |
||
− | === Dependency tree === |
||
+ | $ git clone https://github.com/apertium/apertium-kir.git |
||
− | * apertium-tr-ky |
||
− | ** apertium-tr-az |
||
− | *** apertium |
||
− | **** lttoolbox |
||
− | *** VISL CG3 |
||
− | **** cmake |
||
− | **** libicu-dev |
||
− | **** tmalloc (libgoogle-perftools-dev) |
||
− | ***** libtcmalloc-minimal0 |
||
− | ***** libgoogle-perftools0 |
||
− | **** boost |
||
− | *** trmorph |
||
− | **** hfst (≥r1559 for kymorph) |
||
− | ***** openfst |
||
− | ***** sfst |
||
− | ****** libreadline6-dev |
||
− | ***** foma |
||
− | ****** flex |
||
− | ****** bison |
||
− | *** azmorph |
||
+ | </pre> |
||
⚫ | |||
== Current State == |
== Current State == |
||
+ | {{LangStats | lang = kir | corpus1 = azattyk2010 | corpus2 = azattyk2009 | corpus3 = bible | corpus4 = wp2011/04}} |
||
− | * Number of stems: {{:Kymorph/stems}} |
||
− | * Coverage: {{:Kymorph/coverage/average}} |
||
− | |||
− | {|class="wikitable" |
||
− | |- |
||
− | ! corpus !! words !! coverage |
||
− | |- |
||
− | || [[RFERL corpora|azattyk]] 2010 |
||
− | |align="right"| {{:RFERL corpus/ky/2010/stems}} |
||
− | || ~{{:Kymorph/coverage/rferl2010}}% |
||
− | |- |
||
− | || [[RFERL corpora|azattyk]] 2009 |
||
− | |align="right"| {{:RFERL corpus/ky/2009/stems}} |
||
− | || ~{{:Kymorph/coverage/rferl2009}}% |
||
− | |- |
||
− | ||bible |
||
− | |align="right"|174K |
||
− | || ~{{:Kymorph/coverage/bible}}% |
||
− | |- |
||
− | ||WP 2011-04 |
||
− | |align="right"|545k |
||
− | || ~{{:Kymorph/coverage/wikipedia}}% |
||
− | |} |
||
== To-do == |
== To-do == |
||
+ | |||
⚫ |
Latest revision as of 05:53, 8 March 2018
Contents |
Kymorph is a morphological analyser/generator for the Kyrgyz language, currently working. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.
Installation[edit]
$ git clone https://github.com/apertium/apertium-kir.git
Current State[edit]
{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}
- Number of stems: 14,424 {{#ifneq | | | () }}
- Disambiguation rules: 16
- Coverage: ~90.4%
{{#ifneq | azattyk2010 | None |
{{#ifneq | RFERL_corpora | | | }}}}
{{#ifneq | azattyk2009 | None |
{{#ifneq | RFERL_corpora | | | }}}}
{{#ifneq | bible | None |
{{#ifneq | | | | }}}}
{{#ifneq | wp2011/04 | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus5}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus6}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus7}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus8}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus9}}} | None |
{{#ifneq | | | | }}}}
{{#ifneq | {{{corpus10}}} | None |
{{#ifneq | | | | }}}}
corpus | words | coverage | |
---|---|---|---|
<nowinter>azattyk2010</nowinter> | azattyk2010 | 3.4M | ~92.11% |
<nowinter>azattyk2009</nowinter> | azattyk2009 | 4.1M | ~92.04% |
<nowinter>[[|bible]]</nowinter> | bible | 174K | ~92.25% |
<nowinter>[[|wp2011/04]]</nowinter> | wp2011/04 | 545K | ~85.37% |
<nowinter>[[|{{{corpus5}}}]]</nowinter> | {{{corpus5}}} | ~% | |
<nowinter>[[|{{{corpus6}}}]]</nowinter> | {{{corpus6}}} | ~% | |
<nowinter>[[|{{{corpus7}}}]]</nowinter> | {{{corpus7}}} | ~% | |
<nowinter>[[|{{{corpus8}}}]]</nowinter> | {{{corpus8}}} | ~% | |
<nowinter>[[|{{{corpus9}}}]]</nowinter> | {{{corpus9}}} | ~% | |
<nowinter>[[|{{{corpus10}}}]]</nowinter> | {{{corpus10}}} | ~% |