Difference between revisions of "Apertium-kir"

From Apertium
Jump to navigation Jump to search
(GitHub migration)
 
(5 intermediate revisions by 3 users not shown)
Line 3: Line 3:
   
 
== Installation ==
 
== Installation ==
'''kymorph''' is currently located in [[tr-ky]].
 
   
  +
<pre>
=== Dependency tree ===
 
  +
$ git clone https://github.com/apertium/apertium-kir.git
* apertium-tr-ky
 
** apertium-tr-az
 
*** apertium
 
**** lttoolbox
 
*** VISL CG3
 
**** cmake
 
**** libicu-dev
 
**** tmalloc (libgoogle-perftools-dev)
 
***** libtcmalloc-minimal0
 
***** libgoogle-perftools0
 
**** boost
 
*** trmorph
 
**** hfst (≥r1559 for kymorph)
 
***** openfst
 
***** sfst
 
****** libreadline6-dev
 
***** foma
 
****** flex
 
****** bison
 
*** azmorph
 
   
  +
</pre>
[[Category:Tools]]
 
   
 
== Current State ==
 
== Current State ==
  +
{{LangStats | lang = kir | corpus1 = azattyk2010 | corpus2 = azattyk2009 | corpus3 = bible | corpus4 = wp2011/04}}
* Number of stems: {{:Kymorph/stems}}
 
* Coverage: {{:Kymorph/coverage/average}}
 
 
{|class="wikitable"
 
|-
 
! corpus !! words !! coverage
 
|-
 
|| [[RFERL corpora|azattyk]] 2010
 
|align="right"| {{:RFERL corpus/ky/2010/stems}}
 
|| ~{{:Kymorph/coverage/rferl2010}}%
 
|-
 
|| [[RFERL corpora|azattyk]] 2009
 
|align="right"| {{:RFERL corpus/ky/2009/stems}}
 
|| ~{{:Kymorph/coverage/rferl2009}}%
 
|-
 
||bible
 
|align="right"|174K
 
|| ~{{:Kymorph/coverage/bible}}%
 
|-
 
||WP 2011-04
 
|align="right"|545k
 
|| ~{{:Kymorph/coverage/wikipedia}}%
 
|}
 
   
 
== To-do ==
 
== To-do ==
  +
 
[[Category:Tools]]

Latest revision as of 05:53, 8 March 2018

Kymorph is a morphological analyser/generator for the Kyrgyz language, currently working. It is intended to be compatible with transducers for other Turkic languages so that they can be translated between.

Installation[edit]

$ git clone https://github.com/apertium/apertium-kir.git

Current State[edit]

{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}

  • Number of stems: 14,424 {{#ifneq | | | () }}
  • Disambiguation rules: 16
  • Coverage: ~90.4%

{{#ifneq | azattyk2010 | None |

{{#ifneq | RFERL_corpora | | | }}

}}

{{#ifneq | azattyk2009 | None |

{{#ifneq | RFERL_corpora | | | }}

}}

{{#ifneq | bible | None |

{{#ifneq | | | | }}

}}

{{#ifneq | wp2011/04 | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus5}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus6}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus7}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus8}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus9}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus10}}} | None |

{{#ifneq | | | | }}

}}

corpuswordscoverage
<nowinter>azattyk2010</nowinter>azattyk20103.4M ~92.11%
<nowinter>azattyk2009</nowinter>azattyk20094.1M ~92.04%
<nowinter>[[|bible]]</nowinter>bible174K ~92.25%
<nowinter>[[|wp2011/04]]</nowinter>wp2011/04545K ~85.37%
<nowinter>[[|{{{corpus5}}}]]</nowinter>{{{corpus5}}} ~%
<nowinter>[[|{{{corpus6}}}]]</nowinter>{{{corpus6}}} ~%
<nowinter>[[|{{{corpus7}}}]]</nowinter>{{{corpus7}}} ~%
<nowinter>[[|{{{corpus8}}}]]</nowinter>{{{corpus8}}} ~%
<nowinter>[[|{{{corpus9}}}]]</nowinter>{{{corpus9}}} ~%
<nowinter>[[|{{{corpus10}}}]]</nowinter>{{{corpus10}}} ~%

To-do[edit]