Difference between revisions of "Khalkha"
Firespeaker (talk | contribs) (Created page with ''''Khalkha''' is the prestige variety of '''Mongolian''' spoken in Mongolia. A transducer for it written using HFST is under development in <code>incubator/apertium-khk</code>. …')
Firespeaker (talk | contribs) m (Firespeaker moved page Khalkha language to Khalkha over redirect)
(10 intermediate revisions by 2 users not shown)
'''Khalkha''' is the prestige variety of '''Mongolian''' spoken in Mongolia.

A morphological analyser/generator for Khalkha is under development in <code>incubator/apertium-khk</code>. It is written using HFST and is intended to be largely compatible with the transducers for [[Turkic languages]]. It is currently used in the following language pairs:

== Coverage ==

* [[Khalkha and Kazakh]]

== Current State ==

{{LangStats | lang = khk | corpus1 = bible | corpus2 = olloo2012 | corpus3 = wp2013 }}

== Current Roadmap ==

** Figure out [[Morphology of Khalkha/АА vowel harmony issue|АА vowel harmony issue]]
** Figure out and correctly implement <code>{а}</code> insertion/deletion.

[[Category:Khalkha]]
Latest revision as of 03:38, 2 September 2014
Khalkha is the prestige variety of Mongolian spoken in Mongolia.
A morphological analyser/generator for Khalkha is under development in incubator/apertium-khk. It is written using HFST and is intended to be largely compatible with the transducers for Turkic languages. It is currently used in the following language pairs:
- Khalkha and Kazakh
Current State
- Number of stems: 441
- Disambiguation rules:
- Coverage: ~50.6%
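The page does not say how the coverage figure was measured. A common convention for Apertium analysers is "naive coverage": the share of tokens that receive at least one analysis. The sketch below assumes that convention and the Apertium stream format, in which an unknown form is emitted as `^form/*form$`. The function name, the sample forms and the tags are invented for illustration and are not taken from apertium-khk.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(stream: str) -> float:
    """Return the share of lexical units that received at least one analysis."""
    units = LEXICAL_UNIT.findall(stream)
    if not units:
        return 0.0
    # An unknown word is emitted as ^form/*form$, so its analysis starts with '*'.
    known = sum(1 for _surface, analyses in units if not analyses.startswith("*"))
    return known / len(units)


# Hypothetical analyser output; the forms and tags are made up for this example.
sample = "^ном/ном<n><nom>$ ^уншсан/унш<v><past>$ ^qwerty/*qwerty$ ^zzz/*zzz$"
print(f"{naive_coverage(sample):.1%}")  # two of four units are analysed: 50.0%
```

This simplified pattern ignores escaped characters and superblanks, so it is only suitable for a rough count.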
corpus | words | coverage |
---|---|---|
bible | 606K | ~59.2% |
olloo2012 | 5.9M | ~46.8% |
wp2013 | 125K | ~45.7% |
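The overall coverage of ~50.6% listed above matches the unweighted mean of the three per-corpus figures rather than a mean weighted by corpus size. The page does not state how the template derives the overall figure, so the check below is only an observation about the numbers, and the word counts in it are the rounded values from the table.

```python
# Per-corpus figures copied from the table: (approximate word count, coverage in %).
corpora = {
    "bible": (606_000, 59.2),
    "olloo2012": (5_900_000, 46.8),
    "wp2013": (125_000, 45.7),
}

# Plain average of the three percentages, ignoring corpus size.
unweighted = sum(cov for _, cov in corpora.values()) / len(corpora)

# Average weighted by the number of words in each corpus.
total_words = sum(words for words, _ in corpora.values())
weighted = sum(words * cov for words, cov in corpora.values()) / total_words

print(f"unweighted mean: ~{unweighted:.1f}%")     # ~50.6%
print(f"word-weighted mean: ~{weighted:.1f}%")    # ~47.9%
```

Because olloo2012 is far larger than the other two corpora, a word-weighted figure sits much closer to its ~46.8%.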
Current Roadmap
- Fully implement morphophonology
  - Figure out АА vowel harmony issue
  - Figure out and correctly implement {а} insertion/deletion.