Difference between revisions of "Apertium-aze"

From Apertium
Jump to navigation Jump to search
 
(8 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{TOCD}}
Azmorph, a morphological analyzer for Azerbaijani
Azmorph, a morphological analyzer for Azerbaijani


Azmorph current version is 0.2.1 (preALPHA)
Azmorph current version is 0.2.1 (preALPHA)


===What it is, what it does and what it does not ===
==What it is, what it does and what it does not ==


Azmorph is a morphological analyzer for Azerbaijani (Azerbaycan dili).
Azmorph is a morphological analyzer for Azerbaijani (Azerbaycan dili).
Line 11: Line 12:
Azmorph is at preALPHA stage of developement: this means that it works for very few features of the language (which will be explained later). If you are not familiar with the nerdish jargon, preALPHA means roughly that the software is in its embryonal state. Is it already a life? We don't know, better you consult your local Church. What we know is that, beside being a problematic embryo we decided to keep it and try to provide it with a decent development. We know it will be a problematic child, probably with several impairments, but we decided to keep it anyway.
Azmorph is at preALPHA stage of developement: this means that it works for very few features of the language (which will be explained later). If you are not familiar with the nerdish jargon, preALPHA means roughly that the software is in its embryonal state. Is it already a life? We don't know, better you consult your local Church. What we know is that, beside being a problematic embryo we decided to keep it and try to provide it with a decent development. We know it will be a problematic child, probably with several impairments, but we decided to keep it anyway.


== Current State ==

{{LangStats | lang = aze | corpus1 = azadliq2012 | corpus2 = quran | corpus3 = udhr }}


==What works? What does not?==
==What works? What does not?==
Line 24: Line 28:
|-
|-
! scope="row" | Present Progressive (alıram)
! scope="row" | Present Progressive (alıram)
| || Consonant alternation, negative doesn't work well ||
| || negative doesn't work well ||
|-
|-
! scope="row" | Imperative
! scope="row" | Imperative
Line 30: Line 34:
|-
|-
! scope="row" | Future indicative (alacağım)
! scope="row" | Future indicative (alacağım)
| || Consonant alternation and devoicing ||
| || 1p and 1s devoicing ||
|-
|-
! scope="row" | Evidential/Past perfect (almışam)
! scope="row" | Evidential/Past perfect (almışam)
| || Consonant alternation ||
| Works! || ||
|-
|-
! scope="row" | Indefinite future (alaram)
! scope="row" | Indefinite future / Aorist (alaram) <t_aor>
| || || Absent
| Works! || ||
|-
|-
! scope="row" | Optative present(alam)
! scope="row" | Optative present(alam)
| || Consonant Alternation ||
| Works! || ||
|-
|-
! scope="row" | Optative past (ala idim)
! scope="row" | Optative past (ala idim)
| Works || ||
| Works! || ||
|-
|-
! scope="row" | Necessitative present (almalıyım)
! scope="row" | Necessitative present (almalıyım)
| || Consonant Alternation ||
| Works || ||
|-
|-
! scope="row" | Necessitative past (almalı idim)
! scope="row" | Necessitative past (almalı idim)
Line 51: Line 55:
|-
|-
! scope="row" | Abilitative(bil-)
! scope="row" | Abilitative(bil-)
| || Has the same problems of indicative ||
| || has to be split ||
|-
|-
! scope="row" | i- copula (idim)
! scope="row" | i- copula (idim)
Line 67: Line 71:
! scope="col" | Absent
! scope="col" | Absent
|-
|-
! scope="row" | Cases (Nominative, Genitive, Ablative, Locative, Accusative, ?instrumental?)
! scope="row" | Cases (n, g, d, acc, abl, loc)
| Works! || ||
| Works! || ||
|-
|-
Line 74: Line 78:
|-
|-
! scope="row" | -l<I><Q>
! scope="row" | -l<I><Q>
| || Consonant alternation ||
| || Devoicing ||
|-
|-
! scope="row" | -L<A>
! scope="row" | -L<A>
Line 86: Line 90:
|-
|-
! scope="row" | Possessives
! scope="row" | Possessives
| Works! || ||
|-
! scope="row" | -ki realized as k<I>
| Works! || ||
| Works! || ||
|-
|-
|}
|}

== Known problems ==

===Phonology===

# <Q> should be replaced by <q> and <k>, and not simply by "q" and "k"
# Devoicing should be expanded, adding <q>

Latest revision as of 06:09, 6 December 2013

Azmorph, a morphological analyzer for Azerbaijani

Azmorph current version is 0.2.1 (preALPHA)

What it is, what it does and what it does not[edit]

Azmorph is a morphological analyzer for Azerbaijani (Azerbaycan dili).

Due to the similarities between Azerbaijani and Turkish Azmorph is being developed starting from TRmorph of Çağrı Çöltekin.

Azmorph is at preALPHA stage of developement: this means that it works for very few features of the language (which will be explained later). If you are not familiar with the nerdish jargon, preALPHA means roughly that the software is in its embryonal state. Is it already a life? We don't know, better you consult your local Church. What we know is that, beside being a problematic embryo we decided to keep it and try to provide it with a decent development. We know it will be a problematic child, probably with several impairments, but we decided to keep it anyway.

Current State[edit]

{{#set_param_default | corpus1 | None }} {{#set_param_default | corpus2 | None }} {{#set_param_default | corpus3 | None }} {{#set_param_default | corpus4 | None }} {{#set_param_default | corpus5 | None }} {{#set_param_default | corpus6 | None }} {{#set_param_default | corpus7 | None }} {{#set_param_default | corpus8 | None }} {{#set_param_default | corpus9 | None }} {{#set_param_default | corpus10 | None }}

{{#ifneq | azadliq2012 | None |

{{#ifneq | RFERL corpora | | | }}

}}

{{#ifneq | quran | None |

{{#ifneq | | | | }}

}}

{{#ifneq | udhr | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus4}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus5}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus6}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus7}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus8}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus9}}} | None |

{{#ifneq | | | | }}

}}

{{#ifneq | {{{corpus10}}} | None |

{{#ifneq | | | | }}

}}

corpuswordscoverage
<nowinter>azadliq2012</nowinter>azadliq20122.2M ~-%
<nowinter>[[|quran]]</nowinter>quran153K ~-%
<nowinter>[[|udhr]]</nowinter>udhr1.5K ~-%
<nowinter>[[|{{{corpus4}}}]]</nowinter>{{{corpus4}}} ~%
<nowinter>[[|{{{corpus5}}}]]</nowinter>{{{corpus5}}} ~%
<nowinter>[[|{{{corpus6}}}]]</nowinter>{{{corpus6}}} ~%
<nowinter>[[|{{{corpus7}}}]]</nowinter>{{{corpus7}}} ~%
<nowinter>[[|{{{corpus8}}}]]</nowinter>{{{corpus8}}} ~%
<nowinter>[[|{{{corpus9}}}]]</nowinter>{{{corpus9}}} ~%
<nowinter>[[|{{{corpus10}}}]]</nowinter>{{{corpus10}}} ~%

What works? What does not?[edit]

Verbal moods and tenses
Works Minor Problems Absent
Present Progressive (alıram) negative doesn't work well
Imperative Works!
Future indicative (alacağım) 1p and 1s devoicing
Evidential/Past perfect (almışam) Works!
Indefinite future / Aorist (alaram) <t_aor> Works!
Optative present(alam) Works!
Optative past (ala idim) Works!
Necessitative present (almalıyım) Works
Necessitative past (almalı idim) Works
Abilitative(bil-) has to be split
i- copula (idim) Works


Noun Inflection
Works Minor Problems Absent
Cases (n, g, d, acc, abl, loc) Works!
Number (-lAr) Works!
-l Devoicing
-L<A> Works!
-L Works!
-C<A> (makes things like italyanca, inglizce) Works!
Possessives Works!
-ki realized as k Works!

Known problems[edit]

Phonology[edit]

  1. should be replaced by and <k>, and not simply by "q" and "k"
  2. Devoicing should be expanded, adding