Turkish and Kyrgyz/Kymorph article

From Apertium
< Turkish and Kyrgyz
Revision as of 16:50, 13 October 2011 by Firespeaker (talk | contribs) (Undo revision 28838 by Firespeaker (Talk))
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Outline[edit]

General background[edit]

Similar articles[edit]

Morphotactica[edit]

  • Irregular negatives of many verb forms

Morphophonologia[edit]

  • /рн/ nouns

Corpora[edit]

  • Which corpora to use?
  • concerns
    • Wikipedia is messy; should we have an automated cleaning process or get stats as-is?
      • Use aq-wikicrp, this way it is reproducible .

Numbers[edit]

size of corpora
wikipedia azattyk 2010 all azattyk
num articles 1531(?, ?) 9803 (6627?)
num words 271005 3394686
xml file size 3.8MB 49MB