Turkish and Kyrgyz/Kymorph article

From Apertium
< Turkish and Kyrgyz
Revision as of 18:32, 6 October 2011 by Firespeaker (talk | contribs) (→‎Numbers: another ? source)
Jump to navigation Jump to search

Outline

General background

Similar articles

Morphotactica

  • Irregular negatives of many verb forms

Morphophonologia

  • /рн/ nouns

Corpora

  • Which corpora to use?
  • concerns
    • Wikipedia is messy; should we have an automated cleaning process or get stats as-is?
      • Use aq-wikicrp, this way it is reproducible .

Numbers

size of corpora
wikipedia azattyk 2010 all azattyk
num articles 1531(?, ?) 9803
num words 271005
xml file size >3.8MB