Difference between revisions of "Turkic lexicon"

From Apertium
Jump to navigation Jump to search
Line 9: Line 9:
 
* The file will be laid out in the following order:
 
* The file will be laid out in the following order:
 
*# The multicharacter symbols
 
*# The multicharacter symbols
  +
*# The <code>Root</code> lexicon, pointing to the stem lexicons
 
*# The morphotactics (continuation lexica)
 
*# The morphotactics (continuation lexica)
 
*# The stem lexicons
 
*# The stem lexicons
Line 18: Line 19:
 
===Morphotactics===
 
===Morphotactics===
   
===Stem lexicon===
+
====Naming continuation lexica====
  +
  +
===Stem lexicons===
  +
  +
   
 
==Categorisation==
 
==Categorisation==

Revision as of 03:26, 20 April 2012

Some notes on how to go about making a Turkic lexicon for use in Apertium.

Layout

General points:

  • The lexicon will be made in one file, it will have the suffix .lexc
  • The file will be laid out in the following order:
    1. The multicharacter symbols
    2. The Root lexicon, pointing to the stem lexicons
    3. The morphotactics (continuation lexica)
    4. The stem lexicons

Multicharacter symbols

Morphotactics

Naming continuation lexica

Stem lexicons

Categorisation

Nominals

Finite verbs

Non-finite verbs

Language specific issues