Difference between revisions of "Turkic lexicon"

Revision as of 03:38, 20 April 2012

General points:

The lexicon will be made in one file, it will have the suffix .lexc
The file will be laid out in the following order:
1. The multicharacter symbols
2. The Root lexicon, pointing to the stem lexicons
3. The morphotactics (continuation lexica)
4. The stem lexicons

The list of symbols should be laid out in the following order:

Every symbol should have a comment. The comments should line up.

Continuation lexica will be named in upper case, and may contain letters, numbers and the symbol -.
- Examples: LEXICON N1, LEXICON DET-DEM, LEXICON ADV

Lines in the stem lexicons should follow the following pattern:

@@ Line 33: / Line 33: @@
 ===Stem lexicons===
+Lines in the stem lexicons should follow the following pattern:
+* Left side (lexical form)
+* Colon <code>:</code>
+* Right side (surface form)
+* Continuation lexicon
+* Semicolon <code>;</code>
+* Space <code> </code>
+* Exclamation mark
+* Open quote <code>"</code>
+* Gloss (optional)
+* Close quote <code>"</code>
 ==Categorisation==