Difference between revisions of "Uighur and Turkish"
Jump to navigation
Jump to search
(8 intermediate revisions by 2 users not shown) | |||
Line 2: | Line 2: | ||
== Vowel Harmony == |
== Vowel Harmony == |
||
Vowel harmony can often be ambiguous after the blank ى character, meaning that this calls for new classes, most urgently for nouns but possibly also for other classes. |
Vowel harmony can often be ambiguous after the blank ى character, <s>meaning that this calls for new classes, most urgently for nouns but possibly also for other classes.</s> |
||
Examples: til-lar, éghiz-lar |
Examples: til-lar, éghiz-lar, ish-lar |
||
This is already solved in twol, using the {☫} symbol. |
|||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
== Vowel dropping issues == |
== Vowel dropping issues == |
||
A number of issues, examples will be added. |
A number of issues, examples will be added. |
||
Line 12: | Line 15: | ||
With possessive suffix: |
With possessive suffix: |
||
isim-ismi |
isim-ismi ئىسىم-ئىسمى |
||
orun-orni |
|||
orun-orni ئورۇن-ئورنى |
|||
qisim-qismi |
|||
qisim-qismi قىسىم -قىسمى |
|||
== Orthographic Errors == |
== Orthographic Errors == |
||
Line 21: | Line 26: | ||
== See also == |
== See also == |
||
* [[/Work plan|Work plan]] |
* [[/Work plan|Work plan]] |
||
* [[/Paper|Paper]] |
|||
* [[/GSoC2018_report|GSoC report]] |
Latest revision as of 18:21, 11 August 2018
https://github.com/apertium/apertium-uig-tur
Contents
Vowel Harmony[edit]
Vowel harmony can often be ambiguous after the blank ى character, meaning that this calls for new classes, most urgently for nouns but possibly also for other classes.
Examples: til-lar, éghiz-lar, ish-lar
This is already solved in twol, using the {☫} symbol.
Suffixes after Numbers or Special Characters[edit]
Suffixes after Numbers or characters like " << >> are currently not analyzed at all
Vowel dropping issues[edit]
A number of issues, examples will be added.
With possessive suffix:
isim-ismi ئىسىم-ئىسمى
orun-orni ئورۇن-ئورنى
qisim-qismi قىسىم -قىسمى
Orthographic Errors[edit]
voicing/devoicing mistakes seem to come up frequently with loan words, e.g. with cenubda being written as cenubta