Курсы машинного перевода для языков России/Session 1

From Apertium
Revision as of 16:35, 18 December 2011 by Francis Tyers

This session has two objectives. The first is to give an overview of the theory of morphology: how words are inflected and how new words are formed. The second is to demonstrate how the analysis and generation of morphology are dealt with in Apertium.

Theory

The theory section is split into three subsections. The first deals with morphotactics, that is, how morphemes (parts of words) occur and are joined together. The second gives some details of morphophonology, or how morphemes change as a result of being joined together. The final subsection gives a theoretical description of how this is treated computationally.

Morphotactics

The morphotactics of a language is the way that morphemes in that language are joined together to form words. Morphemes are the smallest units of meaning, and they can be free or bound: free if they can occur on their own, bound if they must be attached to another morpheme. A single morpheme may have several allomorphs, which mean the same thing but are written or spoken differently. For example, the dative case (used to indicate movement towards something) in Chuvash has several allomorphs, and the choice between them depends on the vowel quality of the stem to which the suffix attaches.

ача·м·а ачама "to my child"
ача·м·сен·е ачамсене "to my children"
уӗҫ·ӗм·е уӗҫӗме "to my street"
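
The allomorph choice in the three examples above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not how Apertium handles it: the function names are hypothetical, only consonant-final stems like those above are covered, and the rule used (the last vowel of the stem decides between -а and -е) is a simplification that happens to fit these examples.

```python
import unicodedata

# Assumed vowel classes for this sketch (normalised so that letters with
# diacritics are always single code points).
BACK_VOWELS = set(unicodedata.normalize("NFC", "аӑуы"))
FRONT_VOWELS = set(unicodedata.normalize("NFC", "еэӗӳи"))


def dative_allomorph(stem: str) -> str:
    """Pick -а or -е according to the last vowel of the stem (simplified rule)."""
    stem = unicodedata.normalize("NFC", stem.lower())
    for ch in reversed(stem):
        if ch in BACK_VOWELS:
            return "а"
        if ch in FRONT_VOWELS:
            return "е"
    raise ValueError(f"no vowel found in stem: {stem!r}")


def attach_dative(morphemes: list[str]) -> str:
    """Join the morphemes into a stem and append the matching dative allomorph."""
    stem = unicodedata.normalize("NFC", "".join(morphemes))
    return stem + dative_allomorph(stem)


if __name__ == "__main__":
    # The three segmented stems from the examples above, without the case suffix.
    for parts in (["ача", "м"], ["ача", "м", "сен"], ["уӗҫ", "ӗм"]):
        print("·".join(parts), "->", attach_dative(parts))
```

Here the same dative morpheme surfaces as -а after a stem whose last vowel is back and as -е after one whose last vowel is front, which is the sense in which the two forms are allomorphs.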

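Morphemes can be further divided into two subtypes: inflectional and derivational. Inflectional morphemes express grammatical categories such as case or number, while derivational morphemes form new words. In the two examples above, signifies a derivational boundary, and + signifies an inflectional boundary.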

TODO; something about derivation here