User:Firespeaker/Apertium-turkic talk outline

From Apertium
< User:Firespeaker
Revision as of 15:32, 28 September 2012 by Firespeaker (talk | contribs) (Created page with '== Morphological transducers: what and why == * slide 1: definition, example (sample input/output) * slide 2: use in RBMT, specifically apertium * slide 3: other uses: spell chec…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Morphological transducers: what and why

  • slide 1: definition, example (sample input/output)
  • slide 2: use in RBMT, specifically apertium
  • slide 3: other uses: spell checkers, ...?

Turkic languages

Geographical/demographic overview of Turkic languages

  • slides 4, 5?
    • a map, numbers of speakers, wikipedia presence

Morphological and phonological properties encountered in Turkic languages

  • slide 5: Agglutination
  • slide 6: Vowel harmony
  • slide 7: Consonantal processes
  • slide 8: "buffer" segments
  • slide 9: Cyrillic orthographical issues
  • something on morpho-syntactic issues that've come up a lot? E.g.,
    • Adjective classes (e.g., whether used as <attr>/<subst>/<advl>, +comparative, etc.)
    • Non-finite verb forms
    •  ?

Developing a morphological transducer

  • Important resources to start with:
    • a corpus
    • some grammars and dictionaries
    • linguistic knowledge of the language
    • native speakers!
      • ability to work with informants
      • patience!

HFST and how we use it

  • slide: HFST: what and who
  • slide: our purposes: using two two-level systems together for a three-level system (?):
    • slide: overview of lexc
    • slide: overview of twol

Examples: how morphophonological issues above are dealt with

  • bing
  • bang
  • bam

State of affairs now with apertium-turkic