Chukchi morphological analyser using HFST
bas_____ on irc
Chukchi is a language with rich and complicated morphology and incorporation.
By now morphological parsers using regular expressions were not able to handle it properly
HFST seems to be the solution
Anticipated result: morphological analyser for Chukchi that handles
- incorporation (probably)
The result of this work, if it succeeds, would be of great use for linguists investigating Chukchi and an important brick for building a corpus of Chukchi.
- Getting to know HFST better
- improve skills in building finite-state transducers
Community bonding period
- getting the whole picture of Chukchi morphology
- planning the architecture
- Week 1 nouns
- Week 2
- Week 3
- Week 4
Milestone #1 HFST for nouns (and adjectives?)
- Week 5 verbs
- Week 6
- Week 7
- Week 8
Milestone #2 HFST for verbs?
- Week 9
- Week 10
- Week 11
- Week 12 final debugging, writing documentation
Skills and Qualifications
4 years of Fundamental and applied linguistics, (almost completed Bachelor degree in linguistics)
Languages: Russian (native), English (advanced), German (intermediate), Yiddish (intermediate), Norwegian (intermediate), French (elementary) Programming skills: Python, R, bash
Non-GSoC summer plans
I am going to write my bachelor thesis by middle June, so I will only be able to spend 10-15 hours per week.
I am also going for a conference on 9-15 July, so I will be able to spend 15-20 hours for the project that week.
Apart from that, I am going to work full-time up to 50 hours a week.