User:Bandrandr/proposal

From Apertium
Jump to navigation Jump to search

Chukchi morphological analyser using HFST

Contacts

Vasilisa Andriyanets
blindedbysunshine@gmail.com
github.com/basilisandr
bas_____ on irc
Moscow (GMT+3)

Synopsis

Chukchi is a language with rich and complicated morphology and incorporation.
By now morphological parsers using regular expressions were not able to handle it properly
HFST seems to be the solution

Deliverables

Anticipated result: morphological analyser for Chukchi that handles

  • nouns
  • verbs
  • incorporation (probably)

Benefits

The result of this work, if it succeeds, would be of great use for linguists investigating Chukchi and an important brick for building a corpus of Chukchi.

Timeline

Post-application period

Getting to know HFST better,
improve skills in building finite-state transducers

Community bonding period

Investigation time:

  • getting the whole picture of Chukchi morphology
  • planning the architecture

Work period

  • Week 1 nouns
  • Week 2
  • Week 3
  • Week 4

Milestone #1 HFST for nouns (and adjectives?)

  • Week 5 verbs
  • Week 6
  • Week 7
  • Week 8

Milestone #2 HFST for verbs?

  • Week 9
  • Week 10
  • Week 11
  • Week 12 final debugging, writing documentation

Personal information

Skills and Qualifications

4 years of Fundamental and applied linguistics
Programming skills: Python, R, bash

Non-GSoC summer plans

I am going to write my bachelor thesis by middle June, so I will only be able to spend 10-15 hours per week.
I am also going for a conference on 9-15 July, so I will be able to spend 15-20 hours for the project that week.
Apart from that, I am going to work full-time up to 50 hours a week.