Difference between revisions of "User:Bandrandr/proposal"

From Apertium

Revision as of 12:11, 24 March 2017

Chukchi morphological analyser using HFST


Contacts

Vasilisa Andriyanets
blindedbysunshine@gmail.com
github.com/basilisandr
bas_____ on irc
Moscow (GMT+3)

Synopsis

Chukchi is a language with rich, complex morphology and noun incorporation.
So far, morphological parsers based on regular expressions have not been able to handle it properly.
HFST seems to be a suitable solution.

Deliverables

Anticipated result: a morphological analyser for Chukchi that handles

  • nouns
  • verbs
  • incorporation (probably)
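
To make the deliverable concrete, here is a minimal sketch of what a morphological analyser does: it maps a surface word form to a lemma plus grammatical tags. It is plain Python, not HFST, and only imitates the stem-plus-continuation-class layout of a lexc lexicon. Every stem, suffix and tag in it is an invented placeholder rather than real Chukchi data, and the function name `analyse` is hypothetical.

```python
# Toy lexicon in the spirit of a lexc continuation-class grammar.
# All stems, suffixes and tags are invented placeholders, NOT real Chukchi.
STEMS = {"kora": "N", "tala": "N", "pini": "V"}

# For each part of speech: which suffixes may follow a stem, and their tags.
SUFFIXES = {
    "N": {"": "<sg>", "t": "<pl>"},
    "V": {"": "<inf>", "k": "<1sg>"},
}


def analyse(surface):
    """Return every lemma+tag reading of `surface` licensed by the toy lexicon."""
    readings = []
    for stem, pos in STEMS.items():
        if surface.startswith(stem):
            rest = surface[len(stem):]
            tag = SUFFIXES[pos].get(rest)
            if tag is not None:
                readings.append(f"{stem}<{pos.lower()}>{tag}")
    return sorted(readings)


print(analyse("korat"))  # ['kora<n><pl>']
```

A real analyser would compile such a lexicon into a finite-state transducer and add phonological rules. Incorporation is harder than this sketch suggests, because it requires stems to combine with other stems and not only with suffixes.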

Benefits

If the work succeeds, the analyser would be of great use to linguists investigating Chukchi and an important building block for a Chukchi corpus.

Timeline

Post-application period

  • getting to know HFST better
  • improving skills in building finite-state transducers

Community bonding period

Investigation time:

  • getting the whole picture of Chukchi morphology
  • planning the architecture

Work period

  • Week 1 nouns
  • Week 2
  • Week 3
  • Week 4

Milestone #1 HFST for nouns (and adjectives?)

  • Week 5 verbs
  • Week 6
  • Week 7
  • Week 8

Milestone #2 HFST for verbs?

  • Week 9
  • Week 10
  • Week 11
  • Week 12 final debugging, writing documentation

Personal information

Skills and Qualifications

4 years of study in Fundamental and Applied Linguistics (bachelor's degree in linguistics almost completed)
Languages: Russian (native), English (advanced), German (intermediate), Yiddish (intermediate), Norwegian (intermediate), French (elementary)
Programming skills: Python, R, bash

Non-GSoC summer plans

I will be writing my bachelor's thesis until mid-June, so until then I will only be able to spend 10-15 hours per week on the project.
I am also attending a conference on 9-15 July, so I will be able to spend 15-20 hours on the project that week.
Apart from that, I will work full-time, up to 50 hours a week.