Difference between revisions of "User:Bandrandr/proposal"
Revision as of 12:11, 24 March 2017
Chukchi morphological analyser using HFST
Contacts
Vasilisa Andriyanets
blindedbysunshine@gmail.com
github.com/basilisandr
bas_____ on irc
Moscow (GMT+3)
Synopsis
Chukchi is a language with rich and complicated morphology and incorporation.
So far, morphological parsers based on regular expressions have not been able to handle it properly.
HFST seems to be the solution.
Deliverables
Anticipated result: a morphological analyser for Chukchi that handles
- nouns
- verbs
- incorporation (probably)
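To make the intended behaviour concrete, here is a minimal Python sketch of what "analysing" a word form means. It is not HFST and not the planned implementation: it only imitates the continuation-class layout of a lexc lexicon. The morphemes (`kala`, `tumi`, `-t`, `-k`), the tag names and the `analyse` function are invented placeholders for illustration, not real Chukchi data.

```python
# Toy continuation-class lexicon, loosely modelled on how a lexc file is organised.
# Each entry is (surface string, analysis string, next lexicon).
# All morphemes and tags are invented placeholders, NOT real Chukchi data.
LEXICON = {
    "Root": [("kala", "kala<n>", "NounInfl"), ("tumi", "tumi<v>", "VerbInfl")],
    "NounInfl": [("", "<sg>", "End"), ("t", "<pl>", "End")],
    "VerbInfl": [("k", "<inf>", "End")],
}


def analyse(surface, state="Root"):
    """Return every analysis of `surface` that can be reached from `state`."""
    if state == "End":
        # A path only counts as an analysis if the whole word has been consumed.
        return [""] if surface == "" else []
    results = []
    for form, tag, next_state in LEXICON[state]:
        if surface.startswith(form):
            # Consume the matched morpheme and continue in the next lexicon.
            for rest in analyse(surface[len(form):], next_state):
                results.append(tag + rest)
    return results


print(analyse("kalat"))
print(analyse("tumik"))
```

A real analyser would compile the lexicon and the phonological rules into a single transducer with the HFST tools; the sketch only shows the input/output relation (surface form in, lemma plus tags out) that the deliverables refer to.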
Benefits
If it succeeds, this work would be of great use to linguists investigating Chukchi and an important building block for a corpus of Chukchi.
Timeline
Post-application period
- getting to know HFST better
- improving my skills in building finite-state transducers
Community bonding period
Investigation time:
- getting the whole picture of Chukchi morphology
- planning the architecture
Work period
- Week 1 nouns
- Week 2
- Week 3
- Week 4
Milestone #1 HFST for nouns (and adjectives?)
- Week 5 verbs
- Week 6
- Week 7
- Week 8
Milestone #2 HFST for verbs?
- Week 9
- Week 10
- Week 11
- Week 12 final debugging, writing documentation
Personal information
Skills and Qualifications
4 years of Fundamental and Applied Linguistics (bachelor's degree in linguistics almost completed)
Languages: Russian (native), English (advanced), German (intermediate), Yiddish (intermediate), Norwegian (intermediate), French (elementary)
Programming skills: Python, R, bash
Non-GSoC summer plans
I will be writing my bachelor's thesis until mid-June, so until then I will only be able to spend 10-15 hours per week on the project.
I am also going to a conference on 9-15 July, so I will be able to spend 15-20 hours on the project that week.
Apart from that, I will be able to work full-time, up to 50 hours a week.