User:Mary.szmary/proposal
Contents
- 1 Contact information
- 2 Why is it you are interested in machine translation?
- 3 Why is it that you are interested in the Apertium project?
- 4 Which of the published tasks are you interested in? What do you plan to do?
- 5 List your skills and give evidence of your qualifications
- 6 List any non-Summer-of-Code plans you have for the Summer
Contact information
Name: Maria Sheyanova
E-mail: masha.shejanova@gmail.com
IRC: mary-szmary
SourceForge: maryszmary
Phone number: +79169223114
Why is it you are interested in machine translation?
On the one hand, I'm a student of linguistic faculty, so working with language material and understanding more about language structure while contributing to machine translation systems is one of my primary interests as a linguist. On the other hand, I also have a great interest in coding and I'd like to use any opportunity to learn to code better or learn new things concerning programming. Working with machine translation seems to ideally fit both interests.
Why is it that you are interested in the Apertium project?
Apertium does rule-based machine translation. On the contrast to, it seems, more widespread nowadays corpus-based translation, it requires working with language structure, which it's attracts me as a linguist. It is also very attractive for me that Apertium supports the minority languages, because working with them is much more interesting that with the big ones.
Which of the published tasks are you interested in? What do you plan to do?
Title
Adopt Polish -> Russian language pair
Reasons why Google and Apertium should sponsor it
Currently the pol-rus language pair is in the beginning state (in the incubator). There are very few words in the bilingual dictionary and no rules. The pol dictionary also does not have enough words. My goal is to fill the dictionaries, make as many rules as I'll be able to and bring it near to the release quality.
A description of how and who it will benefit in society
The result of this work is going to be a free and open source Polish-Russian translation system.
Work plan
community bonding period
- Getting closer with Apertium, its systems and tools - Finding and studying the sources of language data that can be used (already found: ...) - Bringing my knowledge of bash to a better state
work period
- week 1: adding the basic lexics to the bilingual dictionary
- week 2:
- week 3
- week 4
- week 5
- week 6
- week 7
- week 8
- week 9
- week 10
- week 11
- week 12
- week 13
List your skills and give evidence of your qualifications
I'm a 3rd year bachelor student of Linguistic Faculty in NRU HSE[1] (Russia). Languages: Russian (native), Polish, English, Toki Pona :), German, basic knowledge of Indonesian. Programming skills: Python (both 2nd and 3rd), R, basic knowledge of bash. Other computer skills: HTML, XML, CSS.
List any non-Summer-of-Code plans you have for the Summer
I have exams till 3rd-4th weeks of June so I won't be able to work full-time at this period, but I can spend 20-25 hours per week on the task. After the end of exams I'm going to visit my parents for some 4-5 days and also would be able to spend only 25-30 hours per week on the task. After that I'm ready to work full time and spend up to 45-50 hours on the task.