User:Deltamachine/proposal2018
Contents
- 1 Contact information
- 2 Skills and experience
- 3 Why is it you are interested in machine translation?
- 4 Why is it that you are interested in Apertium?
- 5 Which of the published tasks are you interested in? What do you plan to do?
- 6 Reasons why Google and Apertium should sponsor it
- 7 A description of how and who it will benefit in society
- 8 Work plan
- 9 Non-Summer-of-Code plans you have for the Summer
- 10 Coding challenge
Contact information
Name: Anna Kondrateva
Location: Moscow, Russia
E-mail: an-an-kondratjeva@yandex.ru
Phone number: +79250374221
IRC: deltamachine
SourceForge: deltamachine
Timezone: UTC+3
Skills and experience
I am a third-year bachelor student of Linguistics Faculty in National Research University «Higher School of Economics» (NRU HSE)
Main university courses:
- Programming (Python, R)
- Computer Tools for Linguistic Research
- Theory of Language (Phonetics, Morphology, Syntax, Semantics)
- Language Diversity and Typology
- Machine Learning
- Math (Discrete Math, Linear Algebra and Calculus, Probability Theory and Mathematical Statistics)
- Theory of Algorithms
- Databases
Technical skills:
- Programming languages: Python, R, Javascript
- Web design: HTML, CSS
- Frameworks: Flask, Django
- Databases: SQLite, PostgreSQL, MySQL
Projects and experience: http://github.com/deltamachine
Languages: Russian (native), English, German
Why is it you are interested in machine translation?
Why is it that you are interested in Apertium?
I have participated in GSoC 2017 with Apertium and it was a great experience. I have successfully finished my project, learned a lot of new things and had a lot of fun. Also I have participated in GCI 2017 as a mentor for Apertium and it was great too. So I am very interested in contributing to Apertium.
Apertium community is very friendly and open to new members, people here are always ready to help you. Also this organisation works on things which are very interesting for me as a computational linguist: (rule-based) machine translation, minority languages, NLP and so on.
Which of the published tasks are you interested in? What do you plan to do?
I would like to work on improving language pairs by mining MediaWiki Content Translation postedits.
Reasons why Google and Apertium should sponsor it
A description of how and who it will benefit in society
Work plan
Post application period
Community bonding period
Work period
- Week 1:
- Week 2:
- Week 3:
- Week 4:
- Deliverable #1, June 26 - 30
- Week 5:
- Week 6:
- Week 7:
- Week 8:
- Deliverable #2, July 24 - 28
- Week 9:
- Week 10:
- Week 11: testing, fixing bugs
- Week 12: cleaning up the code, writing documentation
- Project completed:
Part 1, weeks 1-4:
Part 2, weeks 5-8:
Part 3, weeks 9-12:
Also I am going to write short notes about work process on the page of my project during the whole summer.
Non-Summer-of-Code plans you have for the Summer
I have exams at the university until the third week of June, so I will be able to work only 20-25 hours per week. But since I am already familiar with Apertium system I can work on the project during the community bonding period. After exams I will be able to work full time and spend 45-50 hours per week on the task.