Difference between revisions of "User:Edgeandpearl/proposal"
Edgeandpearl (talk | contribs) |
Edgeandpearl (talk | contribs) |
||
Line 6: | Line 6: | ||
Moscow (GMT+3)<br /> |
Moscow (GMT+3)<br /> |
||
==Why? == |
|||
⚫ | |||
⚫ | |||
I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work. |
I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work. |
||
==Why |
===Why Apertium?=== |
||
I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.<br /> |
I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.<br /> |
||
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium. |
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium. |
||
==The task== |
|||
⚫ | |||
⚫ | |||
Adopting Faroese -> Norwegian (Bokmål) pair. |
Adopting Faroese -> Norwegian (Bokmål) pair. |
||
==Why Google and Apertium should sponsor it?== |
===Why Google and Apertium should sponsor it?=== |
||
As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever. |
As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever. |
||
==Work plan== |
===Work plan=== |
||
===Post-application period=== |
====Post-application period==== |
||
* diving into Apertium documentation and manuals,<br /> |
* diving into Apertium documentation and manuals,<br /> |
||
Line 31: | Line 35: | ||
* working on the coding challenge.<br /> |
* working on the coding challenge.<br /> |
||
===Community bonding period=== |
====Community bonding period==== |
||
* exploring and evaluating the available resources for Faroese. |
* exploring and evaluating the available resources for Faroese. |
||
===Work period=== |
====Work period==== |
||
/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/ |
/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/ |
||
Line 42: | Line 46: | ||
By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.<br /> |
By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.<br /> |
||
Languages: Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).<br /> |
'''Languages:''' Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).<br /> |
||
Programming skills: Python, R, bash.<br /> |
'''Programming skills:''' Python, R, bash.<br /> |
||
Other computer skills: HTML+CSS.<br /> |
'''Other computer skills:''' HTML+CSS.<br /> |
||
==Non-GSoC summer plans== |
==Non-GSoC summer plans== |
Revision as of 18:07, 15 March 2017
Contacts
Marina Kustova
marinakoustova@gmail.com
Github, SourceForge, IRC: edgeandpearl
Moscow (GMT+3)
Why?
Why machine translation?
I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work.
Why Apertium?
I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium.
The task
Which of the published tasks are you interested in?
Adopting Faroese -> Norwegian (Bokmål) pair.
Why Google and Apertium should sponsor it?
As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever.
Work plan
Post-application period
- diving into Apertium documentation and manuals,
- improving my knowledge of Faroese and Norwegian,
- working on the coding challenge.
Community bonding period
- exploring and evaluating the available resources for Faroese.
Work period
/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/
Skills and qualifications
By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.
Languages: Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).
Programming skills: Python, R, bash.
Other computer skills: HTML+CSS.
Non-GSoC summer plans
I’m defending my thesis in June, so in the very first week I will probably be unable to spend more than 10-15 hours on the task.
I’m also going to go visit some friends of mine in late June or July, right after the first or the second evaluation. That week I will probably work for about 20-25 hours.
For the rest of the time, I plan to work full-time, up to 50 hours a week.