Difference between revisions of "User:Edgeandpearl/proposal"

From Apertium
Jump to navigation Jump to search
Line 6: Line 6:
Moscow (GMT+3)<br />
Moscow (GMT+3)<br />


==Why? ==
==Why is it you are interested in machine translation? ==

===Why machine translation? ===


I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work.
I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work.


==Why is it that you are interested in Apertium?==
===Why Apertium?===


I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.<br />
I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.<br />
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium.
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium.


==The task==
==Which of the published tasks are you interested in? ==

===Which of the published tasks are you interested in? ===


Adopting Faroese -> Norwegian (Bokmål) pair.
Adopting Faroese -> Norwegian (Bokmål) pair.


==Why Google and Apertium should sponsor it?==
===Why Google and Apertium should sponsor it?===


As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever.
As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever.


==Work plan==
===Work plan===


===Post-application period===
====Post-application period====


* diving into Apertium documentation and manuals,<br />
* diving into Apertium documentation and manuals,<br />
Line 31: Line 35:
* working on the coding challenge.<br />
* working on the coding challenge.<br />


===Community bonding period===
====Community bonding period====


* exploring and evaluating the available resources for Faroese.
* exploring and evaluating the available resources for Faroese.


===Work period===
====Work period====


/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/
/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/
Line 42: Line 46:


By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.<br />
By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.<br />
Languages: Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).<br />
'''Languages:''' Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).<br />
Programming skills: Python, R, bash.<br />
'''Programming skills:''' Python, R, bash.<br />
Other computer skills: HTML+CSS.<br />
'''Other computer skills:''' HTML+CSS.<br />


==Non-GSoC summer plans==
==Non-GSoC summer plans==

Revision as of 18:07, 15 March 2017

Contacts

Marina Kustova
marinakoustova@gmail.com
Github, SourceForge, IRC: edgeandpearl
Moscow (GMT+3)

Why?

Why machine translation?

I am now finishing my bachelor’s degree in theoretical and computational linguistics at HSE, Moscow. Though most of my study was about the former aspect of the specialization, I am very interested in the latter as well. Machine translation appears to me, on the one hand, an exciting topic for research and, on the other hand, a very useful and perspective line of work.

Why Apertium?

I first learned about Apertium from a friend of mine, who participated in GSoC last year. Even though I haven’t worked with rule-based machine translation before, I was fascinated by the possibility of contributing to a project like that.
What impresses me most is that Apertium works with minority languages. Apart from that, the mechanism of rule-based machine translation implies that, when adopting a language pair, one has to deal with language structure. As a linguist, these factors make me extremely interested in working for Apertium.

The task

Which of the published tasks are you interested in?

Adopting Faroese -> Norwegian (Bokmål) pair.

Why Google and Apertium should sponsor it?

As a result of my work, a free open source Faroese-Norwegian translator will be brought into existence. It is going to be the first machine translation system for these two languages ever.

Work plan

Post-application period

  • diving into Apertium documentation and manuals,
  • improving my knowledge of Faroese and Norwegian,
  • working on the coding challenge.

Community bonding period

  • exploring and evaluating the available resources for Faroese.

Work period

/will be possible to write in detail after getting done with the coding challenge and learning more about the field of work/

Skills and qualifications

By summer 2017 I will have graduated with a bachelor’s degree of theoretical and applied linguistics at NRU HSE, Moscow.
Languages: Russian (native), English (advanced), French (intermediate), Mandarin (elementary), Norwegian Bokmål (elementary).
Programming skills: Python, R, bash.
Other computer skills: HTML+CSS.

Non-GSoC summer plans

I’m defending my thesis in June, so in the very first week I will probably be unable to spend more than 10-15 hours on the task.
I’m also going to go visit some friends of mine in late June or July, right after the first or the second evaluation. That week I will probably work for about 20-25 hours.
For the rest of the time, I plan to work full-time, up to 50 hours a week.