Difference between revisions of "User:Mary.szmary/proposal"

From Apertium
Jump to navigation Jump to search
Line 21: Line 21:
 
== Which of the published tasks are you interested in? What do you plan to do? ==
 
== Which of the published tasks are you interested in? What do you plan to do? ==
   
'''Title'''
+
===Title===
   
 
Adopt ''Polish -> Russian'' language pair
 
Adopt ''Polish -> Russian'' language pair
   
'''Reasons why Google and Apertium should sponsor it'''
+
===Reasons why Google and Apertium should sponsor it===
   
 
Currently the pol-rus language pair is in the beginning state (in the incubator). There are very few words in the bilingual dictionary and no rules. The pol dictionary also does not have enough words. My goal is to fill the dictionaries, make as many rules as I'll be able to and bring it near to the release quality.
 
Currently the pol-rus language pair is in the beginning state (in the incubator). There are very few words in the bilingual dictionary and no rules. The pol dictionary also does not have enough words. My goal is to fill the dictionaries, make as many rules as I'll be able to and bring it near to the release quality.
   
'''A description of how and who it will benefit in society'''
+
===A description of how and who it will benefit in society===
   
 
The result of this work is going to be a free and open source Polish-Russian translation system.
 
The result of this work is going to be a free and open source Polish-Russian translation system.
   
'''Work plan'''
+
===Work plan===
   
 
''community bonding period''
 
''community bonding period''
Line 43: Line 43:
 
''work period''
 
''work period''
   
  +
* week 1: adding the basic lexics to the bilingual dictionary
-- week 1
 
-- week 2
+
* week 2:
 
* week 3
  +
* week 4
  +
* week 5
  +
* week 6
  +
* week 7
  +
* week 8
  +
* week 9
  +
* week 10
  +
* week 11
  +
* week 12
  +
* week 13
  +
  +
  +
== List your skills and give evidence of your qualifications ==
  +
  +
I'm a 3rd year bachelor student of Linguistic Faculty in NRU HSE[https://www.hse.ru/en/] (Russia).
  +
'''Languages:''' Russian (native), Polish, English, Toki Pona :), German, basic knowledge of Indonesian.
  +
'''Programming skills:''' Python (both 2nd and 3rd), R, basic knowledge of bash.
  +
'''Other computer skills''': HTML, XML, CSS.
  +
  +
  +
== List any non-Summer-of-Code plans you have for the Summer ==
  +
  +
I have exams till 3rd-4th weeks of June so I won't be able to work full-time at this period, but I can spend 20-25 hours per week on the task. After the end of exams I'm going to visit my parents for some 4-5 days and also would be able to spend only 25-30 hours per week on the task. After that I'm ready to work full time and spend up to 45-50 hours on the task.

Revision as of 03:51, 24 March 2016

Contact information

Name: Maria Sheyanova
E-mail: masha.shejanova@gmail.com
IRC: mary-szmary
SourceForge: maryszmary
Phone number: +79169223114

Why is it you are interested in machine translation?

On the one hand, I'm a student of linguistic faculty, so working with language material and understanding more about language structure while contributing to machine translation systems is one of my primary interests as a linguist. On the other hand, I also have a great interest in coding and I'd like to use any opportunity to learn to code better or learn new things concerning programming. Working with machine translation seems to ideally fit both interests.


Why is it that you are interested in the Apertium project?

Apertium does rule-based machine translation. On the contrast to, it seems, more widespread nowadays corpus-based translation, it requires working with language structure, which it's attracts me as a linguist. It is also very attractive for me that Apertium supports the minority languages, because working with them is much more interesting that with the big ones.


Which of the published tasks are you interested in? What do you plan to do?

Title

Adopt Polish -> Russian language pair

Reasons why Google and Apertium should sponsor it

Currently the pol-rus language pair is in the beginning state (in the incubator). There are very few words in the bilingual dictionary and no rules. The pol dictionary also does not have enough words. My goal is to fill the dictionaries, make as many rules as I'll be able to and bring it near to the release quality.

A description of how and who it will benefit in society

The result of this work is going to be a free and open source Polish-Russian translation system.

Work plan

community bonding period

- Getting closer with Apertium, its systems and tools - Finding and studying the sources of language data that can be used (already found: ...) - Bringing my knowledge of bash to a better state

work period

  • week 1: adding the basic lexics to the bilingual dictionary
  • week 2:
  • week 3
  • week 4
  • week 5
  • week 6
  • week 7
  • week 8
  • week 9
  • week 10
  • week 11
  • week 12
  • week 13


List your skills and give evidence of your qualifications

I'm a 3rd year bachelor student of Linguistic Faculty in NRU HSE[1] (Russia). Languages: Russian (native), Polish, English, Toki Pona :), German, basic knowledge of Indonesian. Programming skills: Python (both 2nd and 3rd), R, basic knowledge of bash. Other computer skills: HTML, XML, CSS.


List any non-Summer-of-Code plans you have for the Summer

I have exams till 3rd-4th weeks of June so I won't be able to work full-time at this period, but I can spend 20-25 hours per week on the task. After the end of exams I'm going to visit my parents for some 4-5 days and also would be able to spend only 25-30 hours per week on the task. After that I'm ready to work full time and spend up to 45-50 hours on the task.