Difference between revisions of "User:Tachyons/GSoC Proposal"

From Apertium
Jump to navigation Jump to search
Line 1: Line 1:
== Personal Information ==
 
 
Name: Aboobacker MK
 
E-mail address: aboobackervyd@gmail.com
 
Wiki : [http://wiki.apertium.org/w/index.php?title=User:Tachyons tachyons]
 
Irc nick : tachyons
 
GIt : [https://github.com/tachyons tachyons]
 
 
== Why is it you are interested in machine translation? ==
 
 
 
I am computer science student,who is tech enthusiast . when I was chatting with friend he used Google translate for translating contents to his language . But there was no Malayalam in Google translate or other MT tools . so I decide to create one ,But immediately I realized that it is not an easy task as it sounds . I googled about it and got article about MT, I learned basics including automata theory and decided to concentrate more on MT. MT plays an important role in information sharing .Using MT We can even read articles written in other language without learning that language
 
 
== Why is it that they are interested in the Apertium project? ==
 
 
 
Because I am a foss enthusiast and interested in machine language translation. I saw the name of Apertium in GSoC organisation list . I quickly went through the wiki and realised that this is the project I was searching for. I introduced myself in mailing list and irc and got positive response and help
 
'''
 
 
== Which of the published tasks are you interested in? What do you plan to do?''' ==
 
 
Apertium Malayalam-English pair
 
 
== Proposal ==
 
 
'''Adopt a language pair: mal-eng(Malayalam to English)'''
 
 
Malayalam is the official language of Kerala (an Indian state) , It spoken by 40 million people around the world . But there is no machine language translator available for this language. My proposal is to create an English Malayalam pair using Apertium platform, It's complex task because structure of English and Malayalam So It is impossible to create complete translation system in 3 months. but I can do many things in 3 months which can help future development
 
 
'''why Google and Apertium should sponsor it,'''
 
 
because there is no existing machine language tools available for this language ,apertium have a solid background in the MT field so it will helpful to realize my dream . If Google approved this proposal I can spend more time for I will get more technical support
 
 
''' How and who it will benefit in society'''
 
 
Machine language translation can play an important role in Information technology to common people, It will help quick translation of documents s,news etc to other language(Especially to mother tongue)more easily.
 
 
'''work plan (including, if possible, a brief schedule with milestones and deliverables).'''
 
 
 
== Work plan==
 
== Work plan==
   
''I was busy with some other works , so did'nt got enough time for familarising apertium, I will restructure the time line after having a completeidea about this platform''
+
''I was busy with some other works , so didn't got enough time for familiarising apertium, I will restructure the time line after having a completeidea about this platform''
   
 
{| class="wikitable" border="1"
 
{| class="wikitable" border="1"
Line 49: Line 10:
 
|-
 
|-
 
| week 1
 
| week 1
| Study of morphology of malayalam
+
| Study of morphology of Malayalam
   
   
Line 69: Line 30:
 
|-
 
|-
 
| week 5
 
| week 5
| Implementation of disambiguation rules for Marathi.
+
| Implementation of disambiguation rules for Malayalam.
 
|-
 
|-
 
| week 6
 
| week 6
| Study transfer rules and start implementaion
+
| Study transfer rules and start implementation
   
   
 
|-
 
|-
 
| week 7
 
| week 7
| Transfer rules implemenation
+
| Transfer rules implementation
   
 
|-
 
|-
Line 113: Line 74:
 
'''List your skills and give evidence of your qualifications. Tell us what is your current field of study, major, etc.'''
 
'''List your skills and give evidence of your qualifications. Tell us what is your current field of study, major, etc.'''
   
I am a computer science student ,and interested in foss . I can code in python,c,c++,c# ,java,php ,javascript and little objective c and Google Go.Malyalam is my mother tongue and english was my second language at school , I am familiar with both languages
+
I am a computer science student ,and interested in foss . I can code in python,c,c++,c# ,java,php ,javascript and little objective c and Google Go. Malyalam is my mother tongue and English was my second language at school , I am familiar with both languages
   
 
'''Convince us that you can do the work. In particular we would like to know whether you have programmed before in open-source projects.'''
 
'''Convince us that you can do the work. In particular we would like to know whether you have programmed before in open-source projects.'''
 
 
I am new to the open source world so currently my contributions are very less :-(, Check my github profile for my existing open source projects
+
I am new to the open source world so currently my contributions are very less :-(, Check my github [https://github.com/tachyons] profile for my existing open source projects . I am already familiar with apertium ecosystem , current status of my work is available here [https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-mal-eng]
   
 
'''List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for
 
'''List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for
 
internships, and class-taking. Be specific about schedules and time commitments.'''
 
internships, and class-taking. Be specific about schedules and time commitments.'''
   
 
I can spend ~30 hours per week for this task, I am not doing any internships right now .I am a freelance programmer But I will pause it if I got selected for GSoC
I am a student and my class will start on July 2nd week , but it is possible to spend 30hour per week
 
7*2 (weekend) + 5*3 ( on other days) =29 hours , I am not doing any internships right now .I am a freelance programmer But I will pause it if I got selected for GSoC
 
 
== '''Coding challenge''' ==
 
 
 
* Install Apertium (see Minimal installation from SVN) (done)
 
* Go through the HOW TO (done)
 
* Go through the MT course here (not completed )
 
* Write a translator that translates as much of this story as possible — Minimum one sentence. (doing).
 
* Upload your work to Apertium SVN. (https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-mal-eng under development)
 
 
== '''My non-Summer-of-Code plans for the Summer''' ==
 
 
I don't have any other plan during this period
 
 
[[Category:GSoC 2014 Student proposals|tachyons]]
 

Revision as of 12:09, 16 March 2014

Work plan

I was busy with some other works , so didn't got enough time for familiarising apertium, I will restructure the time line after having a completeidea about this platform

week plan
week 1 Study of morphology of Malayalam


week 2 Create a monolingual dictionary with 100 most frequently used words for english and malayalam , Start Bilingual dictionary
week 3 Improve bilingual dictionary , Add more words to mono lingual dictionary
week 4 Testing and debugging , Documentation
Deliverable # 1 Dictionary with clear documentation ,
week 5 Implementation of disambiguation rules for Malayalam.
week 6 Study transfer rules and start implementation


week 7 Transfer rules implementation
Deliverable # 2
week 8
week 9 Testvoc


week 10 Testvoc


week 11 Debugging transfer rules and CG.
week 12 Cleanup and dissemination



List your skills and give evidence of your qualifications. Tell us what is your current field of study, major, etc.

I am a computer science student ,and interested in foss . I can code in python,c,c++,c# ,java,php ,javascript and little objective c and Google Go. Malyalam is my mother tongue and English was my second language at school , I am familiar with both languages

Convince us that you can do the work. In particular we would like to know whether you have programmed before in open-source projects.

I am new to the open source world so currently my contributions are very less :-(, Check my github [1] profile for my existing open source projects . I am already familiar with apertium ecosystem , current status of my work is available here [2]

List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for internships, and class-taking. Be specific about schedules and time commitments.

I can spend ~30 hours per week for this task, I am not doing any internships right now .I am a freelance programmer But I will pause it if I got selected for GSoC