User:Sl33k/Application

Revision as of 08:17, 6 April 2011 by Sl33k (talk | contribs)

Name: Ahsan Bagwan

IRC: sl33k_

Email: ahsanbagwan@gmail.com

Skype: ahsanbagwan


Why is it you are interested in machine translation?

I am impressed by what machine translation tools and techniques have achieved for most Indo-European languages. Relatively little work in this sub-field of computational linguistics has been done on the languages of the Indian subcontinent, but recent trends suggest that this picture is likely to change[1]. As a student interested in natural language processing, I see MT as a great platform for working closely with linguistics and with the interaction between natural languages and computer languages.


Why is it that you are interested in the Apertium project?

Apertium fascinated me when I first came across it, largely because of its appeal as a great open source MT engine and because, even at first glance, its tools convert the linguistic data in a fairly comprehensive way. It couples this with some of the easiest-to-follow documentation. As I was looking to gain detailed knowledge of how it works, the documentation walked me through some basic concepts and linked to extra resources on the wiki, which has given me a good base to build upon.

Which of the published tasks are you interested in? What do you plan to do?

Apertium-ur-hi: Adopting the Urdu-Hindi language pair.

Why should Google and Apertium sponsor it? How and who will it benefit in society?

Some initial work has already been done on the Urdu and Hindi morphological analysers for the ur-hi language pair. Currently, there is no stable Indo-Aryan language pair in the repository. Making a stable language pair with release-quality results would trigger the development of pairs between Hindi and other related languages of the subcontinent.


Work Plan:

Week 1:

Week 2:

Week 3:

Week 4:

Deliverable #1:

Week 5:

Week 6:

Week 7:

Week 8:

Deliverable #2:

Week 9:

Week 10:

Week 11:

Week 12:

Deliverable #3:


Bio


I am a final-year undergraduate student of Information Technology at Sinhagad Academy of Engineering (India). We had a course that included finite-state transducers, along with computer laboratory assignments on XML and C++.

Experience

Programming languages: Python, C, Java. Limited experience with C++.

Markup languages: HTML, XML, reST


Notes


[1] http://wiki.apertium.org/wiki/Contributing_to_an_existing_pair

http://secure.wikimedia.org/wikipedia/en/wiki/Hindi-Urdu_grammar

Use of Machine Translation in India: Current Status http://www.mt-archive.info/MTS-2005-Naskar-2.pdf