Difference between revisions of "User:Linh le/2014Application"

From Apertium
Jump to navigation Jump to search
(Created page with "Contact information<br> Name: Linh Le<br> Email address: linh.ai.le@gmail.com<br> Nick on IRC#apertium: LinhLe<br> Sourceforge username:<br>")
 
Line 1: Line 1:
Contact information<br>
+
== Contact information<br>==
Name: Linh Le<br>
+
'''Name''': Linh Le<br>
Email address: linh.ai.le@gmail.com<br>
+
'''Email address''': linh.ai.le@gmail.com<br>
Nick on IRC#apertium: LinhLe<br>
+
'''Nick on IRC#apertium''': LinhLe<br>
Sourceforge username:<br>
+
'''Sourceforge username''':<br>
  +
  +
== Why is it that you are interested in learning about machine translation? ==
  +
Born as a Vietnamese in Vietnam, I have learning English since I was a kid, since it has always been deemed as an essential tool for people in my generation. However, I also understand that because learning a language not only takes time, effort and dedication but also a lot of money, and that not everyone has the opportunity and resources to do so. Therefore, in my opinion, machine translation is a useful alternative for people who wish to approach resources in other languages for their careers, entertainment or (life exp).
  +
  +
== Why are you interested in Apertium project? ==
  +
First of all, since Apertium is an open-source project, everyone can contribute and I think that this system is perfect for language translation. Since language is created by, developed by, affected by, and used by people, it requires human involvement and human understanding. *edit*
  +
Second, despite having no immediate relations, there are many similarities in the grammar structures of the specific pair of language that I'm thinking about (Vietnamese<->English), and therefore I believe that the rule-based MT Apertium can provide a system ()
  +
  +
== Which of the published tasks are you interested in? What do you plan to do? ==
  +
I'm interested in '''Adopting an unreleased pair''', specifically the pair Vietnamese - English.
  +
  +
== Proposal ==
  +
==== Adopting unreleased Vietnamese-English pair and bringing it to release quality ====
  +
====== Why Google and Apertium should sponsor it ======
  +
Although Vietnamese is spoken by around 90 million people in Vietnam, and the 7th most spoken language in the US and the 6th in Australia, there is not much ()
  +
  +
====== Work plan ======
  +
Post-application period:
  +
* Continue working on the coding challenge.
  +
* Learn more about XML, CG
  +
  +
Bonding period:
  +
* Continue working on the coding challenge (if it hasn't been completed).
  +
* Get familiar:
  +
** Be on IRC often and get used to the community
  +
** Read through the already available files for vi-en translation. Read through other available documents
  +
** Learn more about XML, CG
  +
* Prepare rough corpora and/or get one that is already available.
  +
* Learn more about both English and Vietnamese, focusing more on Vietnamese, since English has already available file support.
  +
* Gather available online and offline dictionary resources
  +
  +
Week 1 (May 19-25)<br>
  +
Make rough versions of two morphological dictionaries and the bilingual dictionary <br>
  +
(Around 3000 stems in each)<br>
  +
<br>
  +
Week 2 (May 26- June 1)<br>

Revision as of 04:40, 21 March 2014

Contact information

Name: Linh Le
Email address: linh.ai.le@gmail.com
Nick on IRC#apertium: LinhLe
Sourceforge username:

Why is it that you are interested in learning about machine translation?

Born as a Vietnamese in Vietnam, I have learning English since I was a kid, since it has always been deemed as an essential tool for people in my generation. However, I also understand that because learning a language not only takes time, effort and dedication but also a lot of money, and that not everyone has the opportunity and resources to do so. Therefore, in my opinion, machine translation is a useful alternative for people who wish to approach resources in other languages for their careers, entertainment or (life exp).

Why are you interested in Apertium project?

First of all, since Apertium is an open-source project, everyone can contribute and I think that this system is perfect for language translation. Since language is created by, developed by, affected by, and used by people, it requires human involvement and human understanding. *edit* Second, despite having no immediate relations, there are many similarities in the grammar structures of the specific pair of language that I'm thinking about (Vietnamese<->English), and therefore I believe that the rule-based MT Apertium can provide a system ()

Which of the published tasks are you interested in? What do you plan to do?

I'm interested in Adopting an unreleased pair, specifically the pair Vietnamese - English.

Proposal

Adopting unreleased Vietnamese-English pair and bringing it to release quality

Why Google and Apertium should sponsor it

Although Vietnamese is spoken by around 90 million people in Vietnam, and the 7th most spoken language in the US and the 6th in Australia, there is not much ()

Work plan

Post-application period:

  • Continue working on the coding challenge.
  • Learn more about XML, CG

Bonding period:

  • Continue working on the coding challenge (if it hasn't been completed).
  • Get familiar:
    • Be on IRC often and get used to the community
    • Read through the already available files for vi-en translation. Read through other available documents
    • Learn more about XML, CG
  • Prepare rough corpora and/or get one that is already available.
  • Learn more about both English and Vietnamese, focusing more on Vietnamese, since English has already available file support.
  • Gather available online and offline dictionary resources

Week 1 (May 19-25)
Make rough versions of two morphological dictionaries and the bilingual dictionary
(Around 3000 stems in each)

Week 2 (May 26- June 1)