Difference between revisions of "User:Linh le/2014Application"
Jump to navigation
Jump to search
(Blanked the page) |
|||
(34 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
== Contact information<br>== |
|||
'''Name''': Linh Le<br> |
|||
'''Email address''': linh.ai.le@gmail.com<br> |
|||
'''Nick on IRC#apertium''': LinhLe<br> |
|||
'''Sourceforge username''':<br> |
|||
== Why is it that you are interested in learning about machine translation? == |
|||
Born as a Vietnamese in Vietnam, I have learning English since I was a kid, since it has always been deemed as an essential tool for people in my generation. However, I also understand that because learning a language not only takes time, effort and dedication but also a lot of money, and that not everyone has the opportunity and resources to do so. Therefore, in my opinion, machine translation is a useful alternative for people who wish to approach resources in other languages for their careers, entertainment or (life exp). |
|||
== Why are you interested in Apertium project? == |
|||
First of all, since Apertium is an open-source project, everyone can contribute and I think that this system is perfect for language translation. Since language is created by, developed by, affected by, and used by people, it requires human involvement and human understanding. *edit* |
|||
Second, despite having no immediate relations, there are many similarities in the grammar structures of the specific pair of language that I'm thinking about (Vietnamese<->English), and therefore I believe that the rule-based MT Apertium can provide a system () |
|||
== Which of the published tasks are you interested in? What do you plan to do? == |
|||
I'm interested in '''Adopting an unreleased pair''', specifically the pair Vietnamese - English. |
|||
== Proposal == |
|||
==== Adopting unreleased Vietnamese-English pair and bringing it to release quality ==== |
|||
====== Why Google and Apertium should sponsor it ====== |
|||
Although Vietnamese is spoken by around 90 million people in Vietnam, and the 7th most spoken language in the US and the 6th in Australia, there is not much () |
|||
====== Work plan ====== |
|||
Post-application period: |
|||
* Continue working on the coding challenge. |
|||
* Learn more about XML, CG |
|||
Bonding period: |
|||
* Continue working on the coding challenge (if it hasn't been completed). |
|||
* Get familiar: |
|||
** Be on IRC often and get used to the community |
|||
** Read through the already available files for vi-en translation. Read through other available documents |
|||
** Learn more about XML, CG |
|||
* Prepare rough corpora and/or get one that is already available. |
|||
* Learn more about both English and Vietnamese, focusing more on Vietnamese, since English has already available file support. |
|||
* Gather available online and offline dictionary resources |
|||
Week 1 (May 19-25)<br> |
|||
Make rough versions of two morphological dictionaries and the bilingual dictionary <br> |
|||
(Around 3000 stems in each)<br> |
|||
<br> |
|||
Week 2 (May 26- June 1)<br> |