User:Ragib06

From Apertium
Revision as of 19:30, 6 April 2011 by Ragib06 (talk | contribs)
Jump to navigation Jump to search

I'm Ragib Ahsan from Bangladesh. I'm currently an undergrad student of Computer Science and Engineering Department in Bangladesh University of Engineering and Technology.

I'm willing to participate in Google Summer of Code 2011 with apertium. And I'm interested in adopting the new Bengali-English language pair.



Apertium Bengali-English

Currently the morphological analyzer is nearly complete with 68% coverage of wiki. The bilingual dictionary needs a lot of entries and finally the transfer system has only a few rules to work with.

Some example outputs are -

I eat rice -> আমি ধান খাই
I love you -> আমি আপনাকে ভালবাসি

You can find a list of tests here

My project goal should be as follows:

  • Completing the monolingual dictionary for Bengali upto a wide coverage (at least 80%) of wiki.
  • Completing the bilingual dictionary with necessary entries.
  • Writing the transfer rules, that will be a challenging part as the two languages are not closely related.
  • Finally, performing tetsvocing to ensure release quality


Preparing Myself

I've downloaded and installed the "apertium-bn-en" pack from the apertium incubator. And I'm really excited playing around with it in my system. I've gone through the Apertium New Language Pair HOWTO already. I tried to have a look at the Apertium Official Documentation. It seems really complex. I'm discussing various issues with the prospective mentors Francis Tyers and Abu Zaher. With their help and some exploring on the apertium-bn-en project I've finally prepared my proposal for This years GSoC. You can find it here.

I also found the paper on Bengali Morphological Analyzer[1] quite interesting. And last but not the list I'm trying to solve some of the challenges given on this project. Check here.