Difference between revisions of "User:Jmcejuela/GSoC11Application"
Line 1: | Line 1: | ||
I am a Master Computer Science student at Technical University of Munich (TUM), currently in my fourth-last semester and about to start my Master Thesis. As I [https://sourceforge.net/mailarchive/message.php?msg_id=27324802 announced in the mailing list] my intention is to combine into the same endeavor both my thesis and the GSoC project. Having a solid background in transducers and their mathematical foundations, for my project I want to work extensively on transducers and this is my highest motivator. Coming from a more training/learning world, being Apertium rule-based, and also considering that my thesis should expand the work of the GSoC project to comply with a master thesis's higher effort/academic requirements (exactly 6 months at TUM), for my project I expand and elaborate further on an idea discussed with [[Jimregan]] on the use of transducers in replacement of flag diacritics, as used in hfst, to do the same work, and include a part for automatic topology learning to generate such transducers. Furthermore, I suggest my own idea, which involves mostly topology learning and weight training using one of the corpus you list in your corpora page, the [http://xixona.dlsi.ua.es/~fran/setimes/ Southeast European Times] for considering it particularly interesting due to its aligned structure for multiple languages. |
I am a Master Computer Science student at Technical University of Munich (TUM), currently in my fourth-last semester and about to start my Master Thesis. As I [https://sourceforge.net/mailarchive/message.php?msg_id=27324802 announced in the mailing list] my intention is to combine into the same endeavor both my thesis and the GSoC project. Having a solid background in transducers and their mathematical foundations, for my project I want to work extensively on transducers and this is my highest motivator. Coming from a more training/learning world, being Apertium rule-based, and also considering that my thesis should expand the work of the GSoC project to comply with a master thesis's higher effort/academic requirements (exactly 6 months at TUM), for my project I expand and elaborate further on an idea discussed with [User:[Jimregan]] on the use of transducers in replacement of flag diacritics, as used in hfst, to do the same work, and include a part for automatic topology learning to generate such transducers. Furthermore, I suggest my own idea, which involves mostly topology learning and weight training using one of the corpus you list in your corpora page, the [http://xixona.dlsi.ua.es/~fran/setimes/ Southeast European Times] for considering it particularly interesting due to its aligned structure for multiple languages. |
||
* Name: Juan Miguel Cejuela |
* Name: Juan Miguel Cejuela |
Revision as of 07:11, 8 April 2011
I am a Master Computer Science student at Technical University of Munich (TUM), currently in my fourth-last semester and about to start my Master Thesis. As I announced in the mailing list my intention is to combine into the same endeavor both my thesis and the GSoC project. Having a solid background in transducers and their mathematical foundations, for my project I want to work extensively on transducers and this is my highest motivator. Coming from a more training/learning world, being Apertium rule-based, and also considering that my thesis should expand the work of the GSoC project to comply with a master thesis's higher effort/academic requirements (exactly 6 months at TUM), for my project I expand and elaborate further on an idea discussed with [User:[Jimregan]] on the use of transducers in replacement of flag diacritics, as used in hfst, to do the same work, and include a part for automatic topology learning to generate such transducers. Furthermore, I suggest my own idea, which involves mostly topology learning and weight training using one of the corpus you list in your corpora page, the Southeast European Times for considering it particularly interesting due to its aligned structure for multiple languages.
- Name: Juan Miguel Cejuela
- Email: juanmi@jmcejuela.com
- Citizenship: Spanish, European Union
- irc, skype, twitter, ...: jmcejuela
Why is it you are interested in machine translation? Why is it that they are interested in the Apertium project? Which of the published tasks are you interested in? What do you plan to do? Include a proposal, including
* a title, * reasons why Google and Apertium should sponsor it, * a description of how and who it will benefit in society, * and a detailed work plan (including, if possible, a brief schedule with milestones and deliverables).
Include time needed to think, to program, to document and to disseminate. List your skills and give evidence of your qualifications. Tell us what is your current field of study, major, etc. Convince us that you can do the work. In particular we would like to know whether you have programmed before in open-source projects. List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for internships, and class-taking. Be specific about schedules and time commitments. we would like to be sure you have at least 30 free hours a week to develop for our project.