Difference between revisions of "User:Esteth"
(Updated proposal) |
|||
(One intermediate revision by the same user not shown) | |||
Line 3: | Line 3: | ||
Application so far: |
Application so far: |
||
'''Apertium-deslatex: Additional format filters for Translation''' |
|||
⚫ | |||
Adam James Copp |
Adam James Copp |
||
30/03, 2010 |
|||
⚫ | |||
Adam James Copp |
|||
'''E-Mail Address:''' |
|||
E-mail address: |
|||
adam.copp@gmail.com |
adam.copp@gmail.com |
||
Other information that may be useful to contact you: |
'''Other information that may be useful to contact you:''' |
||
⚫ | |||
Mobile Phone: +447799-283-061 |
|||
IRC: Esteth |
|||
⚫ | |||
⚫ | |||
I often find that when attempting to find information online, I have trouble finding a resource that I can understand. Whether I find scientific journals that are too advanced for the topic, or blogs in a foreign language, I inevitably have trouble being able to understand what other people have written. I’d like to get involved with machine translation because it’s about making resources accessible to more people. (more clarification here?) |
|||
⚫ | |||
Of all the projects I was interested in, Apertium had the most immediately helpful, knowledgable community on-hand to help me with my questions. The #Apertium channel on freenode was especially helpful, and I feel at home with the community, rather than it being very much a clique of core developers, and then all these people like me who’re just a hastle. |
|||
⚫ | |||
Title''' |
|||
Apertium-deslatex: New format filters for translation |
|||
⚫ | |||
⚫ | |||
It's incredibly frustrating to me when I'm looking for information online and what I want isn't available, or it's available, but in a form I can't understand. I can only imagine what this is like for those who cannot read english fluently, since much of the web, particularly around the topic of computer programming is written in english. |
|||
Apertium can currently translate a variety of language pairs in a variety of document formats. However, it cannot curerently translate LaTeX files, since it does not know what parts of these documents are translatable. LaTeX is widely used throughout the scientific community as a means of typesetting journal articles and scientific papers. |
|||
⚫ | |||
Of all the projects I was interested in, Apertium had the most helpful, immediately inviting community. I feel that working with the people involved would be pleasant, rather than frustrating. With Apertium, I feel that I'd be part of a community, rather than feeling like an outsider. |
|||
⚫ | |||
⚫ | |||
I'd like to work on format filters. I plan to add new filters for LaTeX and MediaWiki. Hopefully also ReStructuredText files. |
|||
The worldwide scientific community, including researchers, lecturers and students, all consume documents marked up with LaTeX. The availability of an open-source application for translating documents will allow all of these groups to access a wider variety of papers, journals and articles. |
|||
Proposal: |
|||
Work Plan |
|||
Developing additional format filters for Apertium |
|||
Miscellaneous Tasks |
|||
Week plan |
|||
'''List your skills and give evidence of your qualifications''' |
|||
For Apertium to successfully translate documents, it needs to know which parts of the document are translatable text, and which parts of the document are just formatting text (HTML tags, for instance), or else it would try to translate parts of the document which are meant for machine consumption rather than human consumption, and probably render the document unopenable. I propose to develop two new format filters. One for MediaWiki and one for LaTeX. If time permits, I would also like to develop a format filter for ReStructuredText. |
|||
I am confident in my ability to write programs in imperative languages, given my introduction to javascript around 8 years ago, and progressing through visual basic, ruby, java, C# and python. I do not profess to be an expert in any of these languages, but I know enough to write serious programs in them. I have studied java at university for the past 2 years, and ruby in much of my spare time before that. I won the scottish student’s Herriot Watt programming contest in my final year of high school, and my university grades profess that I’m willing to put effort into my endeavors. |
|||
This project deserves sponsorship because it will further the core goal Apertium. That is, to bring existing content to more people. Machine translation for documents in LaTeX and mediaWiki format would allow for journals, articles and wikis to be automatically translated to any language Apertium supports. This will broaden access to a whole host of content that would otherwise be too difficult to translate. |
|||
'''List any non-Summer-of-Code plans you have for the summer''' |
|||
Include a proposal, including |
|||
* a title, |
|||
⚫ | |||
⚫ | |||
* and a detailed work plan (including, if possible, a brief schedule with milestones and deliverables).Include time needed to think, to program, to document and to disseminate. List your skills and give evidence of your qualifications. Tell us what is your current field of study, major, etc. Convince us that you can do the work. In particular we would like to know whether you have programmed before in open-source projects. |
|||
I have no plans for the summer, other than a short vacation, to last approximately one week, though with dates undecided. If I desperately need the time, I can cancel my vacation, though I should be able to put in more time in other weeks to compensate for my absence. |
|||
List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for internships, and class-taking. Be specific about schedules and time commitments. we would like to be sure you have at least 30 free hours a week to develop for our project: |
|||
I have no plans for the summer other than a week's holiday. I should be able to spend 30 free hours a week easily, aside from the week on holiday. If nessecary, it would be possible to compensate for this week by doing a few extra hours in other weeks. |
Latest revision as of 00:16, 2 April 2010
Hi there. I'm Adam Copp, a Computer Science undergraduate fromt the university of St. Andrews, in scotland. You'll find me on the IRC channel under the name Esteth.
Application so far:
Apertium-deslatex: Additional format filters for Translation
Adam James Copp
30/03, 2010
Name:
Adam James Copp
E-Mail Address:
adam.copp@gmail.com
Other information that may be useful to contact you:
Mobile Phone: +447799-283-061
IRC: Esteth
Windows Live Messenger: adam.copp@gmail.com
Why is it you are interested in machine translation?
I often find that when attempting to find information online, I have trouble finding a resource that I can understand. Whether I find scientific journals that are too advanced for the topic, or blogs in a foreign language, I inevitably have trouble being able to understand what other people have written. I’d like to get involved with machine translation because it’s about making resources accessible to more people. (more clarification here?)
Why are you interested in the Apertium project? Of all the projects I was interested in, Apertium had the most immediately helpful, knowledgable community on-hand to help me with my questions. The #Apertium channel on freenode was especially helpful, and I feel at home with the community, rather than it being very much a clique of core developers, and then all these people like me who’re just a hastle.
Which of the published tasks are you interested in? What do you plan to do?
Title
Apertium-deslatex: New format filters for translation
Why Google and Apertium should sponser it
Apertium can currently translate a variety of language pairs in a variety of document formats. However, it cannot curerently translate LaTeX files, since it does not know what parts of these documents are translatable. LaTeX is widely used throughout the scientific community as a means of typesetting journal articles and scientific papers.
How and who will it benefit in society
The worldwide scientific community, including researchers, lecturers and students, all consume documents marked up with LaTeX. The availability of an open-source application for translating documents will allow all of these groups to access a wider variety of papers, journals and articles.
Work Plan Miscellaneous Tasks Week plan
List your skills and give evidence of your qualifications
I am confident in my ability to write programs in imperative languages, given my introduction to javascript around 8 years ago, and progressing through visual basic, ruby, java, C# and python. I do not profess to be an expert in any of these languages, but I know enough to write serious programs in them. I have studied java at university for the past 2 years, and ruby in much of my spare time before that. I won the scottish student’s Herriot Watt programming contest in my final year of high school, and my university grades profess that I’m willing to put effort into my endeavors.
List any non-Summer-of-Code plans you have for the summer
I have no plans for the summer, other than a short vacation, to last approximately one week, though with dates undecided. If I desperately need the time, I can cancel my vacation, though I should be able to put in more time in other weeks to compensate for my absence.