Difference between revisions of "User:Ote/proposal"
(overview) |
|||
(6 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
Open Translation Engine (OTE) Version 2.0 |
Open Translation Engine (OTE) Version 2.0 |
||
The goal of the OTE project is to solve two problems found today within the open source translation community: |
|||
Proposal for NLnet[http://www.nlnet.nl/news/2007/20071003-call-en.html] |
|||
== Crowdsourcing == |
|||
= Abstract = |
|||
How can a large and divergent group of users easily build and manage translation dictionaries online? |
|||
The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary. |
|||
== Dictionary Unification == |
|||
While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[http://ote.2meta.com] Version 2.0 is to create a robust tool for this space. |
|||
How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries) |
|||
= Use Cases = |
|||
How will the OTE be used? |
|||
== By the Linguistic == |
|||
In the field of Machine Translation, I have chosen the Apertium project as the first external format to support. |
|||
Apertium is a robust project with many active particpants. But there is a lack of tools to allow multiple developers to create the translation dictionaries needed to run Apertium. |
|||
Currently Apertium dictionaries are modified by direct editing of multiple XML files. While these XML files are versioned with Subversion, this is not an ideal solution for true community involvement. |
|||
The OTE will be enhanced to first support tagging of Parts of Speech (Noun, Verb, Adjective, etc). This is an important first step in order to support the second step: Dictionary Export in the proper Apertium XML formats. |
|||
== By the Student == |
|||
School / Language Learning |
|||
== By the Developer == |
|||
A primary goal of this project is to create tools that are easily useable with other Open Source projects. |
|||
Bootstrapping |
|||
= License = |
|||
Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries. |
|||
During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries. |
|||
= Project = |
|||
== Core System == |
|||
Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages. |
|||
Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair. |
|||
Install Procedure - Installation will be made as user-friendy as possible. |
|||
Documentation |
|||
== User System == |
|||
User accounts |
|||
User Permissions |
|||
User administration |
|||
== Core Tools == |
|||
Word Viewer |
|||
Dictionary Viewer |
|||
Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium. |
|||
Dictionary Import |
|||
Word-2-Word Translator |
|||
Classroom Tools: Random Word, Flash Cards, Word Lists |
|||
== Community Tools == |
|||
Add / Delete / Modify: Individual Word |
|||
Add / Delete / Modify: Tagging: Parts of Speech for a Word |
|||
Add / Delete / Modify: Translation Word Pairs |
|||
Add / Delete / Modify: Languages |
|||
Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states. |
|||
= Misc.... = |
|||
Comparison of OTE to current 'translation memory' systems, such as the many gettext/.po file editor / aggregators. |
Latest revision as of 19:40, 29 November 2007
Open Translation Engine (OTE) Version 2.0
The goal of the OTE project is to solve two problems found today within the open source translation community:
Crowdsourcing[edit]
How can a large and divergent group of users easily build and manage translation dictionaries online? The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.
Dictionary Unification[edit]
How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)