Difference between revisions of "User:Ote/proposal"
(overview) |
|||
(6 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
Open Translation Engine (OTE) Version 2.0 |
Open Translation Engine (OTE) Version 2.0 |
||
+ | The goal of the OTE project is to solve two problems found today within the open source translation community: |
||
− | Proposal for NLnet[http://www.nlnet.nl/news/2007/20071003-call-en.html] |
||
+ | == Crowdsourcing == |
||
− | = Abstract = |
||
− | + | How can a large and divergent group of users easily build and manage translation dictionaries online? |
|
+ | The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary. |
||
+ | == Dictionary Unification == |
||
− | While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[http://ote.2meta.com] Version 2.0 is to create a robust tool for this space. |
||
+ | How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries) |
||
− | |||
− | = Use Cases = |
||
− | |||
− | How will the OTE be used? |
||
− | |||
− | == By the Linguistic == |
||
− | |||
− | In the field of Machine Translation, I have chosen the Apertium project as the first external format to support. |
||
− | |||
− | Apertium is a robust project with many active particpants. But there is a lack of tools to allow multiple developers to create the translation dictionaries needed to run Apertium. |
||
− | |||
− | Currently Apertium dictionaries are modified by direct editing of multiple XML files. While these XML files are versioned with Subversion, this is not an ideal solution for true community involvement. |
||
− | |||
− | The OTE will be enhanced to first support tagging of Parts of Speech (Noun, Verb, Adjective, etc). This is an important first step in order to support the second step: Dictionary Export in the proper Apertium XML formats. |
||
− | |||
− | |||
− | == By the Student == |
||
− | |||
− | School / Language Learning |
||
− | |||
− | == By the Developer == |
||
− | |||
− | A primary goal of this project is to create tools that are easily useable with other Open Source projects. |
||
− | |||
− | Bootstrapping |
||
− | |||
− | = License = |
||
− | |||
− | Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries. |
||
− | |||
− | During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries. |
||
− | |||
− | |||
− | = Project = |
||
− | |||
− | == Core System == |
||
− | |||
− | Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages. |
||
− | |||
− | Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair. |
||
− | |||
− | Install Procedure - Installation will be made as user-friendy as possible. |
||
− | |||
− | Documentation |
||
− | |||
− | == User System == |
||
− | |||
− | User accounts |
||
− | |||
− | User Permissions |
||
− | |||
− | User administration |
||
− | |||
− | == Core Tools == |
||
− | |||
− | Word Viewer |
||
− | |||
− | Dictionary Viewer |
||
− | |||
− | Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium. |
||
− | |||
− | Dictionary Import |
||
− | |||
− | Word-2-Word Translator |
||
− | |||
− | Classroom Tools: Random Word, Flash Cards, Word Lists |
||
− | |||
− | |||
− | == Community Tools == |
||
− | |||
− | Add / Delete / Modify: Individual Word |
||
− | |||
− | Add / Delete / Modify: Tagging: Parts of Speech for a Word |
||
− | |||
− | Add / Delete / Modify: Translation Word Pairs |
||
− | |||
− | Add / Delete / Modify: Languages |
||
− | |||
− | Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states. |
||
− | |||
− | |||
− | = Misc.... = |
||
− | |||
− | Comparison of OTE to current 'translation memory' systems, such as the many gettext/.po file editor / aggregators. |
Latest revision as of 19:40, 29 November 2007
Open Translation Engine (OTE) Version 2.0
The goal of the OTE project is to solve two problems found today within the open source translation community:
Crowdsourcing[edit]
How can a large and divergent group of users easily build and manage translation dictionaries online? The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.
Dictionary Unification[edit]
How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)