Difference between revisions of "User:Ote/proposal"

From Apertium
Jump to navigation Jump to search
(overview)
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
Open Translation Engine (OTE) Version 2.0
 
Open Translation Engine (OTE) Version 2.0
   
  +
The goal of the OTE project is to solve two problems found today within the open source translation community:
Proposal for NLnet[http://www.nlnet.nl/news/2007/20071003-call-en.html]
 
   
  +
== Crowdsourcing ==
= Abstract =
 
   
With this funding, I will produce a web-based system to allow a community of users to cooperatively build translation dictionaries.
+
How can a large and divergent group of users easily build and manage translation dictionaries online?
  +
The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.
   
  +
== Dictionary Unification ==
While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[http://ote.2meta.com] Version 2.0 is to create a robust tool for this space.
 
   
  +
How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)
 
= Use Cases =
 
 
OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.
 
 
School / Language Learning
 
 
Bootstrapping
 
 
= License =
 
 
Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries.
 
 
During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries.
 
 
 
= Project =
 
 
== Core System ==
 
 
Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages.
 
 
Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair.
 
 
Install Procedure - Installation will be made as user-friendy as possible.
 
 
Documentation
 
 
== User System ==
 
 
User accounts
 
 
User Permissions
 
 
User administration
 
 
== Core Tools ==
 
 
Word Viewer
 
 
Dictionary Viewer
 
 
Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium.
 
 
Dictionary Import
 
 
Word-2-Word Translator
 
 
Classroom Tools: Random Word, Flash Cards, Word Lists
 
 
 
== Community Tools ==
 
 
Add / Delete / Modify: Individual Word
 
 
Add / Delete / Modify: Tagging: Parts of Speech for a Word
 
 
Add / Delete / Modify: Translation Word Pairs
 
 
Add / Delete / Modify: Languages
 
 
Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states.
 

Latest revision as of 19:40, 29 November 2007

Open Translation Engine (OTE) Version 2.0

The goal of the OTE project is to solve two problems found today within the open source translation community:

Crowdsourcing[edit]

How can a large and divergent group of users easily build and manage translation dictionaries online? The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.

Dictionary Unification[edit]

How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)