Difference between revisions of "User:Ote/proposal"

From Apertium
Jump to navigation Jump to search
(overview)
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
Open Translation Engine (OTE) Version 2.0
Open Translation Engine (OTE) Version 2.0


The goal of the OTE project is to solve two problems found today within the open source translation community:
Proposal for NLnet[http://www.nlnet.nl/news/2007/20071003-call-en.html]


== Crowdsourcing ==
= Abstract =


With this funding, I will produce a web-based system to allow a community of users to cooperatively build translation dictionaries.
How can a large and divergent group of users easily build and manage translation dictionaries online?
The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.


== Dictionary Unification ==
While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[http://ote.2meta.com] Version 2.0 is to create a robust tool for this space.


How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)

= Use Cases =

OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.

School / Language Learning

Bootstrapping

= License =

Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries.

During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries.


= Project =

== Core System ==

Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages.

Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair.

Install Procedure - Installation will be made as user-friendy as possible.

Documentation

== User System ==

User accounts

User Permissions

User administration

== Core Tools ==

Word Viewer

Dictionary Viewer

Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium.

Dictionary Import

Word-2-Word Translator

Classroom Tools: Random Word, Flash Cards, Word Lists


== Community Tools ==

Add / Delete / Modify: Individual Word

Add / Delete / Modify: Tagging: Parts of Speech for a Word

Add / Delete / Modify: Translation Word Pairs

Add / Delete / Modify: Languages

Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states.

Latest revision as of 19:40, 29 November 2007

Open Translation Engine (OTE) Version 2.0

The goal of the OTE project is to solve two problems found today within the open source translation community:

Crowdsourcing[edit]

How can a large and divergent group of users easily build and manage translation dictionaries online? The OTE will provide a suite of web-based tools to bring people together in the various tasks needed to build a workable translation dictionary.

Dictionary Unification[edit]

How can we unify the numerous open content translation dictionaries available online? The OTE will support the import and export of many formats currently used by active open content/open source translation projects. Special attention will be paid to Apertium and Linguaphile (machine translation projects) and Wiktionary and Omegawiki (Wiki-based dictionaries)