Difference between revisions of "User:Ote/proposal"

From Apertium
Jump to navigation Jump to search
Line 5: Line 5:
 
= Abstract =
 
= Abstract =
   
I propose the enhancement of the Open Translation Engine[http://ote.2meta.com] project. With this funding, I will produce a 2.0 version that allows a community of users to work together in building translation dictionaries. The system will be built to be as generic as possible, thus allowing for ease of future enhancements.
+
With this funding, I will produce a web-based system to allow a community of users to cooperatively build translation dictionaries.
   
  +
While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[http://ote.2meta.com] Version 2.0 is to create a robust tool for this space.
As a first real world use case, OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.
 
  +
  +
  +
= Use Cases =
  +
 
OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.
  +
  +
School / Language Learning
  +
  +
Bootstrapping
  +
  +
= License =
  +
  +
Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries.
  +
  +
During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries.
   
   
Line 14: Line 29:
 
== Core System ==
 
== Core System ==
   
  +
Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages.
Conversion to Unicode
 
   
  +
Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair.
Genericization of code base
 
   
Install Procedure
+
Install Procedure - Installation will be made as user-friendy as possible.
   
Documentation
+
Documentation
   
 
== User System ==
 
== User System ==
   
 
User accounts
 
User accounts
  +
  +
User Permissions
   
 
User administration
 
User administration
   
== Tools ==
+
== Core Tools ==
   
 
Word Viewer
 
Word Viewer
Line 47: Line 64:
 
Add / Delete / Modify: Individual Word
 
Add / Delete / Modify: Individual Word
   
Add / Delete / Modify: Parts of Speech for a Word
+
Add / Delete / Modify: Tagging: Parts of Speech for a Word
   
 
Add / Delete / Modify: Translation Word Pairs
 
Add / Delete / Modify: Translation Word Pairs
Line 53: Line 70:
 
Add / Delete / Modify: Languages
 
Add / Delete / Modify: Languages
   
Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states
+
Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states.

Revision as of 11:57, 16 November 2007

Open Translation Engine (OTE) Version 2.0

Proposal for NLnet[1]

Abstract

With this funding, I will produce a web-based system to allow a community of users to cooperatively build translation dictionaries.

While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[2] Version 2.0 is to create a robust tool for this space.


Use Cases

OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.

School / Language Learning

Bootstrapping

License

Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries.

During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries.


Project

Core System

Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages.

Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair.

Install Procedure - Installation will be made as user-friendy as possible.

Documentation

User System

User accounts

User Permissions

User administration

Core Tools

Word Viewer

Dictionary Viewer

Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium.

Dictionary Import

Word-2-Word Translator

Classroom Tools: Random Word, Flash Cards, Word Lists


Community Tools

Add / Delete / Modify: Individual Word

Add / Delete / Modify: Tagging: Parts of Speech for a Word

Add / Delete / Modify: Translation Word Pairs

Add / Delete / Modify: Languages

Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states.