User:Ote/proposal

From Apertium
< User:Ote
Revision as of 11:57, 16 November 2007 by Ote (talk | contribs)
Jump to navigation Jump to search

Open Translation Engine (OTE) Version 2.0

Proposal for NLnet[1]

Abstract

With this funding, I will produce a web-based system to allow a community of users to cooperatively build translation dictionaries.

While there are many Open Source projects in the field of Machine Translation, there is a lack of tools for the community creation of translation dictionaries. The goal of the Open Translation Engine (OTE)[2] Version 2.0 is to create a robust tool for this space.


Use Cases

OTE Version 2.0 will provide additional tools to allow the system to be used as a dictionary building and management tool for Apertium.

School / Language Learning

Bootstrapping

License

Currently the OTE is under the BSD License. This includes both the source code and the translation dictionaries.

During this project, I will re-evaluate which Open Source license is the best choice, with particular attention to possible differences in needs between the source code and translation dictionaries.


Project

Core System

Conversion to Unicode - The current prototype is not unicode aware. Unicode is an absolute necessity to continue the project with all possible word languages.

Genericization of code base - The OTE will be built to be as generic as possible, thus allowing for ease of future enhancements. Currently the prototype code is hard coded with the Dutch/English language pair.

Install Procedure - Installation will be made as user-friendy as possible.

Documentation

User System

User accounts

User Permissions

User administration

Core Tools

Word Viewer

Dictionary Viewer

Dictionary Export: Export of Apertium XML dictionaries. This will allow OTE to be used as a dictionary building and management tool for Apertium.

Dictionary Import

Word-2-Word Translator

Classroom Tools: Random Word, Flash Cards, Word Lists


Community Tools

Add / Delete / Modify: Individual Word

Add / Delete / Modify: Tagging: Parts of Speech for a Word

Add / Delete / Modify: Translation Word Pairs

Add / Delete / Modify: Languages

Versioning - all words, translation pairs, and tags are versioned, allowing reversion to previous states.