The SVN url has been changed by SourceForge. In your repository root, or in an individual module, run the command:
svn switch --relocate https://apertium.svn.sourceforge.net/svnroot/apertium/ https://svn.code.sf.net/p/apertium/svn/

Ideas for Google Summer of Code/Monolingual and bilingual data decoupling

From Apertium

Jump to: navigation, search

Contents

Develop a method (scripts) to allow monolingual and bilingual data in Apertium to be decoupled, leaving each language pair with only the necessary bilingual data.

At the moment, Apertium has a separate module for each language pair. Each pair is self-contained, with a copy of both the monolingual data (e.g. POS tagger probabilities and monolingual dictionaries) and bilingual data (e.g. transfer rules and dictionaries). The method should be tested with es-ca, es-pt and pt-ca. After decoupling, all pairs should pass testvoc.

Tasks

  • Edit the lt-comp compiler to add a mode for compiling analysers that checks a bilingual dictionary to see if the current prefix is shared.
Optional
  • Edit each of the stages of Apertium that come after the POS tagger to accept input with original language surface form.

Coding challenge

Frequently asked questions

Previous GSOC projects

See also

Personal tools