Difference between revisions of "Ideas for Google Summer of Code/Improvements in lexical-selection module"

From Apertium
Jump to navigation Jump to search
 
Line 8: Line 8:
 
* Remove unused and redundant scripts.
 
* Remove unused and redundant scripts.
 
* Work on a way to trim non-significant features from the maximum-entropy models.
 
* Work on a way to trim non-significant features from the maximum-entropy models.
* Rewrite the <code>LRXProcessor::processME</code> and <code>LRXProcessor::process</code> methods so that they share more code and are more modularised. Having a 650 line method is not something I'm proud of ;__;
+
* Rewrite the <code>LRXProcessor::processME</code> and <code>LRXProcessor::process</code> methods so that they share more code and are more modularised. Having a 650 line method is not something I ([[User:Francis Tyers|Francis Tyers]]) am proud of ;__;
 
* Make sure that capitalisation, any tag and any character work as expected.
 
* Make sure that capitalisation, any tag and any character work as expected.
 
* ''more here''
 
* ''more here''

Latest revision as of 18:06, 22 March 2013

Implement a number of optimisations to the lexical selection module. The lexical selection module in Apertium is currently a prototype. There are many optimisations that could be made to make it more efficient and faster, and easier to install and use.

Tasks[edit]

  • Script/program for finding possibly missing bidix entries from an aligned parallel corpus.
  • Do proper processing of tags in all scripts.
  • Remove unused and redundant scripts.
  • Work on a way to trim non-significant features from the maximum-entropy models.
  • Rewrite the LRXProcessor::processME and LRXProcessor::process methods so that they share more code and are more modularised. Having a 650 line method is not something I (Francis Tyers) am proud of ;__;
  • Make sure that capitalisation, any tag and any character work as expected.
  • more here

Coding challenge[edit]

Frequently asked questions[edit]

  • none yet, ask us something! :)

See also[edit]