Difference between revisions of "Ideas for Google Summer of Code/Improvements in lexical-selection module"

Revision as of 21:09, 18 March 2013

Script/program for finding possibly missing bidix entries from an aligned parallel corpus.
Do proper processing of tags in all scripts.
Remove unused and redundant scripts.
Work on a way to trim non-significant features from the maximum-entropy models.
Rewrite the LRXProcessor::processME and LRXProcessor::process methods so that they share more code and are more modularised. Having a 650 line method is not something I'm proud of ;__;
Make sure that capitalisation, any tag and any character work as expected.
more here

@@ Line 9: / Line 9: @@
 * Work on a way to trim non-significant features from the maximum-entropy models.
 * Rewrite the <code>LRXProcessor::processME</code> and <code>LRXProcessor::process</code> methods so that they share more code and are more modularised. Having a 650 line method is not something I'm proud of ;__;
+* Make sure that capitalisation, any tag and any character work as expected.
 * ''more here''