Ideas for Google Summer of Code/Improvements in lexical-selection module
< Ideas for Google Summer of Code
Jump to navigation
Jump to search
Revision as of 11:07, 12 February 2013 by Francis Tyers (talk | contribs) (moved Ideas for Google Summer of Code/Performance optimisation of lexical-selection module to Ideas for Google Summer of Code/Improvements in lexical-selection module)
Implement a number of optimisations to the lexical selection module. The lexical selection module in Apertium is currently a prototype. There are many optimisations that could be made to make it more efficient and faster.
Tasks
- Make the module process word by word, instead of sentence by sentence.
- Move away from using regular expressions as transitions, to using lemma/tag pairs.
Coding challenge
- Install Apertium and the constraint-based lexical selection module
- Run through the Generating lexical-selection rules from a parallel corpus HOWTO.