Ideas for Google Summer of Code/Improvements in lexical-selection module
Revision as of 16:48, 4 March 2012 by Francis Tyers
Implement a number of optimisations to the lexical selection module. The lexical selection module in Apertium is currently a prototype, and there are many optimisations that could make it faster and more efficient.
Tasks
- Make the module process word by word, instead of sentence by sentence.
- Move away from using regular expressions as transitions, to using lemma/tag pairs.
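The two tasks above can be illustrated with a rough sketch. It is not the module's actual code: it assumes a simplified form of the Apertium stream format (lexical units written as ^source/target1/target2$, tags in angle brackets, no escaped characters), and the names Pattern, read_units, split_lemma_tags and matches are hypothetical. The reader yields one lexical unit at a time rather than buffering a whole sentence, and the matcher compares a lemma/tag pair directly instead of compiling a regular expression.

```python
import io
from typing import Iterator, NamedTuple, Optional, TextIO, Tuple


class Pattern(NamedTuple):
    """Hypothetical rule pattern: a lemma plus a tag prefix."""
    lemma: Optional[str]    # None matches any lemma
    tags: Tuple[str, ...]   # tags the unit's tag sequence must start with


def read_units(stream: TextIO) -> Iterator[str]:
    """Yield the text between ^ and $ one lexical unit at a time.

    Assumption: escaped characters are not handled in this sketch.
    """
    buf = []
    inside = False
    while True:
        ch = stream.read(1)
        if not ch:              # end of input
            break
        if ch == "^":
            inside = True
            buf = []
        elif ch == "$" and inside:
            inside = False
            yield "".join(buf)
        elif inside:
            buf.append(ch)


def split_lemma_tags(reading: str) -> Tuple[str, Tuple[str, ...]]:
    """Split 'bank<n><sg>' into ('bank', ('n', 'sg'))."""
    lemma, _, rest = reading.partition("<")
    tags = tuple(rest[:-1].split("><")) if rest else ()
    return lemma, tags


def matches(pattern: Pattern, lemma: str, tags: Tuple[str, ...]) -> bool:
    """Compare lemma and tag prefix directly, with no regular expression."""
    lemma_ok = pattern.lemma is None or pattern.lemma == lemma
    return lemma_ok and tags[: len(pattern.tags)] == pattern.tags


if __name__ == "__main__":
    text = io.StringIO("^bank<n><sg>/banco<n><m><sg>$ ^the<det>/el<det>$")
    rule = Pattern("bank", ("n",))
    for unit in read_units(text):
        source = unit.split("/")[0]     # first reading is the source side
        lemma, tags = split_lemma_tags(source)
        print(lemma, tags, matches(rule, lemma, tags))
```

Because read_units is a generator, a caller can act on each word as soon as it arrives. How much surrounding context a real rule needs before it can fire is a design question this sketch leaves open.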
Coding challenge
- Install Apertium and the constraint-based lexical selection module.
- Work through the "Generating lexical-selection rules from a parallel corpus" HOWTO.