Difference between revisions of "Ideas for Google Summer of Code/Improvements in lexical-selection module"
Line 4:
  ==Tasks==

+ * Do proper processing of tags in all scripts.
− * Make the module process word by word, instead of sentence by sentence.
+ * Remove unused and redundant scripts.
− * Move away from using regular expressions as transitions, to using lemma/tag pairs.
+ * ''more here''

  ==Coding challenge==
Revision as of 11:10, 12 February 2013
Implement a number of optimisations to the lexical selection module. The lexical selection module in Apertium is currently a prototype, and many optimisations could be made to make it faster and more efficient.
Tasks
- Do proper processing of tags in all scripts.
- Remove unused and redundant scripts.
- more here
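As an illustration of the first task, here is a minimal sketch of what tag handling in a script might look like. It assumes the scripts see analyses written in the Apertium stream format, such as `house<n><sg>`, and that "proper processing" means splitting the lemma from its tags rather than matching the whole string. The function name `split_lemma_tags` is hypothetical and not part of the module, and escaped angle brackets and multiword units are not handled.

```python
import re

# Matches one tag such as <n> or <sg>. Assumes the tags contain no
# escaped angle brackets.
TAG_RE = re.compile(r"<([^<>]+)>")


def split_lemma_tags(analysis):
    """Split 'lemma<tag1><tag2>' into ('lemma', ['tag1', 'tag2']).

    Hypothetical helper for illustration only.
    """
    first = analysis.find("<")
    if first == -1:
        # No tags at all: the whole string is the lemma.
        return analysis, []
    return analysis[:first], TAG_RE.findall(analysis[first:])


print(split_lemma_tags("house<n><sg>"))  # ('house', ['n', 'sg'])
```

Working on a lemma and a list of tags instead of the raw string lets a script compare tags individually, whatever their position in the analysis.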
Coding challenge
- Install Apertium and the constraint-based lexical selection module.
- Work through the "Generating lexical-selection rules from a parallel corpus" HOWTO.