Difference between revisions of "Generating lexical-selection rules from a parallel corpus"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
+ | {{TOCD}} |
||
If you have a parallel corpus, one of the things you can do is generate some lexical selection rules from it, to improve translation of words with more than one possible translation. |
If you have a parallel corpus, one of the things you can do is generate some lexical selection rules from it, to improve translation of words with more than one possible translation. |
||
Revision as of 11:44, 17 January 2012
If you have a parallel corpus, one of the things you can do is generate some lexical selection rules from it, to improve translation of words with more than one possible translation.
You will need
Here is a list of software that you will need installed:
- Giza++ (or some other word aligner)
- Moses (for making Giza++ less human hostile)
- All the Moses scripts
- lttoolbox
- Apertium
- apertium-lex-tools
Furthermore you'll need:
- an Apertium language pair
- a parallel corpus
Getting started
Given that you've got all the stuff installed, the work will be as follows: