Difference between revisions of "Lexical selection"
Line 17: | Line 17: | ||
** [[Как начать работу с правилами по выбору лексики]] |
** [[Как начать работу с правилами по выбору лексики]] |
||
== The slr/srl approach ( |
== The slr/srl approach (2010-2012) == |
||
This uses a special [[Constraint Grammar]] (CG) file which runs _after_ regular morphological disambiguation, but _before_ bidix. |
|||
Could someone from sme-nob please explain? |
|||
The CG rules add a number to the lemma of the word if we want a non-default translation, so '''^ahte<CC>$''' might turn into '''^ahte:1<CC>$''' |
|||
== Transfer rule approach (2009) == |
== Transfer rule approach (2009) == |
Revision as of 09:31, 7 September 2012
Lexical selection is the task of choosing, given several source-language (SL) translations with the same part-of-speech (POS), the most adequate translation among them in the target language (TL). The task is related to the task of word-sense disambiguation. The difference is that its aim is to find the most adequate translation, not the most adequate sense. Thus, it is not necessary to choose between a series of fine-grained senses if all these senses result in the same final translation.
This page has some links to pages about lexical selection in Apertium.
General information:
Current lexical selection module (2012)
This is made by [Francis Tyers] an is deployed in XX-XX language pair where you can see an example.
- Rule-based lexical selection module
- Generating lexical-selection rules from a parallel corpus
- How to get started with lexical selection rules
The slr/srl approach (2010-2012)
This uses a special Constraint Grammar (CG) file which runs _after_ regular morphological disambiguation, but _before_ bidix.
The CG rules add a number to the lemma of the word if we want a non-default translation, so ^ahte<CC>$ might turn into ^ahte:1<CC>$
Transfer rule approach (2009)
You can make transfer rules that does lexical selection. Its not very elegant but it works, to a degree. The drawback is that you:
- get big transfer files
- mix transfer and lexical selection
- must write rules
This is the method used in most pairs.