Difference between revisions of "Multiwords"
Jump to navigation
Jump to search
Line 33: | Line 33: | ||
::This would leave us with: ^zračna luka<n><f><gen><pl>$ (basically an orthographic paradigm). |
::This would leave us with: ^zračna luka<n><f><gen><pl>$ (basically an orthographic paradigm). |
||
* Have more than one entry per multi-word — this is done in <code>apertium-es-ca</code>, see "dirección general", "direcciones generales". (bad) |
* Have more than one entry per multi-word — this is done in <code>apertium-es-ca</code>, see "dirección general", "direcciones generales". (bad) |
||
* Have a parameterised paradigm, that when called one way outputs a paradigm with symbols, and another way outputs a paradigm without symbols. |
Revision as of 12:29, 21 November 2007
Its possible to have pretty complex multiword combinations.
<e lm="zračna luka"> <i>zračn</i> <par n="zračn/a__adj"/> <p> <l><b/>luk</l> <r><g><b/>luk</g></r> </p> <par n="stolic/a__n"/> </e>
$ echo "zračna luka" | lt-proc sh-mk.automorf.bin ^zračna luka/zračna<adj><f><sg><nom># luka<n><f><gen><pl>/zračna<adj><f><sg><nom># luka<n><f><nom><sg>$ $ echo "zračna luka" | lt-proc sh-mk.automorf.bin | apertium-tagger -g sh-mk.prob ^zračna<adj><f><sg><nom># luka<n><f><gen><pl>$ $ echo "zračna luka" | lt-proc sh-mk.automorf.bin | apertium-tagger -g sh-mk.prob | apertium-pretransfer ^zračna# luka<adj><f><sg><nom><n><f><gen><pl>$
Problems:
- How to resolve
^zračna# luka<adj><f><sg><nom><n><f><gen><pl>$
in the bidix?
Solutions:
- Have two paradigms for each adjective, one with tags, one without. (bad)
- This would leave us with: ^zračna luka<n><f><gen><pl>$ (basically an orthographic paradigm).
- Have more than one entry per multi-word — this is done in
apertium-es-ca
, see "dirección general", "direcciones generales". (bad) - Have a parameterised paradigm, that when called one way outputs a paradigm with symbols, and another way outputs a paradigm without symbols.