apertium-pretransfer (installed as part of the
apertium package) does certain operations to multiword units before bidix lookup. Input is expected to be disambiguated, and have no surface forms (just analyses).
$ echo '^de<pr>+el<det><def><m><sg>$' | apertium-pretransfer ^de<pr>$ ^el<det><def><m><sg>$ $ echo '^arbeidsmiljø<n><nt><sg><ind><ep-Ø>+lov<n><m><sg><def>$' | apertium-pretransfer ^arbeidsmiljø<n><nt><sg><ind><ep-Ø>$ ^lov<n><m><sg><def>$
- Note: There have been discussions to change the + symbol for compounds into ~, since we typically do not want a space there.
Multiwords with inner inflection (using the <g/> in monodix) get the uninflected part, the stuff after the # sign, moved (from behind the tags) onto the lemma:
$ echo '^poner<vblex><inf># a prueba$' | apertium-pretransfer ^poner# a prueba<vblex><inf>$
- Note: The moving of the multiword queue (the part after #) is also done by