Apertium-pretransfer

From Apertium
Revision as of 07:32, 26 July 2010 by Unhammer

apertium-pretransfer rewrites multiword lexical units before bilingual dictionary (bidix) lookup.

Compound multiwords (e.g. a contraction in Romance languages, marked with <j/> in the monodix, or a compound nominal in Germanic languages) are split in two at the + sign:

$ echo '^de<pr>+el<det><def><m><sg>$' | apertium-pretransfer 
^de<pr>$ ^el<det><def><m><sg>$
$ echo '^arbeidsmiljø<n><nt><sg><ind><ep-Ø>+lov<n><m><sg><def>$' | apertium-pretransfer 
^arbeidsmiljø<n><nt><sg><ind><ep-Ø>$ ^lov<n><m><sg><def>$
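
The splitting step can be imitated in a few lines of Python. This is only a minimal sketch of the behaviour shown in the examples, not the actual implementation of apertium-pretransfer: the function name split_compound is hypothetical, it expects exactly one lexical unit delimited by ^ and $, and it treats every + as a join point (escaped characters are not handled).

```python
def split_compound(lexical_unit: str) -> str:
    """Split one ^...$ lexical unit into separate units at each '+'.

    Hypothetical helper for illustration; not part of Apertium.
    Assumes the input is a single lexical unit and that every '+'
    joins two analyses (no escaping is considered).
    """
    if not (lexical_unit.startswith("^") and lexical_unit.endswith("$")):
        raise ValueError("expected a lexical unit delimited by ^ and $")
    body = lexical_unit[1:-1]          # drop the surrounding ^ and $
    parts = body.split("+")            # one entry per joined analysis
    # Re-wrap each part as its own lexical unit, separated by a space.
    return " ".join(f"^{part}$" for part in parts)


print(split_compound("^de<pr>+el<det><def><m><sg>$"))
# ^de<pr>$ ^el<det><def><m><sg>$
```

A unit without a + sign comes back unchanged, which matches the idea that only compound multiwords are affected.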

Multiwords with inner inflection (marked with <g/> in the monodix) have their uninflected part moved from behind the tags onto the lemma:

$ echo '^poner<vblex><inf># a prueba$' | apertium-pretransfer 
^poner# a prueba<vblex><inf>$
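
The reordering can likewise be sketched in Python. Again this is an illustration under stated assumptions rather than the tool's real code: move_invariable_part is a hypothetical name, the input is assumed to be a single lexical unit of the shape lemma, tags, then a trailing part that starts with #, and anything that does not fit that shape is returned untouched.

```python
import re

# lemma, then one or more <tag> groups, then a trailing "#..." part.
# This pattern is an assumption based on the example above.
_UNIT = re.compile(r"\^([^<#]+)((?:<[^>]+>)+)(#[^<$]*)\$")


def move_invariable_part(lexical_unit: str) -> str:
    """Move the '# ...' part from behind the tags onto the lemma.

    Hypothetical helper for illustration; not part of Apertium.
    """
    match = _UNIT.fullmatch(lexical_unit)
    if match is None:
        return lexical_unit            # not of the expected shape
    lemma, tags, invariable = match.groups()
    return f"^{lemma}{invariable}{tags}$"


print(move_invariable_part("^poner<vblex><inf># a prueba$"))
# ^poner# a prueba<vblex><inf>$
```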