Compiling dictionaries
Revision as of 18:26, 3 September 2011 by Bech (talk | contribs) (Category:Documentation in English)
This page gives some specific instructions for compiling dictionaries from various language pairs using different build procedures, this page is principally for people who are interested in using the dictionaries as analysers or generators, and not as part of a language pair.
Standard dix compilation
Assuming you want to compile the file apertium-bn-en.bn.dix
and save it as bn.analyser.bin
:
$ lt-comp lr apertium-bn-en.bn.dix bn.analyser.bin final@inconditional 8 75 main@standard 6403 13351
English—Catalan
The English dictionary in English—Catalan, along with the English dictionaries in some other pairs (e.g. English—Spanish and English—Galician) uses a metadix file. This needs to be preprocessed before it can be compiled with lt-comp
.
$ xsltproc buscaPar.xsl apertium-en-ca.en.metadix | uniq > tmp1gen.xsl $ xsltproc tmp1gen.xsl apertium-en-ca.en.metadix > apertium-en-ca.en.dixtmp1 $ rm tmp1gen.xsl $ apertium-validate-acx apertium-en-ca.en.acx $ apertium-validate-dictionary apertium-en-ca.en.dixtmp1 $ lt-comp lr apertium-en-ca.en.dixtmp1 apertium-en-ca.en.acx $ rm apertium-en-ca.en.dixtmp1
Breton—French
The French dictionary in Breton—French is a metadix file which needs to be preprocessed before it can be compiled with lt-comp
.
xsltproc buscaPar.xsl apertium-br-fr.fr.metadix | uniq > tmp1gen.xsl xsltproc tmp1gen.xsl apertium-br-fr.fr.metadix > apertium-br-fr.fr.dix rm tmp1gen.xsl apertium-validate-dictionary apertium-br-fr.fr.dix lt-comp rl apertium-br-fr.fr.dix br-fr.autogen.bin