
Compiling dictionaries


Revision as of 06:22, 23 February 2011

This page gives specific instructions for compiling dictionaries from language pairs that use different build procedures. It is aimed principally at people who want to use the dictionaries on their own as analysers or generators, rather than as part of a language pair.

Standard dix compilation

Assuming you want to compile the file apertium-bn-en.bn.dix and save it as bn.analyser.bin:

$ lt-comp lr apertium-bn-en.bn.dix bn.analyser.bin
final@inconditional 8 75
main@standard 6403 13351
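
The command above can be wrapped in a small helper that checks its arguments before calling lt-comp. This is only a sketch: the function name compile_dix and its exit codes are invented for illustration and are not part of Apertium. It relies on lt-comp taking a direction (lr for an analyser, rl for a generator), a dictionary file and an output file.

```shell
# compile_dix DIRECTION DIX_FILE OUTPUT_BIN
# Hypothetical helper (not shipped with Apertium): validates its arguments,
# then hands them to lt-comp unchanged.
compile_dix() (
  direction=$1
  dix=$2
  out=$3
  case "$direction" in
    lr|rl) ;;   # lr builds an analyser, rl builds a generator
    *) echo "direction must be lr or rl, got: $direction" >&2; exit 2 ;;
  esac
  [ -f "$dix" ] || { echo "no such dictionary: $dix" >&2; exit 1; }
  command -v lt-comp >/dev/null 2>&1 || { echo "lt-comp not found in PATH" >&2; exit 127; }
  lt-comp "$direction" "$dix" "$out"
)

# Example calls (commented out so that sourcing this file has no side effects):
# compile_dix lr apertium-bn-en.bn.dix bn.analyser.bin
# echo "some text" | lt-proc bn.analyser.bin   # lt-proc analyses by default
```

The function body runs in a subshell, so its variables and exit calls do not leak into the calling shell.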

English—Catalan

The English dictionary in English—Catalan, along with the English dictionaries in some other pairs (e.g. English—Spanish and English—Galician) uses a metadix file. This needs to be preprocessed before it can be compiled with lt-comp.


$ xsltproc buscaPar.xsl apertium-en-ca.en.metadix | uniq > tmp1gen.xsl
$ xsltproc tmp1gen.xsl apertium-en-ca.en.metadix > apertium-en-ca.en.dixtmp1
$ rm tmp1gen.xsl
$ apertium-validate-acx apertium-en-ca.en.acx
$ apertium-validate-dictionary apertium-en-ca.en.dixtmp1
$ lt-comp lr apertium-en-ca.en.dixtmp1 en-ca.automorf.bin apertium-en-ca.en.acx
$ rm apertium-en-ca.en.dixtmp1
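
The two xsltproc passes can be sketched as a reusable function. The name expand_metadix is hypothetical, and using mktemp for the intermediate stylesheet (instead of tmp1gen.xsl in the working directory) is a substitution made here so that the temporary files are removed even when a step fails.

```shell
# expand_metadix METADIX OUTPUT_DIX [STYLESHEET]
# Hypothetical helper: expands a metadix file into a plain dix file.
# STYLESHEET defaults to buscaPar.xsl in the current directory.
expand_metadix() (
  metadix=$1
  out=$2
  stylesheet=${3:-buscaPar.xsl}
  [ -f "$metadix" ] || { echo "no such metadix file: $metadix" >&2; exit 1; }
  [ -f "$stylesheet" ] || { echo "no such stylesheet: $stylesheet" >&2; exit 1; }
  tmpdir=$(mktemp -d) || exit 1
  trap 'rm -rf "$tmpdir"' EXIT   # fires when this subshell exits, on success or failure
  # Pass 1: generate a dictionary-specific stylesheet and drop adjacent duplicate lines.
  xsltproc "$stylesheet" "$metadix" > "$tmpdir/raw.xsl" || exit 1
  uniq "$tmpdir/raw.xsl" > "$tmpdir/gen.xsl" || exit 1
  # Pass 2: apply the generated stylesheet to the metadix file.
  xsltproc "$tmpdir/gen.xsl" "$metadix" > "$out" || exit 1
)

# Example call (commented out):
# expand_metadix apertium-en-ca.en.metadix apertium-en-ca.en.dixtmp1
```

The resulting file would still need to be validated and compiled with lt-comp as in the steps above.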

Breton—French

The French dictionary in Breton—French is a metadix file which needs to be preprocessed before it can be compiled with lt-comp.

$ xsltproc buscaPar.xsl apertium-br-fr.fr.metadix | uniq > tmp1gen.xsl
$ xsltproc tmp1gen.xsl apertium-br-fr.fr.metadix > apertium-br-fr.fr.dix
$ rm tmp1gen.xsl
$ apertium-validate-dictionary apertium-br-fr.fr.dix
$ lt-comp rl apertium-br-fr.fr.dix br-fr.autogen.bin
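
Because the dictionary is compiled with rl, the result is a generator, which can be tried out with lt-proc -g. The sketch below is illustrative only: the helper names are invented, and the lemma and tags in the example call are guesses that have not been checked against the French dictionary.

```shell
# to_lexical_unit 'lemma<tag>...'
# Wraps a lexical form in the ^...$ delimiters of the Apertium stream format.
to_lexical_unit() {
  printf '^%s$\n' "$1"
}

# generate GENERATOR_BIN 'lemma<tag>...'
# Hypothetical helper: sends one lexical unit to lt-proc in generation mode.
generate() {
  to_lexical_unit "$2" | lt-proc -g "$1"
}

# Example call (commented out; the lemma and tags are assumptions):
# generate br-fr.autogen.bin 'chat<n><m><sg>'
```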

Portuguese—Spanish

Occitan—Catalan

French—Spanish

See also