Difference between revisions of "Compiling dictionaries"

From Apertium
Jump to navigation Jump to search
Line 1: Line 1:
{{TOCD}}
{{TOCD}}
This page gives some specific instructions for compiling dictionaries from various language pairs using different build procedures, this page is principally for people who are interested in using the dictionaries as analysers or generators, and not as
This page gives some specific instructions for compiling dictionaries from various language pairs using different build procedures, this page is principally for people who are interested in using the dictionaries as analysers or generators, and not as part of a language pair.


==English—Catalan==
==English—Catalan==
Line 18: Line 18:
</pre>
</pre>


==Portuguese--Spanish==
==Portuguese&mdash;Spanish==


==Occitan--Catalan==
==Occitan&mdash;Catalan==





Revision as of 09:36, 25 November 2009

This page gives some specific instructions for compiling dictionaries from various language pairs using different build procedures, this page is principally for people who are interested in using the dictionaries as analysers or generators, and not as part of a language pair.

English—Catalan

The English dictionary in English—Catalan, along with the English dictionaries in some other pairs (e.g. English—Spanish and English—Galician) uses a metadix file. This needs to be preprocessed before it can be compiled with lt-comp.


$ xsltproc buscaPar.xsl apertium-en-ca.en.metadix | uniq > tmp1gen.xsl
$ xsltproc tmp1gen.xsl apertium-en-ca.en.metadix > apertium-en-ca.en.dixtmp1
$ rm tmp1gen.xsl
$ apertium-validate-acx apertium-en-ca.en.acx
$ apertium-validate-dictionary apertium-en-ca.en.dixtmp1
$ lt-comp lr apertium-en-ca.en.dixtmp1 apertium-en-ca.en.acx
$ rm apertium-en-ca.en.dixtmp1

Portuguese—Spanish

Occitan—Catalan

See also