Difference between revisions of "Crossdics"
Jump to navigation
Jump to search
m (→Compiling) |
m |
||
Line 6: | Line 6: | ||
$ svn co https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-crossdics |
$ svn co https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-crossdics |
||
</pre> |
</pre> |
||
== Software prerequisites == |
|||
You will need to install [http://ant.apache.org/ Ant] and [http://java.sun.com/javase/downloads/index.jsp Java Development Kit 6 (JDK6)] |
|||
$ sudo apt-get install ant sun-java6-jdk |
|||
== Compiling == |
== Compiling == |
Revision as of 18:50, 8 February 2008
- Main article: Building dictionaries
Contents
Download
$ svn co https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-crossdics
Software prerequisites
You will need to install Ant and Java Development Kit 6 (JDK6)
$ sudo apt-get install ant sun-java6-jdk
Compiling
$ cd apertium-crossdics $ ant jar
Installing
$ sudo ant install
Using apertium-crossdics
$ apertium-crossdics
Crossing dictionaries
First of all, copy linguistic data into folder "dics"
- Bilingual dictionary A-B:
apertium-bb-aa.bb-aa.dix
- Bilingual dictionary B-C:
apertium-bb-cc.bb-cc.dix
- Morphological dictionary A:
apertium-bb-aa.aa.dix
- Morphological dictionary C:
apertium-bb-cc.cc.dix
Please note that:
- all dictionaries must be in the form:
apertium-xx-yy.xx-yy.dix
(bilingual dictionaries)apertium-xx-yy.xx.dix
(morphological dictionaries)
- the common language (B) must be in the left side, that is, dictionaries in the form B-A and B-C
- use "-r" instead of "-n" if the dictionary has to be reversed (apertium-aa-bb.aa-bb.dix to apertium-bb-aa.bb-aa.dix)
Use the dictools script to cross the dictionaries:
$ apertium-crossdics monA.dix -n bilAB.dix -n bilBC-dix monC.dix
An example for crossing es-ca and es-pt to get the ca-pt pair.
$ apertium-crossdics dics/apertium-es-ca.ca.dix -n dics/apertium-es-ca.es-ca.dix -n dics/apertium-es-pt.es-pt.dix dics/apertium-es-pt.pt.dix
Customizing cross actions
By default, the crossdics tool uses a simple cross model defining very simple rules for crossing two sets of dictionaries. However, more specific cross actions might be needed in order to cross certain language pairs correctly. Defining a new cross schema with concrete pattern-action elements solves this problem.