Difference between revisions of "Crossdics"

Revision as of 18:50, 8 February 2008

Main article: Building dictionaries

Download

$ svn co https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-crossdics

Software prerequisites

You will need to install Ant (http://ant.apache.org/) and the Java Development Kit 6 (JDK6, http://java.sun.com/javase/downloads/index.jsp):

$ sudo apt-get install ant sun-java6-jdk

Compiling

$ cd apertium-crossdics
$ ant jar

Installing

$ sudo ant install

Using apertium-crossdics

$ apertium-crossdics

Crossing dictionaries

First, copy the linguistic data into the "dics" folder:

  • Bilingual dictionary B-A: apertium-bb-aa.bb-aa.dix
  • Bilingual dictionary B-C: apertium-bb-cc.bb-cc.dix
  • Morphological dictionary A: apertium-bb-aa.aa.dix
  • Morphological dictionary C: apertium-bb-cc.cc.dix


Please note that:

  • all dictionaries must be in the form:
    • apertium-xx-yy.xx-yy.dix (bilingual dictionaries)
    • apertium-xx-yy.xx.dix (morphological dictionaries)
  • the common language (B) must be on the left side, that is, the bilingual dictionaries must be in the form B-A and B-C
  • use "-r" instead of "-n" if a bilingual dictionary has to be reversed (from apertium-aa-bb.aa-bb.dix to apertium-bb-aa.bb-aa.dix)
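The naming rules above can be checked before crossing. The following is a minimal sketch, not part of apertium-crossdics: `check_dics` is a hypothetical helper that assumes the four dictionaries sit in ./dics and that the common language B is already on the left side of both bilingual dictionaries.

```shell
# Hypothetical helper (not shipped with apertium-crossdics).
# Usage: check_dics A B C   where B is the common language.
# Prints one line per missing file; returns 0 only if all four files exist.
check_dics() {
    a="$1"; b="$2"; c="$3"
    missing=0
    for f in \
        "apertium-$b-$a.$b-$a.dix" \
        "apertium-$b-$c.$b-$c.dix" \
        "apertium-$b-$a.$a.dix" \
        "apertium-$b-$c.$c.dix"
    do
        if [ ! -f "dics/$f" ]; then
            echo "missing: dics/$f"
            missing=1
        fi
    done
    return "$missing"
}
```

For instance, `check_dics ca es pt` would look for the es-ca and es-pt bilingual dictionaries and the ca and pt morphological dictionaries.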


Use the apertium-crossdics script to cross the dictionaries:

$ apertium-crossdics monA.dix -n bilAB.dix -n bilBC.dix monC.dix

For example, to cross es-ca and es-pt and obtain the ca-pt pair:

$ apertium-crossdics dics/apertium-es-ca.ca.dix -n dics/apertium-es-ca.es-ca.dix -n dics/apertium-es-pt.es-pt.dix dics/apertium-es-pt.pt.dix
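The same command line can be assembled from the three language codes. This is only a sketch under stated assumptions: `crossdics_cmd` is a hypothetical helper, it assumes the files live in ./dics, and it assumes neither bilingual dictionary needs reversing, so both take "-n". It prints the command rather than running it.

```shell
# Hypothetical helper: prints the crossing command for languages A, B, C,
# where B is the common language on the left side of both bilingual
# dictionaries (so "-n" is used for both; swap in "-r" by hand if needed).
crossdics_cmd() {
    a="$1"; b="$2"; c="$3"
    printf '%s\n' "apertium-crossdics dics/apertium-$b-$a.$a.dix -n dics/apertium-$b-$a.$b-$a.dix -n dics/apertium-$b-$c.$b-$c.dix dics/apertium-$b-$c.$c.dix"
}

# Reproduces the es-ca / es-pt example from the text.
crossdics_cmd ca es pt
```

Printing the command first makes it easy to review the file names before running the cross.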

Customizing cross actions

By default, the crossdics tool uses a cross model with very simple rules for crossing two sets of dictionaries. Some language pairs need more specific cross actions to be crossed correctly; for these, define a new cross schema with concrete pattern-action elements.

See also