Apertium on openSUSE

From Apertium
Jump to navigation Jump to search

Introduction

This note explains how to install the latest 3.0.4 svn version of Apertium on a stock openSUSE 10.2. The instructions below walk through the installation of the Unicode version of Apertium, since this will handle all of the world's languages. For legacy work, the non-Unicode version can also be installed using a similar process (see Section 9). Apertium is installed here into a directory in the user's homedir - in my case it was /home/kevin/downloads/linguistics, but it can be anything you choose. In this note, this choice will be referred to as [homedir].

Install required packages

If you choose the development pattern at or after your openSUSE install (using YaST, Smart, zypper or whatever), you will get most of the packages required for development, and may not have to add all of those listed below. Ensure that the following packages are present:

  • subversion
  • libxml2 and libxml2-devel
  • libxslt
  • libtool
  • pcre and pcre-devel
  • flex
  • automake
  • autoconf
  • gcc-c++

Download Apertium

Open a terminal, go to your [homedir] and checkout the latest version:

cd [homedir]
svn co http://apertium.svn.sourceforge.net/svnroot/apertium

This will take some time to fully download, and at the end you should have a directory [homedir]/apertium with 43 folders in it, some containing tools and some containing language-pair data.

Set up your install dirs

From [homedir]:

mkdir local
cd local/
mkdir legacy unicode

This creates [homedir]/local/legacy (for non-Unicode Apertium) and [homedir]/local/unicode (for Unicode Apertium). Only the latter of these is used here, but see Section 9 below.

Install lttoolbox

From [homedir]:

cd apertium/lttoolbox-unicode
./autogen.sh

At the end, you will get a message about PKG_CONFIG_PATH, so we will set it as part of the configure process.

PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode             

Obviously, replace [homedir] here and in subsequent instances by your real path.

make
make install

Your [homedir]/local/unicode dir should now contain four folders: bin, include, lib, and share.

Install apertium

From [homedir]:

cd apertium/apertium-unicode
./autogen.sh
PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode
make
make install

Your [homedir]/local/unicode/bin dir should now contain a number of apertium binaries.

Install language-pair data

In this instance we will install the cy-en (Welsh-English) pair. First, we need to ensure that the system can find the new Apertium binaries:

export PATH=/[homedir]/local/unicode/bin:$PATH

(This can be made permanent later, by adding the path to ~/.bash_profile, or whatever is your preferred method for doing this.)

From [homedir]:

cd apertium/apertium-cy-en
./autogen.sh
PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode
make
make install

Your [homedir]/local/unicode/share/apertium should now contain a new folder apertium-cy-en, and the modes folder should contain a file for each language direction (cy-en and en-cy).

Note that if you are using an AMD64 processor, you must use version 3.0.4 or above of Apertium. Versions below this have a bug which will mean the make fails.

Testing the new install

From [homedir]:

echo "test" | apertium en-cy

You should see prawf returned.

echo "prawf" | apertium cy-en

You should see test returned.