Apertium on openSUSE
Contents
Introduction
This note explains how to install the latest 3.0.4 svn version of Apertium on a stock openSUSE 10.2. The instructions below walk through the installation of the Unicode version of Apertium, since this will handle all of the world's languages. For legacy work, the non-Unicode version can also be installed using a similar process (see Section 9). Apertium is installed here into a directory in the user's homedir - in my case it was /home/kevin/downloads/linguistics, but it can be anything you choose. In this note, this choice will be referred to as [homedir].
Install required packages
If you choose the development pattern at or after your openSUSE install (using YaST, Smart, zypper or whatever), you will get most of the packages required for development, and may not have to add all of those listed below. Ensure that the following packages are present:
- subversion
- libxml2 and libxml2-devel
- libxslt
- libtool
- pcre and pcre-devel
- flex
- automake
- autoconf
- gcc-c++
Download Apertium
Open a terminal, go to your [homedir] and checkout the latest version:
cd [homedir] svn co http://apertium.svn.sourceforge.net/svnroot/apertium
This will take some time to fully download, and at the end you should have a directory [homedir]/apertium with 43 folders in it, some containing tools and some containing language-pair data.
Set up your install dirs
From [homedir]:
mkdir local cd local/ mkdir legacy unicode
This creates [homedir]/local/legacy (for non-Unicode Apertium) and [homedir]/local/unicode (for Unicode Apertium). Only the latter of these is used here, but see Section 9 below.
Install lttoolbox
From [homedir]:
cd apertium/lttoolbox-unicode ./autogen.sh
At the end, you will get a message about PKG_CONFIG_PATH, so we will set it as part of the configure process.
PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode
Obviously, replace [homedir] here and in subsequent instances by your real path.
make make install
Your [homedir]/local/unicode dir should now contain four folders: bin, include, lib, and share.
Install apertium
From [homedir]:
cd apertium/apertium-unicode ./autogen.sh PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode make make install
Your [homedir]/local/unicode/bin dir should now contain a number of apertium binaries.
Install language-pair data
In this instance we will install the cy-en (Welsh-English) pair. First, we need to ensure that the system can find the new Apertium binaries:
export PATH=/[homedir]/local/unicode/bin:$PATH
(This can be made permanent later, by adding the path to ~/.bash_profile, or whatever is your preferred method for doing this.)
From [homedir]:
cd apertium/apertium-cy-en ./autogen.sh PKG_CONFIG_PATH=[homedir]/local/unicode/lib/pkgconfig ./configure --prefix=[homedir]/local/unicode make make install
Your [homedir]/local/unicode/share/apertium should now contain a new folder apertium-cy-en, and the modes folder should contain a file for each language direction (cy-en and en-cy).
Note that if you are using an AMD64 processor, you must use version 3.0.4 or above of Apertium. Versions below this have a bug which will mean the make fails.
Testing the new install
From [homedir]:
echo "test" | apertium en-cy
You should see prawf returned.
echo "prawf" | apertium cy-en
You should see test returned.