English and Kazakh

Starting work on Apertium English to Kazakh

These notes are mainly for Anel, Aizhan and Assem, who have started developing this language pair.

Installing what is needed

Operating System

Install a suitable GNU/Linux system such as Debian, Ubuntu or Mint.

Install build essentials, etc.

Open a terminal window and type:

sudo apt-get install subversion build-essential g++ pkg-config gawk libxml2 \
libxml2-dev libxml2-utils xsltproc flex automake autoconf libtool libpcre3-dev \
cmake libicu-dev libboost-dev libgoogle-perftools-dev bison

Enter your password and wait until the packages are downloaded and installed.
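
Before building anything, it can help to confirm that the build tools actually ended up on your PATH. The following is a minimal sketch, not part of the original instructions: the helper name `missing_tools` is hypothetical, and the list of commands is an assumption derived from the package names above (for example, `xmllint` is assumed to come from libxml2-utils and `libtoolize` from libtool).

```shell
# Print the name of every command in the argument list that is not on PATH.
# missing_tools is a hypothetical helper, not an Apertium tool.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# The command names below are assumptions based on the packages installed above.
missing=$(missing_tools svn g++ make pkg-config gawk xmllint xsltproc \
    flex bison automake autoconf libtoolize cmake)

if [ -z "$missing" ]; then
    echo "All build tools found."
else
    echo "Still missing:"
    echo "$missing"
fi
```

If anything is reported as missing, rerun the apt-get command above and check its output for errors.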

If you don't already have a directory for sources, make one in your home directory and enter it:

cd ~
mkdir Source
cd Source

Install HFST

This language pair uses the Helsinki Finite State Toolkit (HFST) for Kazakh generation, so we need to install it and its dependencies.

Install OpenFST

Main article: OpenFST
wget http://www.openfst.org/twiki/pub/FST/FstDownload/openfst-1.3.1.tar.gz &&
tar -xzvf openfst-1.3.1.tar.gz &&
cd openfst-1.3.1/ &&
./configure &&
make &&
sudo make install

Install Foma

Main article: Foma
cd ~/Source
svn checkout http://foma.googlecode.com/svn/trunk/foma/ foma
cd foma
make
sudo make install

Install HFST

Main article: HFST
cd ~/Source
svn co https://hfst.svn.sourceforge.net/svnroot/hfst/trunk/hfst3
cd hfst3/
autoreconf -i
./configure --enable-lexc --with-foma
make
sudo make install
sudo ldconfig

Install VISLCG3

Main article: Apertium and Constraint Grammar
cd ~/Source
svn co http://beta.visl.sdu.dk/svn/visl/tools/vislcg3/trunk vislcg3
cd vislcg3
./cmake.sh
make -j3
sudo make install
sudo ldconfig

Download apertium, lttoolbox and eng-kaz data from SVN

Main article: Minimal installation from SVN
cd ~/Source
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/lttoolbox
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-lex-tools
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-tools
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-kaz
svn co http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-eng-kaz

Compile and install lttoolbox

cd lttoolbox/
PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh
make
sudo make install
sudo ldconfig
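
The `PKG_CONFIG_PATH=/usr/local/lib/pkgconfig` prefix is there so that later builds can find the libraries just installed under /usr/local. As a rough check (a sketch only: the helper name `find_pc_modules` is hypothetical, and the exact module name registered by lttoolbox depends on its version, so the example searches by substring), you can ask pkg-config what it can see:

```shell
# List pkg-config modules whose name or description contains the given string,
# searching /usr/local/lib/pkgconfig in addition to any existing PKG_CONFIG_PATH.
# find_pc_modules is a hypothetical helper, not part of pkg-config.
find_pc_modules() {
    PKG_CONFIG_PATH=/usr/local/lib/pkgconfig${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH} \
        pkg-config --list-all 2>/dev/null | grep -i "$1"
}

find_pc_modules lttoolbox || echo "No lttoolbox module visible to pkg-config yet."
```

If no module is listed, the `./autogen.sh` step for apertium is likely to fail when it looks for lttoolbox.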

Compile and install apertium

cd ..
cd apertium/
PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh
make
sudo make install
sudo ldconfig

Compile and install apertium-lex-tools

cd ..
cd apertium-lex-tools
PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh
make
sudo make install
sudo ldconfig

Install English--Kazakh language pair data from incubator

cd ..
cd apertium-eng-kaz/
PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh
make
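
Once `make` has finished, you can try the pair directly from the build directory without installing it. This is a sketch under assumptions: it assumes the translation mode is named `eng-kaz` and that `make` has generated `modes/eng-kaz.mode`; the helper name `translate_here` and the sample sentence are illustrative only.

```shell
# Translate one sentence using the data in a language pair's build directory.
# translate_here is a hypothetical helper; the mode name eng-kaz is an assumption.
translate_here() {
    pair_dir=$1
    text=$2
    if ! command -v apertium >/dev/null 2>&1; then
        echo "apertium not found on PATH" >&2
        return 1
    fi
    if [ ! -f "$pair_dir/modes/eng-kaz.mode" ]; then
        echo "no eng-kaz mode in $pair_dir/modes; did make finish?" >&2
        return 1
    fi
    # -d tells apertium where to look for the linguistic data.
    printf '%s\n' "$text" | apertium -d "$pair_dir" eng-kaz
}

# Run from inside apertium-eng-kaz/.
translate_here . "I have a book." || echo "Translation test skipped."
```

Since the pair is still in the incubator, expect the output to contain unknown-word marks for vocabulary that has not been added yet.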

Troubleshooting

If you get:

lt-comp: error while loading shared libraries: liblttoolbox3-3.2.so.0: cannot open shared object file: No such file or directory

Then refresh the shared library cache so that the newly installed library can be found:

sudo ldconfig
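
To see whether the cache already knows about the library, you can search the output of `ldconfig -p`, which prints the current cache on glibc systems. A small sketch (the helper name `lib_in_cache` is hypothetical, and `ldconfig` may live in /sbin, which is not always on a normal user's PATH):

```shell
# Succeed if the dynamic linker cache lists a library matching the given name.
# lib_in_cache is a hypothetical helper, not a system command.
lib_in_cache() {
    ldconfig -p 2>/dev/null | grep -q "$1"
}

if lib_in_cache liblttoolbox; then
    echo "liblttoolbox is in the linker cache."
else
    echo "liblttoolbox is not in the cache; run: sudo ldconfig"
fi
```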

Browse SVN

http://apertium.svn.sf.net/viewvc/apertium/incubator/apertium-eng-kaz/

IRC

Open up XChat (normally "Programs -> Internet -> XChat IRC") and type:

/server irc.freenode.net
/join #apertium
/join #hfst