English and Kazakh
Starting work on Apertium English to Kazakh
These notes are basically for Anel, Aizhan and Assem who have started to develop this language pair...
Installing what is needed
Operating System
Install a suitable GNU/Linux system such as Debian, Ubuntu, Mint...
Install build essentials, etc.
Open a terminal window and type
sudo apt-get install subversion build-essential g++ pkg-config gawk libxml2 \ libxml2-dev libxml2-utils xsltproc flex automake autoconf libtool libpcre3-dev \ cmake libicu-dev libboost-dev libgoogle-perftools-dev bison libreadline-dev
enter your password and Wait till the packages are downloaded and installed.
If you don't already have a directory for sources, make one in your home directory and enter it:
cd ~ mkdir Source cd Source
Install HFST
This language pair uses the Helsinki Finite State Toolkit for Kazakh generation, so we need to install it, and its dependencies.
Install OpenFST
- Main article: OpenFST
wget http://www.openfst.org/twiki/pub/FST/FstDownload/openfst-1.3.1.tar.gz && tar -xzvf openfst-1.3.1.tar.gz && cd openfst-1.3.1/ && ./configure && make && sudo make install cd ..
Install Foma
- Main article: Foma
svn checkout http://foma.googlecode.com/svn/trunk/foma/ foma cd foma make sudo make install cd ..
Install HFST
- Main article: HFST
svn co https://hfst.svn.sourceforge.net/svnroot/hfst/trunk/hfst3 cd hfst3/ autoreconf -i ./configure --enable-lexc --with-foma make sudo make install sudo ldconfig cd ..
Install VISLCG3
- Main article: Apertium and Constraint Grammar
svn co http://beta.visl.sdu.dk/svn/visl/tools/vislcg3/trunk vislcg3 cd vislcg3 ./cmake.sh make -j3 sudo make install cd ..
Download apertium, lttoolbox and eng-kaz data from SVN
- Main article: Minimal installation from SVN
cd ~/Source svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/lttoolbox svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-lex-tools svn co http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-tools svn co http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-kaz svn co http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-eng-kaz
Compile and install lttoolbox
cd lttoolbox/ PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh make sudo make install sudo ldconfig
Compile and install apertium
cd .. cd apertium/ PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh make sudo make install sudo ldconfig
Compile and install apertium-lex-tools
cd .. cd apertium-lex-tools PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh make sudo make install sudo ldconfig
Install English--Kazakh language pair data from incubator
cd .. cd apertium-eng-kaz/ PKG_CONFIG_PATH=/usr/local/lib/pkgconfig ./autogen.sh make
Troubleshooting
If you get:
lt-comp: error while loading shared libraries: liblttoolbox3-3.2.so.0: cannot open shared object file: No such file or directory
Then you should do:
sudo ldconfig
Browse SVN
http://apertium.svn.sf.net/viewvc/apertium/incubator/apertium-eng-kaz/
IRC
Open up XChat (normally "Programs -> Internet -> XChat IRC") and type:
/server irc.freenode.net /join #apertium /join #hfst