Difference between revisions of "Moses"
Jump to navigation
Jump to search
Line 94: | Line 94: | ||
* [[Using GIZA++]] |
* [[Using GIZA++]] |
||
==External links== |
|||
* [http://www.statmt.org/wmt08/baseline.html WMT08 Baseline system] |
|||
[[Category:Tools]] |
[[Category:Tools]] |
Revision as of 09:51, 30 January 2009
Requisites
- GIZA++ and mkcls http://giza-pp.googlecode.com/files/giza-pp-v1.0.2.tar.gz
- Moses (
svn co https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk
) - IRST LM (
svn co https://irstlm.svn.sourceforge.net/svnroot/irstlm
)
Compiling
- See also: Using GIZA++
- GIZA++
tar -xzvf giza-pp-v1.0.2.tar.gz cd giza-pp make cp mkcls-v2/mkcls /path/prefix/bin cp GIZA++-v2/GIZA++ /path/prefix/bin cp GIZA++-v2/plain2snt.out /path/prefix/bin cp GIZA++-v2/snt2cooc.out /path/prefix/bin cp GIZA++-v2/snt2plain.out /path/prefix/bin cp GIZA++-v2/trainGIZA++.sh /path/prefix/bin cd ..
- Moses
cd trunk ./regenerate-makefiles.sh ./configure --prefix=/path/prefix make make install cd scripts/training/symal make cp symal giza2bal.pl /path/prefix/bin cd ../../../ cd scripts/training/phrase-extract make cp extract score /path/prefix/bin cd ../../../
Now edit the file scripts/training/train-factored-phrase-model.perl
and change the following lines:
my $SCRIPTS_ROOTDIR = "/home/fran/source/moses/trunk/scripts/"; ... # the following line is set installation time by 'make release'. BEWARE! my $BINDIR="/path/prefix/bin";
cp scripts/training/train-factored-phrase-model.perl /path/prefix/bin/ cp scripts/training/symal/giza2bal.pl /path/prefix/bin/ cd ..
- IRSTLM
cd irstlm ./install
Now edit the files in scripts/build-sublm.pl
and scripts/merge-sublm.pl
and check the location of gzip,
my $gzip="/usr/bin/gzip"; my $gunzip="/usr/bin/gunzip";
On Debian systems, gzip
and gunzip
are found in /bin
, these two scripts will fail silently if gzip is not found.
cp bin/* /path/prefix/bin/ cp bin/x86_64-pc-linux-gnu/* /path/prefix/bin/ mkdir -p /path/prefix/include cp include/* /path/prefix/include cp lib/x86_64-pc-linux-gnu/libirstlm.a /path/prefix/lib/ cd ..
Building language model
export IRSTLM=/path/prefix build-lm.sh -i cy.crp.txt -o cy.lm.gz -t /tmp