Difference between revisions of "Matxin"
Jump to navigation
Jump to search
(83 intermediate revisions by 6 users not shown) | |||
Line 1: | Line 1: | ||
{{TOCD}} |
{{TOCD}} |
||
'''Matxin''' is a free software machine translation engine related to [[Apertium]]. It allows for deeper transfer than can be found in Apertium. |
'''Matxin''' is a free software machine translation engine related to [[Apertium]]. It allows for deeper transfer than can be found in Apertium. |
||
This page describes how to install the system, see [[Matxin#Documentation]] below for how to create or maintain language pairs. |
|||
==Prerequisites== |
|||
==Installation== |
|||
* libcfg+ — http://platon.sk/upload/_projects/00003/libcfg+-0.6.2.tar.gz |
|||
* BerkleyDB — sudo apt-get install libdb4.6++-dev |
|||
* libomlet — https://lafarga.cpl.upc.edu/frs/download.php/130/libomlet-0.97.tar.gz |
|||
* libfries — https://lafarga.cpl.upc.edu/frs/download.php/129/libfries-0.95.tar.gz |
|||
* FreeLing (version 1.5) — https://lafarga.cpl.upc.edu/frs/download.php/90/FreeLing-1.5.tar.gz |
|||
:If you're installing into a prefix, you'll need to set two environment variables: CPPFLAGS=-I<prefix>/include LDFLAGS=-L<prefix>/lib ./configure --prefix=<prefix> |
|||
* [[lttoolbox]] (version 2.0 -- yes, this is ancient) |
|||
==Building== |
|||
;Checkout |
|||
<pre> |
<pre> |
||
$ |
$ git clone https://github.com/matxin/matxin.git |
||
$ cd matxin/ |
|||
$ ./autogen.sh |
|||
$ make |
|||
# make install |
|||
</pre> |
</pre> |
||
==Language pairs== |
|||
First comment out the deformatters in the <code>src/Makefile.am</code> as they don't build properly. |
|||
* [[matxin-spa-eus]] |
|||
<pre> |
|||
* [[matxin-eng-eus]] |
|||
bin_PROGRAMS = Analyzer LT ST_intra ST_inter ST_prep ST_verb SG_inter SG_intra \ |
|||
MG |
|||
==Troubleshooting== |
|||
#reFormat txt-deformat html-deformat rtf-deformat |
|||
;Can't find AP_MKINCLUDE |
|||
# |
|||
#reFormat_SOURCES = reFormat.C data_manager.C XML_reader.C simpleregex.C |
|||
#reFormat_LDADD = -lpcre -lxml2 |
|||
# |
|||
#txt_deformat_SOURCES = txt-deformat.C |
|||
#html_deformat_SOURCES = html-deformat.C |
|||
#rtf_deformat_SOURCES = rtf-deformat.C |
|||
# |
|||
#txt-deformat.C: txt-format.xml Makefile.am deformat.xsl |
|||
# $(XSLTPROC) --stringparam mode matxin deformat.xsl txt-format.xml | $(FLEX) -Cfer -t >$@ |
|||
# |
|||
#html-deformat.C: html-format.xml Makefile.am deformat.xsl |
|||
# $(XSLTPROC) --stringparam mode matxin deformat.xsl html-format.xml | $(FLEX) -Cfer -t >$@ |
|||
# |
|||
#rtf-deformat.C: rtf-format.xml Makefile.am deformat.xsl |
|||
# $(XSLTPROC) --stringparam mode matxin deformat.xsl rtf-format.xml | $(FLEX) -Cfer -t >$@ |
|||
# |
|||
set your <code>ACLOCAL_PATH</code> to include the path to <code>matxin.m4</code> |
|||
</pre> |
|||
==Documentation== |
|||
Then in the file <code>configure.ac</code>, change |
|||
* [http://matxin.svn.sourceforge.net/viewvc/matxin/trunk/doc/documentation-es.pdf Descripción del sistema de traducción es-eu Matxin] (in Spanish) |
|||
<pre> |
|||
* [[Documentation of Matxin]] (in English) |
|||
AC_CHECK_LIB(lttoolbox-2.0, main, [], [AC_MSG_ERROR([library 'lttoolbox' is required for Matxin])]) |
|||
* [[Matxin New Language Pair HOWTO]] |
|||
</pre> |
|||
==Contact== |
|||
for |
|||
Questions and comments about Matxin can be sent to their mailing list [https://lists.sourceforge.net/lists/listinfo/matxin-devel matxin-devel], or to the [https://lists.sourceforge.net/lists/listinfo/apertium-stuff apertium-stuff] list. |
|||
<pre> |
|||
PKG_CHECK_MODULES(MATXIN, [lttoolbox-2.0 >= 2.0.0],,exit) |
|||
==External links== |
|||
CFLAGS="$CFLAGS $MATXIN_CFLAGS" |
|||
CPPFLAGS="$CPPFLAGS $MATXIN_CPPFLAGS" |
|||
LIBS="$LIBS $MATXIN_LIBS" |
|||
</pre> |
|||
*[http://ixa.si.ehu.es/Ixa IXA Research Group] |
|||
Then you need to comment out the check for gcc version in <code>configure.ac</code> |
|||
<pre> |
|||
# |
|||
# GCC version check |
|||
# |
|||
#AC_MSG_CHECKING(for GCC version) |
|||
#gccver=`$CXX -dumpversion` |
|||
#AC_MSG_RESULT($gccver); |
|||
#case "$gccver" in |
|||
# 4.3*) |
|||
# AC_MSG_ERROR([GCC version must be <= 4.2]) |
|||
#esac |
|||
# |
|||
</pre> |
|||
Then do: |
|||
<pre> |
|||
$ aclocal; automake -a; autoconf |
|||
$ LTTOOLBOX_DIR=/home/fran/svnroot/local/unstable/ LDFLAGS="-L/home/fran/local/lib -L/usr/lib" |
|||
CPPFLAGS="-I/home/fran/local/include" ./configure --prefix=/home/fran/local/ |
|||
</pre> |
|||
You will probably need to fix a few things in the file: <code>src/Makefile</code> to get it to build, for example add: |
|||
<pre> |
|||
DEFAULT_INCLUDES = -I. -I$(srcdir) -I/usr/include/libxml2 |
|||
CPPFLAGS = -I<prefix>/include/lttoolbox-2.0 -I<prefix>/include |
|||
</pre> |
|||
==Executing== |
|||
<pre> |
|||
$ export MATXIN_DIR=<prefix> |
|||
$ echo "Esto es una prueba" | \ |
|||
./Analyzer -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./LT -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./ST_inter --inter 1 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./ST_prep -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./ST_inter --inter 2 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./ST_verb -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./ST_inter --inter 3 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./SG_inter -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./SG_intra -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./MG -f $MATXIN_DIR/share/matxin/config.cfg | \ |
|||
./reFormat |
|||
Hau prueba bat da |
|||
</pre> |
|||
==Speed== |
|||
Between 25--30 words per second. |
|||
==Documentation== |
|||
* [http://matxin.svn.sourceforge.net/viewvc/matxin/trunk/doc/documentation-es.pdf Descripción del sistema de traducción es-eu Matxin] (in Spanish) |
|||
[[Category: |
[[Category:Matxin|*]] |
Latest revision as of 20:29, 7 May 2016
Matxin is a free software machine translation engine related to Apertium. It allows for deeper transfer than can be found in Apertium.
This page describes how to install the system, see Matxin#Documentation below for how to create or maintain language pairs.
Installation[edit]
$ git clone https://github.com/matxin/matxin.git $ cd matxin/ $ ./autogen.sh $ make # make install
Language pairs[edit]
Troubleshooting[edit]
- Can't find AP_MKINCLUDE
set your ACLOCAL_PATH
to include the path to matxin.m4
Documentation[edit]
- Descripción del sistema de traducción es-eu Matxin (in Spanish)
- Documentation of Matxin (in English)
- Matxin New Language Pair HOWTO
Contact[edit]
Questions and comments about Matxin can be sent to their mailing list matxin-devel, or to the apertium-stuff list.