Difference between revisions of "Matxin"
Jump to navigation
Jump to search
(83 intermediate revisions by 6 users not shown) | |||
Line 1: | Line 1: | ||
{{TOCD}} |
{{TOCD}} |
||
− | '''Matxin''' is a free software machine translation engine related to [[Apertium]]. It allows for deeper transfer than can be found in Apertium. |
+ | '''Matxin''' is a free software machine translation engine related to [[Apertium]]. It allows for deeper transfer than can be found in Apertium. |
+ | This page describes how to install the system, see [[Matxin#Documentation]] below for how to create or maintain language pairs. |
||
− | ==Prerequisites== |
||
+ | ==Installation== |
||
− | * libcfg+ — http://platon.sk/upload/_projects/00003/libcfg+-0.6.2.tar.gz |
||
− | * BerkleyDB — sudo apt-get install libdb4.6++-dev |
||
− | * libomlet — https://lafarga.cpl.upc.edu/frs/download.php/130/libomlet-0.97.tar.gz |
||
− | * libfries — https://lafarga.cpl.upc.edu/frs/download.php/129/libfries-0.95.tar.gz |
||
− | * FreeLing (version 1.5) — https://lafarga.cpl.upc.edu/frs/download.php/90/FreeLing-1.5.tar.gz |
||
− | :If you're installing into a prefix, you'll need to set two environment variables: CPPFLAGS=-I<prefix>/include LDFLAGS=-L<prefix>/lib ./configure --prefix=<prefix> |
||
− | * [[lttoolbox]] (version 2.0 -- yes, this is ancient) |
||
− | |||
− | |||
− | |||
− | ==Building== |
||
− | |||
− | ;Checkout |
||
<pre> |
<pre> |
||
− | $ |
+ | $ git clone https://github.com/matxin/matxin.git |
+ | $ cd matxin/ |
||
+ | $ ./autogen.sh |
||
+ | $ make |
||
+ | # make install |
||
</pre> |
</pre> |
||
+ | ==Language pairs== |
||
− | First comment out the deformatters in the <code>src/Makefile.am</code> as they don't build properly. |
||
+ | * [[matxin-spa-eus]] |
||
− | <pre> |
||
+ | * [[matxin-eng-eus]] |
||
− | bin_PROGRAMS = Analyzer LT ST_intra ST_inter ST_prep ST_verb SG_inter SG_intra \ |
||
− | MG |
||
+ | ==Troubleshooting== |
||
− | #reFormat txt-deformat html-deformat rtf-deformat |
||
+ | ;Can't find AP_MKINCLUDE |
||
− | # |
||
− | #reFormat_SOURCES = reFormat.C data_manager.C XML_reader.C simpleregex.C |
||
− | #reFormat_LDADD = -lpcre -lxml2 |
||
− | # |
||
− | #txt_deformat_SOURCES = txt-deformat.C |
||
− | #html_deformat_SOURCES = html-deformat.C |
||
− | #rtf_deformat_SOURCES = rtf-deformat.C |
||
− | # |
||
− | #txt-deformat.C: txt-format.xml Makefile.am deformat.xsl |
||
− | # $(XSLTPROC) --stringparam mode matxin deformat.xsl txt-format.xml | $(FLEX) -Cfer -t >$@ |
||
− | # |
||
− | #html-deformat.C: html-format.xml Makefile.am deformat.xsl |
||
− | # $(XSLTPROC) --stringparam mode matxin deformat.xsl html-format.xml | $(FLEX) -Cfer -t >$@ |
||
− | # |
||
− | #rtf-deformat.C: rtf-format.xml Makefile.am deformat.xsl |
||
− | # $(XSLTPROC) --stringparam mode matxin deformat.xsl rtf-format.xml | $(FLEX) -Cfer -t >$@ |
||
− | # |
||
+ | set your <code>ACLOCAL_PATH</code> to include the path to <code>matxin.m4</code> |
||
− | </pre> |
||
+ | ==Documentation== |
||
− | Then in the file <code>configure.ac</code>, change |
||
+ | * [http://matxin.svn.sourceforge.net/viewvc/matxin/trunk/doc/documentation-es.pdf Descripción del sistema de traducción es-eu Matxin] (in Spanish) |
||
− | <pre> |
||
+ | * [[Documentation of Matxin]] (in English) |
||
− | AC_CHECK_LIB(lttoolbox-2.0, main, [], [AC_MSG_ERROR([library 'lttoolbox' is required for Matxin])]) |
||
+ | * [[Matxin New Language Pair HOWTO]] |
||
− | </pre> |
||
+ | ==Contact== |
||
− | for |
||
+ | Questions and comments about Matxin can be sent to their mailing list [https://lists.sourceforge.net/lists/listinfo/matxin-devel matxin-devel], or to the [https://lists.sourceforge.net/lists/listinfo/apertium-stuff apertium-stuff] list. |
||
− | <pre> |
||
− | PKG_CHECK_MODULES(MATXIN, [lttoolbox-2.0 >= 2.0.0],,exit) |
||
+ | ==External links== |
||
− | CFLAGS="$CFLAGS $MATXIN_CFLAGS" |
||
− | CPPFLAGS="$CPPFLAGS $MATXIN_CPPFLAGS" |
||
− | LIBS="$LIBS $MATXIN_LIBS" |
||
− | </pre> |
||
+ | *[http://ixa.si.ehu.es/Ixa IXA Research Group] |
||
− | Then you need to comment out the check for gcc version in <code>configure.ac</code> |
||
− | |||
− | <pre> |
||
− | # |
||
− | # GCC version check |
||
− | # |
||
− | #AC_MSG_CHECKING(for GCC version) |
||
− | #gccver=`$CXX -dumpversion` |
||
− | #AC_MSG_RESULT($gccver); |
||
− | #case "$gccver" in |
||
− | # 4.3*) |
||
− | # AC_MSG_ERROR([GCC version must be <= 4.2]) |
||
− | #esac |
||
− | # |
||
− | </pre> |
||
− | |||
− | Then do: |
||
− | |||
− | <pre> |
||
− | $ aclocal; automake -a; autoconf |
||
− | $ LTTOOLBOX_DIR=/home/fran/svnroot/local/unstable/ LDFLAGS="-L/home/fran/local/lib -L/usr/lib" |
||
− | CPPFLAGS="-I/home/fran/local/include" ./configure --prefix=/home/fran/local/ |
||
− | </pre> |
||
− | |||
− | You will probably need to fix a few things in the file: <code>src/Makefile</code> to get it to build, for example add: |
||
− | |||
− | <pre> |
||
− | DEFAULT_INCLUDES = -I. -I$(srcdir) -I/usr/include/libxml2 |
||
− | CPPFLAGS = -I<prefix>/include/lttoolbox-2.0 -I<prefix>/include |
||
− | </pre> |
||
− | |||
− | ==Executing== |
||
− | |||
− | <pre> |
||
− | $ export MATXIN_DIR=<prefix> |
||
− | $ echo "Esto es una prueba" | \ |
||
− | ./Analyzer -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./LT -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./ST_inter --inter 1 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./ST_prep -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./ST_inter --inter 2 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./ST_verb -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./ST_inter --inter 3 -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./SG_inter -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./SG_intra -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./MG -f $MATXIN_DIR/share/matxin/config.cfg | \ |
||
− | ./reFormat |
||
− | |||
− | Hau prueba bat da |
||
− | </pre> |
||
− | |||
− | ==Speed== |
||
− | |||
− | Between 25--30 words per second. |
||
− | |||
− | ==Documentation== |
||
− | |||
− | * [http://matxin.svn.sourceforge.net/viewvc/matxin/trunk/doc/documentation-es.pdf Descripción del sistema de traducción es-eu Matxin] (in Spanish) |
||
− | [[Category: |
+ | [[Category:Matxin|*]] |
Latest revision as of 20:29, 7 May 2016
Matxin is a free software machine translation engine related to Apertium. It allows for deeper transfer than can be found in Apertium.
This page describes how to install the system, see Matxin#Documentation below for how to create or maintain language pairs.
Installation[edit]
$ git clone https://github.com/matxin/matxin.git $ cd matxin/ $ ./autogen.sh $ make # make install
Language pairs[edit]
Troubleshooting[edit]
- Can't find AP_MKINCLUDE
set your ACLOCAL_PATH
to include the path to matxin.m4
Documentation[edit]
- Descripción del sistema de traducción es-eu Matxin (in Spanish)
- Documentation of Matxin (in English)
- Matxin New Language Pair HOWTO
Contact[edit]
Questions and comments about Matxin can be sent to their mailing list matxin-devel, or to the apertium-stuff list.