Matxin-eng

matxin-eng wraps Stanford CoreNLP so that it can be used as the English analyser in the Matxin machine translation system.

Prerequisites

See also: Matxin

Install Matxin first. matxin-eng also depends on Java libraries (Stanford CoreNLP); the repository includes a script that downloads them.

Installing

$ git clone https://github.com/matxin/matxin-eng.git
$ cd matxin-eng
$ sh scripts/download-corenlp.sh
$ ./autogen.sh
$ make
$ sudo make install

Using

$ echo "This is a test." | matxin -d . -f none eng-analyser
<?xml version='1.0' encoding='UTF-8' ?>
<?xml-stylesheet type='text/xsl' href='profit.xsl'?>
<corpus>
<SENTENCE ord='1'>
    <CHUNK ord='4' alloc='10' type='sn' si='root'>
        <NODE ord='4' alloc='10' form='test' lem='test' mi='NN'>
            <NODE ord='3' alloc='8' form='a' lem='a' mi='DT'/>
        </NODE>
        <CHUNK ord='1' alloc='0' type='UNK' si='nsubj'>
            <NODE ord='1' alloc='0' form='This' lem='this' mi='DT'/>
        </CHUNK>
        <CHUNK ord='2' alloc='5' type='grup-verb' si='cop'>
            <NODE ord='2' alloc='5' form='is' lem='be' mi='VBZ'/>
        </CHUNK>
    </CHUNK>
    <CHUNK ord='5' alloc='14' type='UNK' si='root'>
        <NODE ord='5' alloc='14' form='.' lem='.' mi='.'/>
    </CHUNK>
</SENTENCE>
</corpus>
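
The output above is plain XML, so it can be inspected with any XML library. The following is a minimal sketch using Python's standard xml.etree.ElementTree module; it is not part of matxin-eng, and the helper names tokens and chunks are hypothetical. It assumes, judging from the sample only, that ord is the 1-based word position and alloc the character offset of the form in the input.

```python
import xml.etree.ElementTree as ET

# Analyser output for "This is a test.", copied from the example above.
# The XML declaration and stylesheet line are left out for brevity.
SAMPLE = """<corpus>
<SENTENCE ord='1'>
    <CHUNK ord='4' alloc='10' type='sn' si='root'>
        <NODE ord='4' alloc='10' form='test' lem='test' mi='NN'>
            <NODE ord='3' alloc='8' form='a' lem='a' mi='DT'/>
        </NODE>
        <CHUNK ord='1' alloc='0' type='UNK' si='nsubj'>
            <NODE ord='1' alloc='0' form='This' lem='this' mi='DT'/>
        </CHUNK>
        <CHUNK ord='2' alloc='5' type='grup-verb' si='cop'>
            <NODE ord='2' alloc='5' form='is' lem='be' mi='VBZ'/>
        </CHUNK>
    </CHUNK>
    <CHUNK ord='5' alloc='14' type='UNK' si='root'>
        <NODE ord='5' alloc='14' form='.' lem='.' mi='.'/>
    </CHUNK>
</SENTENCE>
</corpus>"""


def tokens(xml_text):
    """Hypothetical helper: (ord, form, lem, mi) for every NODE, sorted by ord."""
    root = ET.fromstring(xml_text)
    rows = [
        (int(node.get("ord")), node.get("form"), node.get("lem"), node.get("mi"))
        for node in root.iter("NODE")
    ]
    return sorted(rows)


def chunks(xml_text):
    """Hypothetical helper: (ord, type, si) for every CHUNK, in document order."""
    root = ET.fromstring(xml_text)
    return [
        (int(chunk.get("ord")), chunk.get("type"), chunk.get("si"))
        for chunk in root.iter("CHUNK")
    ]


if __name__ == "__main__":
    for row in tokens(SAMPLE):
        print(row)
    for row in chunks(SAMPLE):
        print(row)
```

Sorting the nodes by ord recovers the surface order of the sentence, since the XML itself is ordered by dependency structure rather than by word position. To process real analyser output instead of the embedded sample, the bytes read from standard input can be passed to ET.fromstring in place of SAMPLE.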