Cleanstream

From Apertium
Jump to navigation Jump to search
Note: After Apertium's migration to GitHub, this tool is read-only on the SourceForge repository and does not exist on GitHub. If you are interested in migrating this tool to GitHub, see Migrating tools to GitHub.

apertium-cleanstream removes superblanks and such from the Apertium Stream Format, and with the -n option puts each lexical unit on one line, so ^foo/bar<n>$[<b>]^fie/fum<vblex>$ ^./.<sent>$ sent through apertium-cleanstream -n turns into

^foo/bar<n>$
^fie/fum<vblex>$
^./.<sent>$

Check out, compile and install like this (no prerequisites apart from g++):

$ svn co https://svn.code.sf.net/p/apertium/svn/trunk/apertium-tools/apertium-cleanstream
$ make
$ sudo cp apertium-cleanstream /usr/local/bin