Difference between revisions of "AOT"
External links
- Description of morphological tags: http://aot.ru/docs/rusmorph.html
Revision as of 10:49, 3 August 2011
AOT is a morphological analyser (or lemmatiser) for Russian. The main website is www.aot.ru, but some of its download links are broken. Fortunately, it is mirrored on SourceForge.
Download
mkdir /tmp/RML
cd /tmp/RML
wget http://heanet.dl.sourceforge.net/project/rupostagger/rupostagger/0.1.02/rupostagger-0.1.02.tar.gz
tar -xzvf rupostagger-0.1.02.tar.gz
cp rupostagger-0.1.02/LemServer/aot.ru/* .
tar -xzvf lemmatizer.tar.gz
tar -xzvf rus-src-morph.tar.gz
export RML=/tmp/RML
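Before compiling, it can help to confirm that the archives unpacked where the later steps expect them. The following is a minimal sketch: `check_rml` is a hypothetical helper, and it assumes that the scripts and the "Source" directory mentioned in the Compile section sit directly under $RML.

```shell
# Hypothetical helper: check that the directory named by $RML (or by the
# first argument) contains the paths used later on this page.
# Assumption: these paths sit directly under $RML.
check_rml() {
    dir=${1:-${RML:-}}
    if [ -z "$dir" ]; then
        echo "RML is not set" >&2
        return 1
    fi
    missing=0
    for p in compile_morph.sh generate_morph_bin.sh Source; do
        if [ ! -e "$dir/$p" ]; then
            echo "missing: $dir/$p" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Report the result without aborting the shell session.
check_rml || echo "RML tree looks incomplete" >&2
```

If something is reported missing, re-running the two "tar" commands from inside /tmp/RML is the first thing to try.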
Compile
Edit the files "compile_ross.sh" and "compile_morph.sh" and replace "gmake" with "make" (on most Linux systems GNU make is installed as "make" rather than "gmake").
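The replacement can also be done non-interactively. This is a sketch only: `replace_gmake` is a hypothetical helper name, and it assumes the string "gmake" appears in these scripts only as the command name. The `-i.bak` form keeps a backup copy of each file and works with both GNU and BSD sed.

```shell
# Hypothetical helper: replace every "gmake" with "make" in the given files,
# keeping the original of each file as <name>.bak.
replace_gmake() {
    for f in "$@"; do
        if [ ! -f "$f" ]; then
            echo "skipping missing file: $f" >&2
            continue
        fi
        sed -i.bak 's/gmake/make/g' "$f"
    done
}

# Run from the directory that contains the two build scripts.
replace_gmake ./compile_ross.sh ./compile_morph.sh
```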
Add the line
#include <string.h>
at the top of each of the following files:
- "Source/MorphWizardLib/FormInfo.h"
- "Source/AgramtabLib/GerGramTab.cpp"
- "Source/AgramtabLib/RusGramTab.cpp"
- "Source/AgramtabLib/EngGramTab.cpp"
- "Source/AgramtabLib/agramtab_.cpp"
- "Source/StructDictLib/Ross.h"
- "Source/StructDictLib/Field.h"
- "Source/GraphanLib/C_desc.cpp"
- "Source/common/utilit.cpp"
- "Source/common/PlmLine.cpp"
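Editing ten files by hand is tedious, so the include line can be prepended with a small loop. This is a sketch, not part of the AOT distribution: `add_string_h` is a hypothetical helper, and it assumes it is run from the directory that contains "Source". Files that already contain the line are left alone, so running it twice is harmless.

```shell
# Hypothetical helper: put "#include <string.h>" on the first line of each
# given file, unless the file already contains that line.
add_string_h() {
    for f in "$@"; do
        if [ ! -f "$f" ]; then
            echo "skipping missing file: $f" >&2
            continue
        fi
        if grep -qF '#include <string.h>' "$f"; then
            continue
        fi
        tmp=$(mktemp)
        # Write header + original content to a temp file, then copy it back
        # so the original file keeps its permissions.
        { printf '#include <string.h>\n'; cat "$f"; } > "$tmp" && cat "$tmp" > "$f"
        rm -f "$tmp"
    done
}

add_string_h \
    Source/MorphWizardLib/FormInfo.h \
    Source/AgramtabLib/GerGramTab.cpp \
    Source/AgramtabLib/RusGramTab.cpp \
    Source/AgramtabLib/EngGramTab.cpp \
    Source/AgramtabLib/agramtab_.cpp \
    Source/StructDictLib/Ross.h \
    Source/StructDictLib/Field.h \
    Source/GraphanLib/C_desc.cpp \
    Source/common/utilit.cpp \
    Source/common/PlmLine.cpp
```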
Then:
./compile_morph.sh
./generate_morph_bin.sh Russian
Use
$ echo "язык" | iconv -f utf-8 -t koi8-r | ./Bin/TestLem Russian | head -3 | iconv -f koi8-r
Loading..
Input a word..
>+ {ЯЗЫК, С, "од", ("мр,im,ед",) } Id=28549 Accented=ЯЗЫ'К
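For scripting, the pipeline above can be wrapped in a function and the lemma pulled out of the result line. This is a rough sketch: the function names are hypothetical, and the parsing is inferred only from the single sample output line shown above, so other result formats TestLem may print are not handled.

```shell
# Hypothetical wrapper around the pipeline shown above. Assumes TestLem
# lives in $RML/Bin (or ./Bin if RML is unset) and reads KOI8-R.
lemmatise() {
    printf '%s\n' "$1" | iconv -f utf-8 -t koi8-r \
        | "${RML:-.}/Bin/TestLem" Russian | head -3 | iconv -f koi8-r
}

# Print the lemma: the text between "{" and the first comma on a ">+" line.
testlem_lemma() {
    sed -n 's/^>+ {\([^,]*\),.*/\1/p'
}

# Print the numeric value that follows "Id=".
testlem_id() {
    sed -n 's/.* Id=\([0-9][0-9]*\).*/\1/p'
}
```

With these definitions, `lemmatise "язык" | testlem_lemma` would print the lemma of the first analysis, provided TestLem has been built as described in the Compile section.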