Difference between revisions of "AOT"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
{{TOCD}} |
|||
'''AOT''' is a morphological analyser (or lemmatiser) for Russian. The main website is at [http://www.aot.ru www.aot.ru] but some of the download links are broken. Fortunately it is mirrored on SourceForge. |
'''AOT''' is a morphological analyser (or lemmatiser) for Russian. The main website is at [http://www.aot.ru www.aot.ru] but some of the download links are broken. Fortunately it is mirrored on SourceForge. |
||
Revision as of 10:50, 3 August 2011
Contents |
AOT is a morphological analyser (or lemmatiser) for Russian. The main website is at www.aot.ru but some of the download links are broken. Fortunately it is mirrored on SourceForge.
Download
mkdir /tmp/RML cd /tmp/RML wget http://heanet.dl.sourceforge.net/project/rupostagger/rupostagger/0.1.02/rupostagger-0.1.02.tar.gz tar -xzvf rupostagger-0.1.02.tar.gz cp rupostagger-0.1.02/LemServer/aot.ru/* . tar -xzvf lemmatizer.tar.gz tar -xzvf rus-src-morph.tar.gz export RML=/tmp/RML
Compile
Edit the files "./compile_ross.sh" and "compile_morph.sh" and replace "gmake" with "make"
Edit the files:
- "Source/MorphWizardLib/FormInfo.h"
- "Source/AgramtabLib/GerGramTab.cpp"
- "Source/AgramtabLib/RusGramTab.cpp"
- "Source/AgramtabLib/EngGramTab.cpp"
- "Source/AgramtabLib/agramtab_.cpp"
- "Source/StructDictLib/Ross.h"
- "Source/StructDictLib/Field.h"
- "Source/GraphanLib/C_desc.cpp"
- "Source/common/utilit.cpp"
- "Source/common/PlmLine.cpp", and add:
#include <string.h>
at the top.
Then:
./compile_morph.sh ./generate_morph_bin.sh Russian
Use
$ echo "язык" | iconv -f utf-8 -t koi8-r | ./Bin/TestLem Russian | head -3 | iconv -f koi8-r Loading.. Input a word.. >+ {ЯЗЫК, С, "од", ("мр,им,ед",) } Id=28549 Accented=ЯЗЫ'К