Difference between revisions of "IRSTLM"
Naan Dhaan (talk | contribs) m (IRSTLM path correction)
Naan Dhaan (talk | contribs) m (Undo revision 73639 by Naan Dhaan (talk)) Tag: Undo
Revision as of 15:16, 25 July 2021
IRSTLM is a free and open-source toolkit for building exact statistical language models, using memory mapping. Its language models are compatible with those created with the closed-source SRILM Toolkit.
See the homepage at https://hlt-mt.fbk.eu/technologies/irstlm
Installation
svn checkout svn://svn.code.sf.net/p/irstlm/code/trunk irstlm
cd irstlm
cmake -G "Unix Makefiles" -DCMAKE_INSTALL_PREFIX=/path/prefix
make -j4
make install
Make a language model
export IRSTLM=/path/prefix
$IRSTLM/bin/build-lm.sh -i incorpus.txt -o out.lm.gz -t tmp/
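The command above fails with an unhelpful error if the install prefix or the corpus path is wrong. The following is a minimal sketch of a pre-flight check, not part of IRSTLM itself: the function name check_irstlm and its return codes are hypothetical, and the paths are the same placeholders used above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with IRSTLM): verify that the install
# prefix contains build-lm.sh and that the input corpus is readable.
# $1: install prefix, $2: input corpus
check_irstlm() {
    if [ ! -x "$1/bin/build-lm.sh" ]; then
        echo "build-lm.sh not found or not executable under $1/bin" >&2
        return 1
    fi
    if [ ! -r "$2" ]; then
        echo "cannot read corpus $2" >&2
        return 2
    fi
    return 0
}

# Placeholder paths, as in the example above.
export IRSTLM=/path/prefix
if check_irstlm "$IRSTLM" incorpus.txt; then
    "$IRSTLM/bin/build-lm.sh" -i incorpus.txt -o out.lm.gz -t tmp/
fi
```

If either check fails, the script prints a message to standard error and skips the build instead of starting it with a bad path.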
See also
- Moses (includes alternative LM system KenLM)
- Using GIZA++
- RandLM - a randomised LM, based on Bloom Filters