getnltk.py is located in /trunk/apertium-tools/scraper/getnltk.py. It was written by Daniel Huang. Its purpose is to make NLTK's Punkt sentence tokenizer usable from Python 3 code.
You can call getnltk.py from your Python 3 code like this:
py2output = subprocess.check_output(['python', 'getnltk.py', tosplit, lang])
tosplit is the text that will be tokenized into sentences.
lang is the 3-letter or 2-letter language code. Currently, English, Russian, and Armenian are supported.
The sentences will be printed to the variable py2output.
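The call above can be wrapped in a small helper, sketched here under a few assumptions that the original text does not confirm: that getnltk.py writes one sentence per line to standard output, that the output is UTF-8 encoded, and that the python command resolves to an interpreter able to run the script. The names parse_sentences and split_sentences are hypothetical and exist only for this illustration.

```python
import subprocess


def parse_sentences(raw_output, encoding="utf-8"):
    """Turn the bytes captured by check_output into a list of sentences.

    Assumption: one sentence per line, UTF-8 encoded.
    """
    text = raw_output.decode(encoding)
    # Drop empty lines and surrounding whitespace.
    return [line.strip() for line in text.splitlines() if line.strip()]


def split_sentences(tosplit, lang, script="getnltk.py", interpreter="python"):
    """Run getnltk.py on `tosplit` and return the sentences as a list.

    `script` and `interpreter` are illustrative parameters; adjust them
    to wherever getnltk.py and a suitable interpreter live on your system.
    """
    # check_output returns the script's standard output as bytes in Python 3.
    py2output = subprocess.check_output([interpreter, script, tosplit, lang])
    return parse_sentences(py2output)
```

A call such as split_sentences(text, "en") would then return a list of strings, provided the script is in the current directory and the assumptions above hold. Keep in mind that check_output returns bytes in Python 3, so the result has to be decoded before it is used as text.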
xml2txt.py (in the same directory) uses getnltk.py.