<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Getnltk.py</id>
	<title>Getnltk.py - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Getnltk.py"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;action=history"/>
	<updated>2026-05-15T17:39:42Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=66194&amp;oldid=prev</id>
		<title>Shardulc: GitHub migration</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=66194&amp;oldid=prev"/>
		<updated>2018-03-10T02:32:37Z</updated>

		<summary type="html">&lt;p&gt;GitHub migration&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 02:32, 10 March 2018&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;{{Github-unmigrated-tool}}&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang]. The purpose is to make NLTK&#039;s Punkt sentence tokenizer work on Python 3.&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang]. The purpose is to make NLTK&#039;s Punkt sentence tokenizer work on Python 3.&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;&amp;lt;br /&amp;gt;&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;&amp;lt;br /&amp;gt;&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Shardulc</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=37893&amp;oldid=prev</id>
		<title>Dtvrij74 at 00:54, 2 January 2013</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=37893&amp;oldid=prev"/>
		<updated>2013-01-02T00:54:37Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 00:54, 2 January 2013&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-deletedline diff-side-deleted&quot;&gt;&lt;div&gt;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang].&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang]&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;. The purpose is to make NLTK&#039;s Punkt sentence tokenizer work on Python 3&lt;/ins&gt;.&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;&amp;lt;br /&amp;gt;&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;&amp;lt;br /&amp;gt;&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-deletedline diff-side-deleted&quot;&gt;&lt;div&gt;&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;If you want to use NLTK&#039;s Punkt sentence tokenizer, you&lt;/del&gt; can call &amp;lt;code&amp;gt;getnltk.py&amp;lt;/code&amp;gt; in your Python 3 code like:&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;You&lt;/ins&gt; can call &amp;lt;code&amp;gt;getnltk.py&amp;lt;/code&amp;gt; in your Python 3 code like:&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;&amp;lt;pre&amp;gt;&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;&amp;lt;pre&amp;gt;&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;py2output = subprocess.check_output([&#039;python&#039;, &#039;getnltk.py&#039;, tosplit, lang])&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;py2output = subprocess.check_output([&#039;python&#039;, &#039;getnltk.py&#039;, tosplit, lang])&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Dtvrij74</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=37889&amp;oldid=prev</id>
		<title>Dtvrij74: Created page with &#039;getnltk.py is located in &lt;code&gt;/trunk/apertium-tools/scraper/getnltk.py&lt;/code&gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang]. &lt;br /&gt; If you want t…&#039;</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Getnltk.py&amp;diff=37889&amp;oldid=prev"/>
		<updated>2013-01-02T00:49:56Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;#039;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang]. &amp;lt;br /&amp;gt; If you want t…&amp;#039;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;getnltk.py is located in &amp;lt;code&amp;gt;/trunk/apertium-tools/scraper/getnltk.py&amp;lt;/code&amp;gt;. It was written by [http://wiki.apertium.org/wiki/User:Dtvrij74 Daniel Huang].&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
If you want to use NLTK&amp;#039;s Punkt sentence tokenizer, you can call &amp;lt;code&amp;gt;getnltk.py&amp;lt;/code&amp;gt; in your Python 3 code like:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
py2output = subprocess.check_output([&amp;#039;python&amp;#039;, &amp;#039;getnltk.py&amp;#039;, tosplit, lang])&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* &amp;lt;code&amp;gt;tosplit&amp;lt;/code&amp;gt; is the text that will be tokenized into sentences&lt;br /&gt;
* &amp;lt;code&amp;gt;lang&amp;lt;/code&amp;gt; is the 3-letter or 2-letter language code. Currently, it supports English, Russian, and Armenian.&lt;br /&gt;
&lt;br /&gt;
The sentences will be printed to the variable &amp;lt;code&amp;gt;py2output&amp;lt;/code&amp;gt;. &amp;lt;code&amp;gt;xml2txt.py&amp;lt;/code&amp;gt; (in the same directory) uses &amp;lt;code&amp;gt;getnltk.py&amp;lt;/code&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Dtvrij74</name></author>
		
	</entry>
</feed>