<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User_talk%3AIrene%2Fproposal</id>
	<title>User talk:Irene/proposal - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User_talk%3AIrene%2Fproposal"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User_talk:Irene/proposal&amp;action=history"/>
	<updated>2026-05-05T14:08:13Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User_talk:Irene/proposal&amp;diff=63151&amp;oldid=prev</id>
		<title>Irene: /* updated proposal */ new section</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User_talk:Irene/proposal&amp;diff=63151&amp;oldid=prev"/>
		<updated>2017-05-30T04:07:42Z</updated>

		<summary type="html">&lt;p&gt;&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;updated proposal: &lt;/span&gt; new section&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 04:07, 30 May 2017&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 17:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 17:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;[[Talk:Multiwords]] &amp;lt;br/&amp;gt;&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;[[Talk:Multiwords]] &amp;lt;br/&amp;gt;&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;[[Talk:Ideas for Google Summer of Code/Discontiguous multiwords]]&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;[[Talk:Ideas for Google Summer of Code/Discontiguous multiwords]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;br /&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;== updated proposal ==&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;br /&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;# tagging&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* tag every discontinuous word for what can split it (e.g. &quot;take out&quot; -&amp;gt; can be split by a np/sn) &lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* where to insert the tagging?&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;# transfer stage&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* sequences of discontinuous stuff&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;# pseudo:&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* if one of the sequences is encountered, then look into tags to see if it could be an instance of a discontinuous word&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* if it is, then check whether or not it is a [[Ideas_for_Google_Summer_of_Code/Discontiguous_multiwords#Coding_challenge | real discontinuous word]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;#* if it&#039;s real, then do re-ordering. if not, then do nothing.&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Irene</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User_talk:Irene/proposal&amp;diff=63124&amp;oldid=prev</id>
		<title>Irene: idea</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User_talk:Irene/proposal&amp;diff=63124&amp;oldid=prev"/>
		<updated>2017-05-28T00:46:54Z</updated>

		<summary type="html">&lt;p&gt;idea&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;stage 1: MORPHOLOGICAL ANALYSIS&lt;br /&gt;
# Re-examine every multiword for whether or not it can be discontinuous, e.g. &amp;#039;&amp;#039;call (something) off&amp;#039;&amp;#039;, &amp;#039;&amp;#039;cheer (someone) up&amp;#039;&amp;#039;&lt;br /&gt;
#* parsing for multiwords can be done with a language-independent search for the words marked with &amp;#039;&amp;#039;&amp;lt;b/&amp;gt;&amp;#039;&amp;#039;, but I think determining whether or not a specific word can be discontinuous has to be done by hand.&lt;br /&gt;
#* different than what i originally proposed..&lt;br /&gt;
# If a word is separable, then tag it as so (introduce a new tag symbol).&lt;br /&gt;
#* the tag should contain information about which categories of words (&amp;#039;&amp;#039;np, vp&amp;#039;&amp;#039;) can split them. this will be useful when it comes to chunking, and achieving this alleviates the need for some of the hacks that we&amp;#039;re currently using&lt;br /&gt;
#* maybe this calls for creating a section in the paradigm definitions, since many follow the same pattern: &amp;#039;&amp;#039;call (something) off, cheer (someone) up, take (it) out&amp;#039;&amp;#039; are all verb-np-preposition&lt;br /&gt;
&lt;br /&gt;
stage 2: CHUNKING&lt;br /&gt;
# if the appropriate &amp;quot;chunk&amp;quot; is sandwiched between the separable word, then reorder the sentence accordingly&lt;br /&gt;
#* inter-chunk stage&lt;br /&gt;
#* maybe this could be done with a grep&lt;br /&gt;
#* check for false positives: take the thing out of the box does not use take out, as in take out the trash&lt;br /&gt;
&lt;br /&gt;
==Links==&lt;br /&gt;
[[Multiwords]] &amp;lt;br/&amp;gt;&lt;br /&gt;
[[Talk:Multiwords]] &amp;lt;br/&amp;gt;&lt;br /&gt;
[[Talk:Ideas for Google Summer of Code/Discontiguous multiwords]]&lt;/div&gt;</summary>
		<author><name>Irene</name></author>
		
	</entry>
</feed>