<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User%3AFpetkovski%2FGSOC_2012_Report</id>
	<title>User:Fpetkovski/GSOC 2012 Report - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User%3AFpetkovski%2FGSOC_2012_Report"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;action=history"/>
	<updated>2026-06-20T14:29:52Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=52687&amp;oldid=prev</id>
		<title>Unhammer: backlink</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=52687&amp;oldid=prev"/>
		<updated>2015-02-09T11:18:45Z</updated>

		<summary type="html">&lt;p&gt;backlink&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 11:18, 9 February 2015&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 2:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 2:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;* [[Corpus based preposition selection - HOWTO]]&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;* [[Corpus based preposition selection - HOWTO]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;* [[Building a pseudo-parallel corpus]]&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;* [[Building a pseudo-parallel corpus]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;* [[Ideas for Google Summer of Code/Corpus-based lexicalised feature transfer]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;br /&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;br /&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;==Reports==&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;==Reports==&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Unhammer</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=40286&amp;oldid=prev</id>
		<title>Francis Tyers: moved GSOC 2012 Report to User:Fpetkovski/GSOC 2012 Report</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=40286&amp;oldid=prev"/>
		<updated>2013-04-11T13:16:05Z</updated>

		<summary type="html">&lt;p&gt;moved &lt;a href=&quot;/w/index.php?title=GSOC_2012_Report&amp;amp;action=edit&amp;amp;redlink=1&quot; class=&quot;new&quot; title=&quot;GSOC 2012 Report (page does not exist)&quot;&gt;GSOC 2012 Report&lt;/a&gt; to &lt;a href=&quot;/wiki/User:Fpetkovski/GSOC_2012_Report&quot; title=&quot;User:Fpetkovski/GSOC 2012 Report&quot;&gt;User:Fpetkovski/GSOC 2012 Report&lt;/a&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 13:16, 11 April 2013&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;en&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Francis Tyers</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=40269&amp;oldid=prev</id>
		<title>Fpetkovski: Created page with &#039;==Documentation / HOWTO== * Corpus based preposition selection - HOWTO * Building a pseudo-parallel corpus  ==Reports== * Lexical feature transfer - First report  * […&#039;</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Fpetkovski/GSOC_2012_Report&amp;diff=40269&amp;oldid=prev"/>
		<updated>2013-04-11T12:19:24Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;#039;==Documentation / HOWTO== * &lt;a href=&quot;/wiki/Corpus_based_preposition_selection_-_HOWTO&quot; title=&quot;Corpus based preposition selection - HOWTO&quot;&gt;Corpus based preposition selection - HOWTO&lt;/a&gt; * &lt;a href=&quot;/wiki/Building_a_pseudo-parallel_corpus&quot; title=&quot;Building a pseudo-parallel corpus&quot;&gt;Building a pseudo-parallel corpus&lt;/a&gt;  ==Reports== * &lt;a href=&quot;/wiki/Lexical_feature_transfer_-_First_report&quot; title=&quot;Lexical feature transfer - First report&quot;&gt;Lexical feature transfer - First report&lt;/a&gt;  * […&amp;#039;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;==Documentation / HOWTO==&lt;br /&gt;
* [[Corpus based preposition selection - HOWTO]]&lt;br /&gt;
* [[Building a pseudo-parallel corpus]]&lt;br /&gt;
&lt;br /&gt;
==Reports==&lt;br /&gt;
* [[Lexical feature transfer - First report]]&lt;br /&gt;
&lt;br /&gt;
* [[Lexical feature transfer - Second report]]&lt;br /&gt;
&lt;br /&gt;
==TODO==&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Try generating corpus from monolingual SL corpus:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
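** Оваа лабавост на регулативите се одразува врз третманот на уапсените корисници на дрога. (English: &amp;quot;This laxity of the regulations is reflected in the treatment of arrested drug users.&amp;quot;)&lt;br /&gt;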
&amp;lt;s&amp;gt;*** Run through lexical transfer &amp;lt;code&amp;gt;mk-en-biltrans&amp;lt;/code&amp;gt;&lt;br /&gt;
*** Run through &amp;lt;code&amp;gt;apertium-lex-tools/scripts/biltrans-to-multitrans.py&amp;lt;/code&amp;gt;&lt;br /&gt;
*** Run through the rest of the pipeline from &amp;lt;code&amp;gt;apertium-transfer -b&amp;lt;/code&amp;gt; onwards&lt;br /&gt;
*** Run through &amp;lt;code&amp;gt;apertium-lex-learner/irstlm-ranker&amp;lt;/code&amp;gt;&amp;lt;/s&amp;gt;&lt;br /&gt;
** This will give:&lt;br /&gt;
&amp;lt;s&amp;gt;*** SL:TL selection possibilities &lt;br /&gt;
*** probabilities from the TL language model for each selection&amp;lt;/s&amp;gt;&lt;br /&gt;
** Select a subset for training where one translation has a substantially higher proportion of the probability mass than the rest.&lt;br /&gt;
** Determine what threshold &amp;quot;substantially&amp;quot; should correspond to.&lt;br /&gt;
&lt;br /&gt;
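A minimal sketch of the subset selection described above, assuming each SL sentence comes with a mapping from candidate translation to a linear (not log) language-model probability. The function name select_confident and the 0.8 threshold are hypothetical placeholders, not part of apertium-lex-tools; the right value for the threshold is exactly the open question in this list.

```python
def select_confident(candidates, threshold=0.8):
    """Return the dominant translation, or None if no candidate dominates.

    candidates: dict mapping a candidate translation to a non-negative
    language-model probability (assumed linear, not log-scale).
    threshold: hypothetical minimum share of the probability mass.
    """
    total = sum(candidates.values())
    if total == 0:
        return None  # nothing to normalise against
    best, best_score = max(candidates.items(), key=lambda kv: kv[1])
    share = best_score / total  # proportion of the mass held by the best candidate
    return best if share >= threshold else None


def training_subset(scored_sentences, threshold=0.8):
    """Keep only the sentences where one translation clearly dominates."""
    subset = []
    for sentence, candidates in scored_sentences:
        choice = select_confident(candidates, threshold)
        if choice is not None:
            subset.append((sentence, choice))
    return subset
```

Sweeping the threshold over a held-out set and comparing the resulting models would be one way to decide what "substantially" should mean.
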
* &amp;#039;&amp;#039;&amp;#039;Improve current method:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&amp;lt;s&amp;gt;** Split test corpus in two (dev, test)&lt;br /&gt;
*** Rerun the experiments and check with test corpus&lt;br /&gt;
*** Look at dev corpus to see what kind of patterns there are in lines that aren&amp;#039;t getting matched&lt;br /&gt;
** Look at combining the 1-feature with the 2-feature model as backoff.&lt;br /&gt;
&amp;lt;/s&amp;gt;&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Evaluation&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
** Try paired bootstrap resampling between best system and default translation for both WER and BLEU.&lt;br /&gt;
&lt;br /&gt;
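A rough sketch of paired bootstrap resampling for the WER case, assuming per-sentence statistics are already available for both systems as (edit_errors, reference_words) pairs over the same test sentences with non-empty references. The names corpus_wer and paired_bootstrap are hypothetical; this illustrates the general technique and is not an existing Apertium script.

```python
import random


def corpus_wer(stats, indices):
    """Corpus-level WER over the chosen sentences: total errors / total reference words."""
    errors = sum(stats[i][0] for i in indices)
    ref_words = sum(stats[i][1] for i in indices)
    return errors / ref_words


def paired_bootstrap(stats_a, stats_b, samples=1000, seed=0):
    """Fraction of resampled test sets on which system A has lower WER than system B.

    stats_a, stats_b: lists of (edit_errors, reference_words), aligned by sentence.
    """
    if len(stats_a) != len(stats_b):
        raise ValueError("both systems must be scored on the same sentences")
    rng = random.Random(seed)
    n = len(stats_a)
    wins_a = 0
    for _ in range(samples):
        # Draw n sentence indices with replacement; use the same draw for both systems.
        indices = [rng.randrange(n) for _ in range(n)]
        if corpus_wer(stats_b, indices) > corpus_wer(stats_a, indices):
            wins_a += 1
    return wins_a / samples
```

A result of 0.95 or higher is commonly read as system A being better at the 95% level. For BLEU the same loop applies, with corpus_wer replaced by a corpus-level BLEU computed over the resampled sentences and the comparison reversed, since higher BLEU is better.
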
* &amp;#039;&amp;#039;&amp;#039;Check the bidix entries that were added automatically&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[Category:Users|Fpetkovski]]&lt;/div&gt;</summary>
		<author><name>Fpetkovski</name></author>
		
	</entry>
</feed>