<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User%3AEden%2FGSoC2019_Progress-summary</id>
	<title>User:Eden/GSoC2019 Progress-summary - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=User%3AEden%2FGSoC2019_Progress-summary"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Eden/GSoC2019_Progress-summary&amp;action=history"/>
	<updated>2026-04-09T09:31:11Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=User:Eden/GSoC2019_Progress-summary&amp;diff=72200&amp;oldid=prev</id>
		<title>Eden: Created page with &quot;== Status table ==  {|class=wikitable |- !colspan=&quot;2&quot;|Week !colspan=&quot;2&quot;|Stems !colspan=&quot;2&quot;|naïve coverage !colspan=&quot;2&quot;|WER,PER !colspan=&quot;2&quot;|Progress |- ! № ! dates ! lin !...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=User:Eden/GSoC2019_Progress-summary&amp;diff=72200&amp;oldid=prev"/>
		<updated>2020-05-12T23:21:20Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Status table ==  {|class=wikitable |- !colspan=&amp;quot;2&amp;quot;|Week !colspan=&amp;quot;2&amp;quot;|Stems !colspan=&amp;quot;2&amp;quot;|naïve coverage !colspan=&amp;quot;2&amp;quot;|WER,PER !colspan=&amp;quot;2&amp;quot;|Progress |- ! № ! dates ! lin !...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Status table ==&lt;br /&gt;
&lt;br /&gt;
{|class=wikitable&lt;br /&gt;
|-&lt;br /&gt;
!colspan=&amp;quot;2&amp;quot;|Week&lt;br /&gt;
!colspan=&amp;quot;2&amp;quot;|Stems&lt;br /&gt;
!colspan=&amp;quot;2&amp;quot;|naïve coverage&lt;br /&gt;
!colspan=&amp;quot;2&amp;quot;|WER,PER&lt;br /&gt;
!colspan=&amp;quot;2&amp;quot;|Progress&lt;br /&gt;
|-&lt;br /&gt;
! №&lt;br /&gt;
! dates&lt;br /&gt;
! lin&lt;br /&gt;
! lin-eng&lt;br /&gt;
! lin&lt;br /&gt;
! lin-eng&lt;br /&gt;
! lin→eng&lt;br /&gt;
! eng→lin&lt;br /&gt;
!Evaluation&lt;br /&gt;
!Notes&lt;br /&gt;
|-&lt;br /&gt;
| 0&lt;br /&gt;
| May 20 - May 26&lt;br /&gt;
| 727&lt;br /&gt;
| 139&lt;br /&gt;
| 61.95%&lt;br /&gt;
| 40.86%&lt;br /&gt;
| 86.79%,80.87%&lt;br /&gt;
| 75.27%,63.98%&lt;br /&gt;
| &lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| 1&lt;br /&gt;
| May 27 - June 02&lt;br /&gt;
| 904&lt;br /&gt;
| 139&lt;br /&gt;
| 62.57%&lt;br /&gt;
| 40.86%&lt;br /&gt;
| 86.79%,80.87%&lt;br /&gt;
| 75.27%,63.98%&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| 2&lt;br /&gt;
| May 03 - June 09&lt;br /&gt;
| 1,154&lt;br /&gt;
| 1,416&lt;br /&gt;
| 63.17%&lt;br /&gt;
| 53.03%&lt;br /&gt;
| 87.02%,79.95%&lt;br /&gt;
| 74.46%,60.22%&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| 3&lt;br /&gt;
| June 10 - June 16&lt;br /&gt;
| 1,172&lt;br /&gt;
| 1,501&lt;br /&gt;
| &lt;br /&gt;
| 61.60%&lt;br /&gt;
| 91.57%,79.04%&lt;br /&gt;
| 75.85%,62.90%&lt;br /&gt;
|&lt;br /&gt;
| WER for &amp;#039;lin-eng&amp;#039; went up because of an incomplete rule for verbs that creates unnecessary pronouns. Main work next week will be on rules to dramatically improve WER and PER.&lt;br /&gt;
|-&lt;br /&gt;
| 4&lt;br /&gt;
| June 17 - June 23&lt;br /&gt;
| 1,200&lt;br /&gt;
| 1,540&lt;br /&gt;
| 69.70%&lt;br /&gt;
| 62.70%&lt;br /&gt;
| 79.27%,64.24%&lt;br /&gt;
| 84.41%,72.58%&lt;br /&gt;
|&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| 5&lt;br /&gt;
| June 24 - June 30&lt;br /&gt;
| 1,200&lt;br /&gt;
| 1,556&lt;br /&gt;
| 70.21%&lt;br /&gt;
| 61.90%&lt;br /&gt;
| 77.68%,67.88%&lt;br /&gt;
| 85.48%,73.92%&lt;br /&gt;
|&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| July 1 - July 7&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| July 8 - July 14&lt;br /&gt;
|1,236&lt;br /&gt;
|1,577&lt;br /&gt;
|69.35%&lt;br /&gt;
|60.47%&lt;br /&gt;
|60.59%,46.47%&lt;br /&gt;
|72.61%,58.68%&lt;br /&gt;
|&lt;br /&gt;
|Work was done on lexical selection and rules about determinants. Current lexical selection works well with the text currently in use, which is a more rigid and literary Lingala. Further tests will be run on texts from the Wikipedia corpus to generalize lexical rules.&lt;br /&gt;
|-&lt;br /&gt;
| 8&lt;br /&gt;
| July 15 - July 21&lt;br /&gt;
|1,280 &lt;br /&gt;
|1,580&lt;br /&gt;
|72.81%&lt;br /&gt;
|68.62%&lt;br /&gt;
|52.62%,42.82%&lt;br /&gt;
|59.04%,46.28%&lt;br /&gt;
|&lt;br /&gt;
| WER went down in both directions by approximately 2% after I added accents, and missing ɔ́ ɔ ɛ́ ɛ. Next focus will be on negation and trying to find a bigger corpus(&amp;gt;1000 words).&lt;br /&gt;
|-&lt;br /&gt;
| 9&lt;br /&gt;
| July 22 - July 28&lt;br /&gt;
| 1,320&lt;br /&gt;
| 1,600&lt;br /&gt;
| 73.24%&lt;br /&gt;
| 68.92%&lt;br /&gt;
| 50.02%,41.55%&lt;br /&gt;
| 52.81%,40.09%&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
| 10&lt;br /&gt;
| July 29 - Aug 04&lt;br /&gt;
| &lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
| Work was mainly on lexical selection rules. First half of Bible translation(~1,100 words) is understandable.&lt;br /&gt;
|-&lt;br /&gt;
| 11&lt;br /&gt;
| Aug 5 - Aug 11&lt;br /&gt;
| 1,341&lt;br /&gt;
| 1,661&lt;br /&gt;
| 75.35%&lt;br /&gt;
| 69.33%&lt;br /&gt;
| 48.97%,39.18%&lt;br /&gt;
| 53.99%,41.49%&lt;br /&gt;
|&lt;br /&gt;
| Lexical selection rules for &amp;#039;na&amp;#039; and &amp;#039;ya&amp;#039;. WER in eng-lin went up because I commented out some words in the bidix. &lt;br /&gt;
|-&lt;br /&gt;
| 12&lt;br /&gt;
| Aug 12 - Aug 18&lt;br /&gt;
| 1,444&lt;br /&gt;
| 1,700&lt;br /&gt;
| 76.5%&lt;br /&gt;
| 71.10%&lt;br /&gt;
| 48.52%,37.81%&lt;br /&gt;
| 50.13%,38.13%&lt;br /&gt;
|&lt;br /&gt;
| Added missing morphology for determinants and adjectives. &lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
* To count stems in &amp;lt;code&amp;gt;lexc&amp;lt;/code&amp;gt;,  try:&lt;br /&gt;
  grep -E &amp;quot;:\w+.*;&amp;quot; apertium-lin.lin.lexc | grep -v &amp;quot;[&amp;lt;&amp;gt;]&amp;quot; | wc -l&lt;br /&gt;
&lt;br /&gt;
* To count stems in the bidix, try this:&lt;br /&gt;
  grep &amp;quot;&amp;lt;p&amp;quot; apertium-eng-lin.eng-lin.dix  | wc -l&lt;br /&gt;
&lt;br /&gt;
* To get WER and PER use &amp;lt;code&amp;gt;apertium-eval-translator-line&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Coverage above is on [https://dumps.wikimedia.org/lnwiki/20190520/ 2019-05-20 Wikipedia dump].&lt;/div&gt;</summary>
		<author><name>Eden</name></author>
		
	</entry>
</feed>