<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Aragonese_and_Catalan%2FEvaluation</id>
	<title>Aragonese and Catalan/Evaluation - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Aragonese_and_Catalan%2FEvaluation"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Aragonese_and_Catalan/Evaluation&amp;action=history"/>
	<updated>2026-04-04T02:20:20Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Aragonese_and_Catalan/Evaluation&amp;diff=56150&amp;oldid=prev</id>
		<title>Juanpabl at 08:55, 16 January 2016</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Aragonese_and_Catalan/Evaluation&amp;diff=56150&amp;oldid=prev"/>
		<updated>2016-01-16T08:55:44Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 08:55, 16 January 2016&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 60:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 60:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;Percentage of unknown words that were free rides: 32.69 %&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;Percentage of unknown words that were free rides: 32.69 %&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;&amp;lt;/pre&amp;gt;&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;&amp;lt;/pre&amp;gt;&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;br /&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;[[Category:Aragonese and Catalan]]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Juanpabl</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Aragonese_and_Catalan/Evaluation&amp;diff=56149&amp;oldid=prev</id>
		<title>Juanpabl: Created page with &quot;== Version 0.1 (Beta) ==  === Naïve coverage === ==== arg-cat ==== &lt;pre&gt; $ cat corpus_narrative.txt | sh corpus-stat-arg-cat.sh Number of tokenised words in the corpus: 37844...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Aragonese_and_Catalan/Evaluation&amp;diff=56149&amp;oldid=prev"/>
		<updated>2016-01-16T08:54:47Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Version 0.1 (Beta) ==  === Naïve coverage === ==== arg-cat ==== &amp;lt;pre&amp;gt; $ cat corpus_narrative.txt | sh corpus-stat-arg-cat.sh Number of tokenised words in the corpus: 37844...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Version 0.1 (Beta) ==&lt;br /&gt;
&lt;br /&gt;
=== Naïve coverage ===&lt;br /&gt;
==== arg-cat ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ cat corpus_narrative.txt | sh corpus-stat-arg-cat.sh&lt;br /&gt;
Number of tokenised words in the corpus: 378440&lt;br /&gt;
Number of known words in the corpus: 337924&lt;br /&gt;
Coverage:     89.3 %&lt;br /&gt;
&lt;br /&gt;
$ cat sentencelistanwiki.txt | sh corpus-stat-arg-cat.sh&lt;br /&gt;
Number of tokenised words in the corpus: 2673751&lt;br /&gt;
Number of known words in the corpus: 2344686&lt;br /&gt;
Coverage:     87.7 %&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
==== cat-arg ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ cat ../apertium-es-ca/ca-tagger-data/ca.tagged.txt | sh corpus-stat-cat-arg.sh&lt;br /&gt;
Number of tokenised words in the corpus: 24590&lt;br /&gt;
Number of known words in the corpus: 22919&lt;br /&gt;
Coverage:     93.2 %&lt;br /&gt;
&lt;br /&gt;
# Corpus: trunk/apertium-eo-ca/tekstaro/ca.crp.txt&lt;br /&gt;
$ cat ca.crp.txt | sed &amp;#039;s/^ *[0123456789]*\.//g&amp;#039;| sh ./corpus-stat-cat-arg.sh&lt;br /&gt;
Number of tokenised words in the corpus: 567608&lt;br /&gt;
Number of known words in the corpus: 497165&lt;br /&gt;
Coverage:     87.6 %&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
=== Translation Quality ===&lt;br /&gt;
==== cat-arg ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ../apertium-eval-translator/apertium-eval-translator.pl -test MT.txt -ref postedit.txt&lt;br /&gt;
Test file: &amp;#039;MT.txt&amp;#039;&lt;br /&gt;
Reference file &amp;#039;postedit.txt&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Statistics about input files&lt;br /&gt;
-------------------------------------------------------&lt;br /&gt;
Number of words in reference: 1311&lt;br /&gt;
Number of words in test: 1315&lt;br /&gt;
Number of unknown words (marked with a star) in test: 156&lt;br /&gt;
Percentage of unknown words: 11.86 %&lt;br /&gt;
&lt;br /&gt;
Results when removing unknown-word marks (stars)&lt;br /&gt;
-------------------------------------------------------&lt;br /&gt;
Edit distance: 203&lt;br /&gt;
Word error rate (WER): 15.48 %&lt;br /&gt;
Number of position-independent correct words: 1132&lt;br /&gt;
Position-independent word error rate (PER): 13.96 %&lt;br /&gt;
&lt;br /&gt;
Results when unknown-word marks (stars) are not removed&lt;br /&gt;
-------------------------------------------------------&lt;br /&gt;
Edit distance: 254&lt;br /&gt;
Word Error Rate (WER): 19.37 %&lt;br /&gt;
Number of position-independent correct words: 1081&lt;br /&gt;
Position-independent word error rate (PER): 17.85 %&lt;br /&gt;
&lt;br /&gt;
Statistics about the translation of unknown words&lt;br /&gt;
-------------------------------------------------------&lt;br /&gt;
Number of unknown words which were free rides: 51&lt;br /&gt;
Percentage of unknown words that were free rides: 32.69 %&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Juanpabl</name></author>
		
	</entry>
</feed>