Difference between revisions of "Meta-evaluation"

Revision as of 22:12, 30 May 2019

Apertium language modules and translation pairs are subject to the following types of evaluation:

Morphology coverage / regression testing
Size of system
- Number of stems in lexc, monodix, bidix
- Number of disambiguation rules
- Number of lexical selection rules
- Number of transfer rules
Naïve coverage
- Monolingual naïve coverage
- Trimmed naïve coverage (i.e., using a trimmed dictionary)
Accuracy of analyses
- Precision/Recall/F-score
Accuracy of translation
- WER/PER/BLEU
Clenliness of translation output
- Testvoc

Morphology coverage

The tools we have for this are aq-morftest from Apertium quality and morph-test.py.

Naïve coverage

In theory, aq-covtest does this, but mostly people write their own scripts.

A good generalised script that supports hfst and lttoolbox binaries and arbitrary corpora would be good. It should also (optionally) output hitparades (e.g., frequency lists of unknown forms in the corpus).

Translation accuracy

apertium-eval-translator.pl and apertium-eval-translator-line.pl work well but are a bit old, and could probably benefit from being rewritten in python

@@ Line 16: / Line 16: @@
 * Clenliness of translation output
 ** Testvoc
+== Morphology coverage ==
+The tools we have for this are <code>aq-morftest</code> from [[Apertium quality]] and [[morph-test.py]].
+== Naïve coverage ==
+In theory, [[Apertium quality|aq-covtest]] does this, but mostly people write their own scripts.
+A good generalised script that supports hfst and lttoolbox binaries and arbitrary corpora would be good.  It should also (optionally) output hitparades (e.g., frequency lists of unknown forms in the corpus).
+== Translation accuracy ==
+[[apertium-eval-translator.pl]] and [[apertium-eval-translator-line.pl]] work well but are a bit old, and could probably benefit from being rewritten in python

Difference between revisions of "Meta-evaluation"

Revision as of 22:12, 30 May 2019

Morphology coverage

Naïve coverage

Translation accuracy

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools