Difference between revisions of "Meta-evaluation"

From Apertium

Revision as of 22:01, 30 May 2019

Apertium language modules and translation pairs are subject to the following types of evaluation:


  • Naïve coverage
    • Monolingual naïve coverage
    • Trimmed naïve coverage (i.e., using a trimmed dictionary)
  • Accuracy of tagging
    • Precision/Recall/F-score
  • Accuracy of translation
    • WER/PER/BLEU
  • Cleanliness of translation output
    • Testvoc
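
As a rough illustration of how the first three kinds of measure in the list above are usually computed, here is a minimal Python sketch. It is not Apertium's own evaluation tooling: the function names are hypothetical, and the coverage function assumes analyser output in the Apertium stream format, where an unknown form appears as ^form/*form$. Escaped characters and multiword edge cases are ignored.

```python
import re

# One lexical unit in Apertium stream format, e.g. ^cats/cat<n><pl>$
# (simplified: escaped ^ and $ inside a unit are not handled).
LEXICAL_UNIT = re.compile(r"\^(.*?)\$")


def naive_coverage(analysed: str) -> float:
    """Fraction of lexical units that received at least one analysis.

    Assumption: unknown forms are rendered as ^form/*form$, so a unit
    containing "/*" is counted as not covered.
    """
    units = LEXICAL_UNIT.findall(analysed)
    if not units:
        return 0.0
    known = sum(1 for unit in units if "/*" not in unit)
    return known / len(units)


def precision_recall_f(predicted: set, gold: set) -> tuple:
    """Precision, recall and balanced F-score over two sets of items."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_score


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, 1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution
            ))
        prev = cur
    return prev[-1] / len(ref) if ref else 0.0
```

For example, naive_coverage applied to the string "^the/the<det>$ ^zorp/*zorp$" counts two lexical units, one of them unknown, and so reports a coverage of one half. PER and BLEU are not sketched here; both additionally require position-independent matching or n-gram counting.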