Meta-evaluation

Revision as of 22:02, 30 May 2019

Apertium language modules and translation pairs are subject to the following types of evaluation:


  • Naïve coverage
    • Monolingual naïve coverage
    • Trimmed naïve coverage (i.e., using a trimmed dictionary)
  • Accuracy of analyses
    • Precision/Recall/F-score
  • Accuracy of translation
    • WER/PER/BLEU
  • Cleanliness of translation output
    • Testvoc
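
As a rough illustration of how some of the metrics listed above are computed, here is a minimal Python sketch. It is not Apertium's own evaluation tooling: the function names (naive_coverage, precision_recall_f1, word_error_rate) are hypothetical, and the coverage function assumes analyser output in the Apertium stream format, where each token appears as ^surface/analysis$ and an unknown token has an analysis starting with an asterisk.

```python
import re

# Assumption: input is analyser output in Apertium stream format,
# e.g. "^the/the<det>$ ^foo/*foo$", where "*" marks an unknown word.
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text):
    """Fraction of tokens that received at least one analysis."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    known = sum(1 for _surface, analyses in units
                if not analyses.startswith("*"))
    return known / len(units)


def precision_recall_f1(predicted, gold):
    """Precision, recall and F-score of a predicted set against a gold set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


def word_error_rate(hypothesis, reference):
    """Word-level edit distance divided by the reference length."""
    hyp, ref = hypothesis.split(), reference.split()
    if not ref:
        return 0.0 if not hyp else float("inf")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, 1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, 1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # reference word missing
                            curr[j - 1] + 1,      # extra hypothesis word
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)
```

For example, naive_coverage("^the/the<det>$ ^foo/*foo$") counts one known token out of two, and word_error_rate("the cat sat", "the cat sat down") counts one missing word against a four-word reference. The sketch treats every token equally and ignores details such as multiword units or escaped characters in the stream format.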