Translation quality statistics
Jump to navigation
Jump to search
This page aims to give an overview of the quality of various translators available in the Apertium platform. Word Error Rate (WER) and Position-independent Word Error Rate (PWER) are measures of post-edition effort. The number gives the expected number of words needed to be corrected in 100 words of running text. So, a WER of 4.7% indicates that in a given 100 words of text, 4.7 of them will need to be corrected by the post-editor.
Translator | Date | Version | Direction | WER | PWER | BLEU | Reference |
---|---|---|---|---|---|---|---|
apertium-nn-nb |
12th October 2009 | 0.6.1 | nn → nb | - | - | - | |
nb → nn | - | - | - | ||||
apertium-sv-da |
12th Oct 2009 | 0.5.0 | sv → da | 30.3 % | 27.7 % | - | http://wiki.apertium.org/w/index.php?title=Swedish_and_Danish/Evaluation&oldid=14881 |
da → sv | - | - | - | ||||
apertium-eu-es |
2nd September 2009 | eu → es | 72.4 % | 39.8 % | - | Ginestí-Rosell et al., 2009 | |
es → eu | - | - | - | ||||
apertium-cy-en |
2nd January 2009 | cy → en | 55.7 % | 30.5 % | - | Tyers and Donnelly, 2009 | |
en → cy | - | - | - | ||||
apertium-eo-en |
08 May 2009 | 0.9.0 | en → eo | 21.0 % | 19.0 % | - | http://wiki.apertium.org/w/index.php?title=English_and_Esperanto/Evaluation&oldid=12418 |
eo → en | - | - | - | ||||
apertium-es-pt |
15th May 2006 | es → pt | 4.7 % | - | - | Armentano et al., 2006 | |
pt → es | 11.3 % | - | - | ||||
apertium-oc-ca |
10th May 2006 | oc → ca | 9.6 % | - | - | Armentano and Forcada, 2006 | |
ca → oc | - | - | - | ||||
apertium-pt-ca |
28th July 2008 | pt → ca | 16.6% | - | - | Armentano and Forcada, 2008 | |
ca → pt | 14.1% | - | - | ||||
apertium-en-es |
May 2009 | en → es | - | - | 18.51% | Sánchez-Martínez, 2009 | |
es → en | - | - | 18.81% |
References
- Armentano-Oller, C., Carrasco, R. C. Corbí-Bellot, A. M., Forcada, M. L., Ginestí-Rosell, M., Ortiz-Rojas, S., Pérez-Ortiz, J. A., Ramírez-Sánchez, G., Sánchez-Martínez, F., Scalco, M. A. (2006) "Open-source Portuguese-Spanish machine translation", in In Lecture Notes in Computer Science 3960 (Computational Processing of the Portuguese Language, Proceedings of the 7th International Workshop on Computational Processing of Written and Spoken Portuguese, PROPOR 2006), May 13-17, 2006, ME - RJ / Itatiaia, Rio de Janeiro, Brazil. , p. 50-59
- Armentano-Oller, C. and Forcada, M. L. (2006) "Open-source machine translation between small languages: Catalan and Aranese Occitan", in Strategies for developing machine translation for minority languages (5th SALTMIL workshop on Minority Languages) (organized in conjunction with LREC 2006 (22-28.05.2006)) , p. 51-54
- C. Armentano-Oller, M.L. Forcada, "Reutilización de datos lingüísticos para la creación de un sistema de traducción automática para un nuevo par de lenguas", Procesamiento del Lenguaje Natural, :41, 243-250
- Ginestí-Rosell, M. and Ramírez-Sánchez, G. and Ortiz-Rojas, S. and Tyers, F. M. and Forcada, M. L. (2009) "Development of a free Basque to Spanish machine translation system". Procesamiento de Lenguaje Natural. No. 43, pp. 185--197
- Tyers, F. M. and Donnelly, K. (2009) "apertium-cy - a collaboratively-developed free RBMT system for Welsh to English". The Prague Bulletin of Mathematical Linguistics No. 91, pp. 57--66.
- Felipe Sánchez-Martínez, Mikel L. Forcada, Andy Way. "Hybrid rule-based ‒ example-based MT: Feeding Apertium with sub-sentential translation units". In Proceedings of the 3rd Workshop on Example-Based Machine Translation, p. 11-18, November 12-13, 2009, Dublin, Ireland.