Ideas for Google Summer of Code/Apertium assimilation evaluation toolkit

The idea is to measure how the ability of human subjects to fill in the holes improves when the source or a machine translation of it are presented. The task involves also generating a program that computes the success as a function of the information presented to the user, and utilities to make the whole process automatic given an Apertium language pair.

There should be both a text-based interface, and a web-based interface.