ReTraTos
Revision as of 12:41, 15 March 2008 by Francis Tyers (talk | contribs)
ReTraTos is a toolbox to build linguistic resources useful for machine translation (MT): bilingual dictionaries and transfer rules. The induction systems and open linguistic data can be used with the Apertium toolbox to build open-source MT systems.
Usage
Input format
The input sentences need to be given in two separate files, for example en.txt
for English and pt.txt
for Portuguese.
- pt.txt
<s snum=1>Os/O<det><def><m><pl>:1 alunos/aluno<n><m><pl>:2 do/de<pr>+o<det><def><m><sg>:3_4 colégio/colégio<n><m><sg>:5 ...
- en.txt
<s snum=1>The/The<det><def><sp>:1 students/student<n><pl>:2 of/of<pr>:3 the/the<det><def><sp>:3 school/school<n><pl>:5 ...
See also
External links
Further reading
- Helena M. Caseli, Maria das Graças V. Nunes, Mikel L. Forcada, "From free shallow monolingual resources to machine translation systems: easing the task", in Mixing Approaches To Machine Translation, MATMT2008, proceedings (Donostia, Spain, Feb. 14, 2008), pp. 41--48