Difference between revisions of "ReTraTos"

From Apertium
Jump to navigation Jump to search
Line 2: Line 2:
 
'''ReTraTos''' is a toolbox to build linguistic resources useful for machine translation (MT): bilingual dictionaries and transfer rules. The induction systems and open linguistic data can be used with the [[Apertium]] toolbox to build open-source MT systems.
 
'''ReTraTos''' is a toolbox to build linguistic resources useful for machine translation (MT): bilingual dictionaries and transfer rules. The induction systems and open linguistic data can be used with the [[Apertium]] toolbox to build open-source MT systems.
   
==Usage==
+
==Input format==
 
===Input format===
 
   
 
The input sentences need to be given in two separate files, for example <code>en.txt</code> for English and <code>pt.txt</code> for Portuguese.
 
The input sentences need to be given in two separate files, for example <code>en.txt</code> for English and <code>pt.txt</code> for Portuguese.
Line 21: Line 19:
 
...
 
...
 
</nowiki></pre>
 
</nowiki></pre>
  +
  +
==Usage==
  +
  +
===ReTraTos_lex===
  +
  +
You will need the header and footer of a bilingual dictionary in two separate files, for example, <code>dic_header.txt</code> and <code>dic_footer.txt</code> (see the examples in the package). Example sentences, in the format described above will be in the files <code>en.txt</code> and <code>pl.txt</code>.
  +
  +
<pre>
  +
$ ReTraTos_lex -s pt.txt -t en.txt -b dic_header.txt -e dic_footer.txt
  +
  +
PRE-PROCESSAMENTO
  +
  +
Reading the examples ... 100 examples read
  +
Reading the examples ... 100 examples read
  +
  +
GERANDO LEXICO
  +
  +
Generating source-target dictionary ... OK
  +
Generating target-source dictionary ... OK
  +
Processing bilingual dictionary ... OK
  +
Generalizing bilingual dictionary ... OK
  +
Cleaning equal attributes ... OK
  +
  +
IMPRIMINDO LEXICO
  +
  +
Printing bilingual dictionary ... OK
  +
</pre>
  +
  +
The output file will be <code>ReTraTos_lex_ptXen_1.dix</code>.
   
 
==See also==
 
==See also==

Revision as of 17:42, 15 March 2008

ReTraTos is a toolbox to build linguistic resources useful for machine translation (MT): bilingual dictionaries and transfer rules. The induction systems and open linguistic data can be used with the Apertium toolbox to build open-source MT systems.

Input format

The input sentences need to be given in two separate files, for example en.txt for English and pt.txt for Portuguese.

pt.txt
  <s snum=1>Os/O<det><def><m><pl>:1 alunos/aluno<n><m><pl>:2 do/de<pr>+o<det><def><m><sg>:3_4 colégio/colégio<n><m><sg>:5
 
  ...
en.txt
  <s snum=1>The/The<det><def><sp>:1 students/student<n><pl>:2 of/of<pr>:3 the/the<det><def><sp>:3 school/school<n><pl>:5

  ...

Usage

ReTraTos_lex

You will need the header and footer of a bilingual dictionary in two separate files, for example, dic_header.txt and dic_footer.txt (see the examples in the package). Example sentences, in the format described above will be in the files en.txt and pl.txt.

$ ReTraTos_lex -s pt.txt -t en.txt -b dic_header.txt -e dic_footer.txt 

PRE-PROCESSAMENTO

        Reading the examples ...  100 examples read
        Reading the examples ...  100 examples read

GERANDO LEXICO

        Generating source-target dictionary ... OK
        Generating target-source dictionary ... OK
        Processing bilingual dictionary ... OK
        Generalizing bilingual dictionary ... OK
        Cleaning equal attributes ... OK

IMPRIMINDO LEXICO

        Printing bilingual dictionary ... OK

The output file will be ReTraTos_lex_ptXen_1.dix.

See also

External links

Further reading