Difference between revisions of "Tips for working on bilingual dictionaries"

From Apertium
Jump to navigation Jump to search
 
(3 intermediate revisions by 2 users not shown)
Line 1: Line 1:
  +
[[Conseils pour travailler sur les dictionnaires bilingues|En français]]
  +
 
This page is to catalogue tips for working on bilingual dictionaries for use in machine translation.
 
This page is to catalogue tips for working on bilingual dictionaries for use in machine translation.
   
If you have two target translations to choose between (and don't have a [[lexical selection]] module), you should typically choose the one that fits the most circumstances, or will be correct in the most circumstances (letting the other one get a [[directional restriction]]).
+
If you have two target translations to choose between, you should write a default [[lexical selection]] rule that chooses the one that fits the most circumstances, or will be correct in the most circumstances. Lexical selection rules working on context should choose "non-default" translations.
   
 
==If you can't translate it, don't.==
 
==If you can't translate it, don't.==
Line 25: Line 27:
 
==See also==
 
==See also==
 
* [[Bilingual dictionary]]
 
* [[Bilingual dictionary]]
  +
* [[Generate-bidix-templates]] – script for making bidix pardefs
   
 
==Notes==
 
==Notes==

Latest revision as of 07:32, 14 January 2016

En français

This page is to catalogue tips for working on bilingual dictionaries for use in machine translation.

If you have two target translations to choose between, you should write a default lexical selection rule that chooses the one that fits the most circumstances, or will be correct in the most circumstances. Lexical selection rules working on context should choose "non-default" translations.

If you can't translate it, don't.[edit]

Example:

<e><p><l>laufabrauð<s n="n"/><s n="nt"/></l><r>traditional<b/>Icelandic<b/>deep-fried<b/>patterned<b/>Christmas<b/>wafer<s n="n"/></r></p></e>

There is literally nowhere that that description would fit fluently into a sentence. First try and find a more literal translation in the target language, for example in this case "laufa" + "brauð" = "leaf" + "bread". Check in a search engine, or one of the image search engines to make sure it comes up with the same kinds of results, and if it does use that. Just because it doesn't appear in a dictionary does not make it a bad translation.

As a last resort, just use the source language word in the target language, consider the following translations:

  • Börnin bökuðu laufabrauð
  • The children baked leafbread.
  • The children baked laufabrauð.
  • The children baked traditional Icelandic deep-fried patterned Christmas wafer.

People who are going to be reading machine translation have the nouse[1] to go and search for a word they are unfamiliar with. People who are post-editting machine translation are not going to want to remove six words, to re-add one.

See also[edit]

Notes[edit]

  1. More frequently nous, common sense