Difference between revisions of "Talk:Turkic-Turkic translator"
Jump to navigation
Jump to search
(Created page with ' ==Lexicon trimming== <pre> <spectre> also, some 'corner cases' for the lexicon scraper <spectre> when you have stems in a continuation lexicon, e.g. demonstratives in kazakh, p…') |
|||
| Line 1: | Line 1: | ||
==Lexicon trimming== |
==Lexicon trimming== |
||
| Line 11: | Line 10: | ||
<spectre> for verbs (this one can be fixed by having a correspondence between tags/continuation lexica) |
<spectre> for verbs (this one can be fixed by having a correspondence between tags/continuation lexica) |
||
e.g. when you have a verb which is both tv/iv but only one in the bidix |
e.g. when you have a verb which is both tv/iv but only one in the bidix |
||
<spectre> another example: when you have an entry like foo<adj>:bar<n><attr> in the bilingual dictionary, but no entry |
|||
for foo<adj><subst>:bar<n> (or foo<adj><subst>:baz<n>) then there will be errors |
|||
</pre> |
</pre> |
||
Revision as of 12:54, 22 April 2012
Lexicon trimming
<spectre> also, some 'corner cases' for the lexicon scraper
<spectre> when you have stems in a continuation lexicon, e.g. demonstratives in kazakh, personal pronouns in other places
<spectre> they will all get included disregarding the bilingual dictionary
<spectre> another example: when one language has a word that another doesn't have (e.g. a case that turns into a postposition,
but the postposition doesn't have an equivalent in the other language, e.g. is inserted by transfer
<spectre> (same goes for "particles")
<spectre> for verbs (this one can be fixed by having a correspondence between tags/continuation lexica)
e.g. when you have a verb which is both tv/iv but only one in the bidix
<spectre> another example: when you have an entry like foo<adj>:bar<n><attr> in the bilingual dictionary, but no entry
for foo<adj><subst>:bar<n> (or foo<adj><subst>:baz<n>) then there will be errors