Search results

Jump to navigation Jump to search
  • ...high, and compares with commercial systems -- over 95% coverage (around 5 unknown words out of 100 words), and between 3-7% word-error rate (out of 100 words ...final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, meaning th
    12 KB (1,683 words) - 08:42, 10 May 2013
  • ...from Wikipedia, newspapers, literature, etc.) detect the 50 most frequent unknown words (source words which are not in the dictionaries of the language pair # add these words to the source dictionary (so that they are not unknown anymore), add the correspondence to the bilingual dictionary, and add the w
    2 KB (271 words) - 05:34, 17 December 2015
  • ...rus-nova.txt, -ref = tat-rus-posted.txt). WER / PER results are given when unknown-word marks (stars) are not removed. ...if it's usable by that time. || '''Midterm evaluation'''<br/>Results when unknown word-marks (stars) are not removed<br/>tat-rus/texts/text1.* (full coverage
    8 KB (1,006 words) - 12:48, 9 March 2018
  • -e: morphological analysis, with compound analysis on unknown words -n: morph. generation without unknown word marks
    9 KB (1,370 words) - 09:49, 7 April 2020
  • ...high, and compares with commercial systems -- over 95% coverage (around 5 unknown words out of 100 words), and between 3-7% word-error rate (out of 100 words ...final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, meaning th
    12 KB (1,683 words) - 11:00, 30 October 2015
  • ...verb conjugations, declensions, etc. More generically, upon finding a new unknown word, we can productively generate all its inflections according to every p ...ome constraints, which we can use in order to gather information about an unknown word. More generically, we can gather information about a word knowing whic
    6 KB (928 words) - 13:57, 3 April 2009
  • ...smi="v|tv|fut|p1|sg" si="root" UpCase="none" lem="iç" mi="v|tv|fut|p1|sg" unknown="transfer"> ...="0" slem="bira" smi="n|acc" si="dobj" UpCase="none" lem="bira" mi="n|acc" unknown="transfer">
    53 KB (8,811 words) - 04:05, 21 January 2017
  • ...analysis. This could happen by changing lt-proc (fst_processor.cc) so that unknown words are sent to a decompounding-function that tries various strategies (l * If the first member is unknown, choose the analysis with the longest last member.
    13 KB (2,051 words) - 10:24, 22 September 2010
  • |unknown |REGLA: unknown
    45 KB (7,840 words) - 10:56, 18 September 2017
  • '''Detect hidden unknown words by using the probabilities of the HMM-based part-of-speech tagger in ...orms for which there exists at least one lexical form cannot be considered unknown and there is no way to know whether the set of possible lexical forms provi
    2 KB (277 words) - 19:51, 24 March 2020
  • ...m Wikipedia, newspapers, literature, etc.) '''detect the 250 most frequent unknown words''' (words in the source document which are not in the dictionary). S ...opriate <code>.dix</code> or <code>.lexc</code> file) so that they are not unknown anymore. Make sure to categorise stems correctly (this can be hard, so ple
    2 KB (299 words) - 19:44, 30 December 2019
  • ...m Wikipedia, newspapers, literature, etc.) '''detect the 200 most frequent unknown words''' (words in the source document which are not in the bilingual dicti ...ropriate <code>.dix</code> file) in [[bidix]] format (so that they are not unknown anymore), as well as the monolingual analysers if needed. Make sure to cat
    2 KB (320 words) - 15:01, 19 January 2020
  • <pre>LIST unknown = ("\\*.*"r) ; </pre> <pre>SELECT proper-name IF (1 unknown);</pre>
    8 KB (1,211 words) - 23:02, 4 April 2021
  • ...high, and compares with commercial systems -- over 95% coverage (around 5 unknown words out of 100 words), and between 3-7% word-error rate (out of 100 words ...final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, meaning th
    12 KB (1,679 words) - 12:00, 31 January 2012
  • ...in extending dictionaries by assigning stems and inflectional paradigms to unknown words] (pp.19-26.). EAMT 2014 – 17th Annual conference of the European As ...sites/default/files/FreeRBMT-2012.pdf#33 Choosing the correct paradigm for unknown words in rule-based machine translation systems]. Third International Works
    33 KB (4,418 words) - 11:52, 29 December 2021
  • ...cognates through Turkish, corpora were examined to determine and ascertain unknown words' meanings and Persian, Arabic and Russian vocabulary were used to goo
    4 KB (551 words) - 23:52, 28 August 2017
  • &markup, &unknown);
    5 KB (680 words) - 16:10, 13 May 2010
  • A star * means a word was unknown to the translator and passed through unchanged. For proper nouns, this is o
    3 KB (427 words) - 18:56, 26 September 2016
  • * Alt-U: Set/unset 'mark unknown words'
    10 KB (1,606 words) - 20:54, 20 February 2022
  • * How about unknown words...
    5 KB (788 words) - 10:50, 9 February 2015

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)