Search results

Jump to navigation Jump to search
  • == Corpora ==
    6 KB (786 words) - 10:56, 4 May 2016
  • == Corpora ==
    1 KB (154 words) - 06:03, 16 December 2014
  • == Corpora ==
    2 KB (255 words) - 19:20, 9 October 2021
  • == Corpora ==
    1 KB (173 words) - 06:04, 16 December 2014
  • == Corpora ==
    1 KB (135 words) - 06:03, 16 December 2014
  • == Corpora ==
    2 KB (199 words) - 06:51, 6 July 2018
  • == Corpora ==
    1 KB (173 words) - 06:03, 16 December 2014
  • == Corpora ==
    1 KB (177 words) - 06:01, 16 December 2014
  • == Corpora ==
    2 KB (211 words) - 06:02, 16 December 2014
  • == Corpora ==
    1 KB (173 words) - 06:06, 16 December 2014
  • # the best taggers use hand-tagged corpora to train with (we use untagged corpora -- for English)
    7 KB (1,177 words) - 08:34, 8 October 2014
  • * [[Corpora]]
    13 KB (1,601 words) - 23:31, 23 July 2021
  • ...toscore.txt | ~/source/apertium/trunk/apertium-lex-learner/irstlm-ranker ~/corpora/català/en.blm > unk.trans.scored.txt
    9 KB (1,470 words) - 11:28, 24 March 2012
  • to be related. Both corpora must be pre-processed before the training. This pre-processing, consisting in analysing the corpora and
    11 KB (1,814 words) - 03:22, 9 March 2019
  • The final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, When linguistic resources, for example corpora, dictionaries, grammars, morphological analysers, lists of lemmata etc. are
    12 KB (1,683 words) - 08:42, 10 May 2013
  • The final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, When linguistic resources, for example corpora, dictionaries, grammars, morphological analysers, lists of lemmata etc. are
    12 KB (1,683 words) - 11:00, 30 October 2015
  • The final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, When linguistic resources, for example corpora, dictionaries, grammars, morphological analysers, lists of lemmata etc. are
    12 KB (1,679 words) - 12:00, 31 January 2012
  • * '''Coverage''' is the naïve coverage over one or more free corpora.
    6 KB (591 words) - 22:50, 30 October 2017
  • ...steps to create each one. But you'll want to build it up, testing against corpora, etc. You want to be able to have as many correct analyses as possible, an ...if your pair will support translation in both directions). The corpus or corpora ideally should represent a range of content—i.e., it shouldn't be just sp
    10 KB (1,615 words) - 07:43, 20 December 2015
  • ...dinian to Italian. We started a manual morphological disambiguation of the corpora that will help the translator to recognize the correct morphology of each w We have treated two corpora: one journalistic and more dialectal, and other taken directly from literar
    9 KB (1,306 words) - 15:56, 2 September 2017

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)