Search results

Jump to navigation Jump to search

Page title matches

  • 20 bytes (2 words) - 12:51, 7 December 2011

Page text matches

  • We're going to do the example with [[Corpora|EuroParl]] and the English to Spanish pair in Apertium. $ nohup cat europarl.en-es.en | apertium-destxt |\
    15 KB (2,206 words) - 13:58, 7 October 2014
  • Nous alors faire l'exemple avec [[Corpora|EuroParl]] et la paire anglais vers espagnol d'Apertium. ...scripts-20120109-1229/training/clean-corpus-n.perl europarl-v6.es-en es en europarl.clean 1 40
    9 KB (1,445 words) - 14:05, 7 October 2014
  • We're going to do the example with EuroParl and the English to Spanish pair in Apertium. ...m-destxt | apertium -f none -d ~/source/apertium-en-es en-es-pretransfer > europarl.en-es.es.tagged
    12 KB (1,634 words) - 18:26, 26 September 2016
  • EuroParl: 96,18% EuroParl: 96,76%
    20 KB (2,856 words) - 06:26, 27 May 2021
  • Here is an example from [[EuroParl]]: $ paste europarl-v6.es-en.en europarl-v6.es-en.es | grep ' power .* potencia '
    11 KB (1,638 words) - 13:21, 7 December 2015
  • Посмотрите на пример из [[EuroParl|европарламента]]: $ paste europarl-v6.es-en.en europarl-v6.es-en.es | grep ' power .* potencia '
    11 KB (749 words) - 07:13, 22 January 2012
  • Training corpus size: 1.000.000 lines chosen from the front of the EUROPARL-es-en.en corpus, 613786 training examples. Testing corpus size: 50.000 line Training corpus size: 1.000.000 lines chosen from the front of the EUROPARL-es-en corpus, 151.000 aligned examples.
    6 KB (838 words) - 17:47, 25 July 2012
  • * EuroParl: http://www.statmt.org/europarl/v7/es-en.tgz ! SETimes !! EuroParl !! Wikipedia !! NewsComm !! !! SETimes !! EuroParl !! Wikipedia !! NewsComm
    3 KB (151 words) - 17:51, 11 June 2014
  • |Europarl |Europarl
    11 KB (1,215 words) - 14:43, 17 April 2018
  • Dækningen på Wikipedia-tekst er p.t. 72% og korpuset Europarl 80%. We measured coverage on Wikipedia to 72 % and the EuroParl corpus to 80 %.
    9 KB (1,406 words) - 20:34, 29 October 2010
  • For the parallel corpus we're going to use Europarl, the page [[corpora]] lists some others: $ wget http://www.statmt.org/europarl/v5/da-en.tgz
    4 KB (647 words) - 07:45, 8 October 2014
  • The corpora used for this task can be found here: http://www.statmt.org/europarl/v7/sl-en.tgz cat europarl-v7.sl-en.sl | apertium -d ~/Apertium/apertium-sh-sl/ sl-sh-morph | sed 's/\
    6 KB (625 words) - 16:54, 1 July 2013
  • Pour le corpus parallèle, on va utiliser Europarl, la page [[corpora]] (seulement en anglais) en liste d'autres : $ wget http://www.statmt.org/europarl/v5/da-en.tgz
    5 KB (699 words) - 07:52, 8 October 2014
  • ...[http://opus.lingfil.uu.se/ OPUS Uppsala University, Sweden]. Choose among Europarl, OpenOffice and OpenSubtitles etc in many languages.
    6 KB (1,009 words) - 13:14, 15 March 2015
  • * EuroParl — http://www.statmt.org/europarl/ — EU12 languages up to 44 million words per language
    5 KB (746 words) - 20:36, 25 January 2020
  • ...p://opus.lingfil.uu.se/ OPUS Uppsala University, Sweden]. Choisissez parmi Europarl, OpenOffice et OpenSubtitles etc. dans plusieurs langues.
    7 KB (1,144 words) - 14:00, 15 March 2015
  • -lm 0:5:/home/fran/corpora/europarl/europarl.lm:0 >log 2>&1 &
    5 KB (680 words) - 11:53, 26 September 2016
  • The Anaphora Resolution Module was run on a corpus of a 1000 sentences from Europarl, using this [https://github.com/apertium/apertium-eng-spa/blob/anaphora-tra
    20 KB (3,107 words) - 21:13, 24 June 2022
  • ...omanian-French (a French--Romanian parallel corpus can be generated from [[EuroParl]]. You can analyse the corpus with a morphological analyser, and use some w
    4 KB (686 words) - 11:05, 27 September 2016
  • ...cepted, only if you manage to reach a comparable quality. For instance, if europarl+moses gets a [[WER]] of 15-20%, we'd be happy with 25%.
    2 KB (377 words) - 19:18, 25 January 2023
  • {{LangStats | lang = slv | corpus1 = wikipedia | corpus2=europarl }}
    114 bytes (11 words) - 09:54, 2 November 2013
  • ...cepted, only if you manage to reach a comparable quality. For instance, if europarl+moses gets a [[WER]] of 15-20%, we'd be happy with 25%.
    2 KB (383 words) - 19:46, 2 March 2023
  • europarl * words: <section begin=europarl-words />4.4M<section end=europarl-words />
    1 KB (173 words) - 06:06, 16 December 2014
  • ...{{#lst:Apertium-slv/stats|wikipedia-coverage}} + {{#lst:Apertium-slv/stats|europarl-coverage}}) / 2)) round 1 }}
    125 bytes (10 words) - 09:56, 2 November 2013
  • {{LangStats | lang = ces | corpus1 = wikipedia | corpus2=europarl }}
    110 bytes (11 words) - 20:51, 3 November 2013
  • europarl * words: <section begin=europarl-words />4.4M<section end=europarl-words />
    2 KB (213 words) - 17:55, 16 December 2017
  • ...{{#lst:Apertium-ces/stats|wikipedia-coverage}} + {{#lst:Apertium-ces/stats|europarl-coverage}}) / 2)) round 1 }}
    125 bytes (10 words) - 20:53, 3 November 2013
  • ...not only the two languages are not very closely related, but the SMT/Moses+Europarl baseline is expected to be rather good and difficult to reach in quality.
    414 bytes (64 words) - 00:06, 20 March 2014