Search results

  • * [[Learning rules from parallel and non-parallel corpora]] – this is the current documentation on training/inferring rules ** preprocess corpora
    4 KB (541 words) - 13:46, 29 March 2021
  • ===Corpora=== * [http://corpora.uni-leipzig.de/en?corpusId=hin_news_2011 Hindi News Corpus] Creative Common
    6 KB (806 words) - 00:45, 7 December 2018
  • Apertium-regtest is a program for managing regression tests and [[Corpus test|corpora]]. # in the browser, select one or all of the corpora to rerun tests for
    11 KB (1,823 words) - 12:17, 6 June 2023
  • ===Corpora=== * [http://childes.talkbank.org/access/French/ CHILDES Corpora]. [http://talkbank.org/share/rules.html ''Requires reference'']
    15 KB (2,081 words) - 07:14, 12 August 2020
  • ..., Tommi Pirinen, Jonathan Washington. "Finite-state morphologies and text corpora as resources for improving morphological descriptions". [https://sites.goog ...f Inferring shallow-transfer machine translation rules from small parallel corpora]". In Journal of Artificial Intelligence Research. volume 34, p. 605-635.
    33 KB (4,418 words) - 11:52, 29 December 2021
  • Once a transducer has ~80% coverage on a range of medium-large corpora we can say it is "working". Over 90% and it can be considered to be "produc
    12 KB (1,017 words) - 09:06, 18 January 2022
  • ====Training on Corpora with Ambiguous Lexical Units==== ...m-tagger</code> '', the tagger prints warnings about ambiguous analyses in corpora to stderr.''
    20 KB (3,229 words) - 20:06, 12 March 2018
  • * Tufiş, D., A. M. Barbu, V. Pătraşcu, G. Rotariu, and C. Popescu. "Corpora and Corpus-Based Morpho-Lexical Processing." ''Recent Advances in Roma ===Corpora===
    7 KB (889 words) - 09:53, 28 November 2018
  • ===Corpora=== * [http://opus.nlpl.eu/ Hindi-English Parallel Corpora]
    8 KB (1,079 words) - 11:17, 3 December 2018
  • ...why the errors are so high. Re-evaluation will be done as soon as the two corpora are manually realigned. ...nslated corpora, and the label will be extracted from the manually written corpora. This method might provide better results since the model will be trained o
    6 KB (838 words) - 17:47, 25 July 2012
  • ==Corpora== The tagged corpora used in the experiments are found in the monolingual packages in [[language
    16 KB (1,448 words) - 16:50, 22 August 2017
  • ...n be found [https://github.com/taruen/apertiumpp/tree/master/data4apertium/corpora here]. ...m/corpora/jam/uzb.txt | apertium -d . uzb-kaa) -ref ../../../data4apertium/corpora/jam/kaa.txt
    5 KB (515 words) - 14:34, 1 September 2019
  • ===Corpora === * [http://www.elra.info/en/catalogues/free-resources/nepali-corpora/ ''"Nepali"'']
    8 KB (948 words) - 19:59, 30 December 2017
  • === Obtaining corpora (and getAlignmentWithText.pl) === The corpora need to be untarred, and inserted into a new, common directory.
    7 KB (973 words) - 02:52, 20 May 2021
  • Automatic shallow-transfer rules generation from parallel corpora ...in statistical machine translation, that have been extracted from parallel corpora and extended with a set of restrictions controlling their application.
    4 KB (525 words) - 19:21, 17 September 2009
  • ~/source/corpora/lm/en.blm europarl.en-es.es.multi-trimmed -f > europarl.en-es.es.annotated MODEL=/home/philip/Apertium/corpora/language-models/en/setimes.en.5.blm
    12 KB (1,634 words) - 18:26, 26 September 2016
  • ...e project was a success. All the goals have been achieved: the creation of corpora in LSC; italian monodix: apertium-srd-srd.dix: 51,743 words; apertium-ita-i ...rned to use markup languages (XML and HTML) for the creation of linguistic corpora. At present, I attend a Master’s Degree in Translation of specialized tex
    21 KB (3,171 words) - 14:34, 3 April 2017
  • ===Corpora=== * [https://korpora.zim.uni-duisburg-essen.de/Limas/ Corpora from Limas z.Hd. Prof. Dr. Bernhard Schröder Universität Duisburg-Essen,
    8 KB (900 words) - 10:15, 4 December 2018
  • == Corpora and Coverage == Our main corpora consisted of [https://www.rfa.org/uyghur/ RFA], [http://uy.ts.cn/ Tanritor]
    5 KB (607 words) - 13:25, 12 August 2018
  • == Corpora == ...8ba2a9c0e50bc885bfad3bfbff3b4afbd.pdf Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech]
    7 KB (881 words) - 13:11, 12 December 2018
