Search results

  • ...f the big language data sets. You do not want to add to or modify language data, you want to use it. Data may be outdated, use only for system assessment. See the main sec
    3 KB (445 words) - 12:38, 24 April 2017
  • ...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started. ...translation systems. The only thing you need to do is write the data. The data consists, on a basic level, of three dictionaries and a few rules (to deal
    19 KB (3,164 words) - 20:58, 2 April 2021
  • ...]. The instructions are very different. This page is for existing language data. ...mar or HFST. If that happens, follow instructions under [[Install language data by compiling#Missing dependencies | missing dependencies]].
    5 KB (843 words) - 19:44, 2 March 2023
  • ...on Ubuntu/Debian, using the Voikko plugins and Giellatekno/Divvun language data. ==Install the language data==
    4 KB (596 words) - 21:02, 2 April 2021
  • |title=Add recursive transfer support to a language pair that doesn't support it |description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfe
    32 KB (4,862 words) - 06:23, 5 December 2019
  • * https://apertium.org is the official site, and offers all the released language pairs ...Apertium platform, and also offers a simple web interface to the released language pairs
    6 KB (848 words) - 12:51, 1 April 2024
  • ...rtium.org page uses an installation which currently only runs ''released'' language pairs (also available from https://apertium.org/apy if you prefer). However $ curl -G --data "lang=kir&modes=morph&q=алдым" https://beta.apertium.org/apy/analyse
    37 KB (5,132 words) - 16:36, 5 June 2020
  • ...chine translation to understand the general meaning of the text in a foreign language. The other approach is instead that of "dissemination" in which the MT is a ...(coding and decoding), data (linguistic data) and support tools to convert data and make them compatible with the engine. Even if most RBMT systems are pri
    21 KB (3,171 words) - 14:34, 3 April 2017
  • [[Target-language tagger training|In English]] ...change the variables <code>DATA</code> and <code>DIRECTION</code>. <code>DATA</code> must point to the directory containing the data of the pair
    12 KB (1,625 words) - 08:20, 8 October 2014
  • ...epository scheme. (Originally, all monolingual language data was found in language pairs, meaning that there was a lot of duplication.) If you feel something ...hat constitutes a minimally-useful language package; generally, however, a language package should have over 60% coverage on a variety of corpora and should pr
    15 KB (1,783 words) - 22:33, 1 February 2019
  • ====When running configure script for language pair data==== ====Workaround when language pairs need updated configure.ac's====
    20 KB (3,153 words) - 08:13, 24 May 2019
  • DATA=/home/philip/Apertium/gsoc2013/monolingual/data ...atterns-frac-maxent.py $DATA/setimes.sh-mk.freq $DATA/setimes.sh-mk.ambig $DATA/setimes.sh-mk.annotated > events 2>ngrams
    3 KB (520 words) - 21:25, 14 February 2014
  • ...to be translated. For example, HTML tags must not be translated into another language, only the text of the Web page. ...e same software is used for every language pair. The format of the data to be translated determines which deformatter to use.
    58 KB (8,365 words) - 20:16, 26 June 2018
  • Owing to the different syntactic structure of the phrases in each language, some Although the details of the modules and the linguistic data is presented in
    58 KB (8,964 words) - 11:11, 14 May 2016
  • ...Iberian peninsula, but is now being used to translate between more distant language pairs. ...ngineering ([http://www.prompsit.com http://www.prompsit.com]). Linguistic data are being developed by Transducens, the Seminario
    26 KB (3,122 words) - 06:25, 27 May 2021
  • ...ngsnes (ed.) Bauta: Janne Bondi Johannessen in memoriam, Oslo Studies in Language 11(2), 2020. 489–501. (ISSN 1890-9639 / ISBN 978-82-91398-12-9) ...system/files/swj1419.pdf The apertium bilingual dictionaries on the web of data]. Semantic Web, 9(2), 231-240.
    33 KB (4,418 words) - 11:52, 29 December 2021
  • ...tion of each module with more precision. They may also introduce technical language which linguists and/or computer coders would use. The technical description References to 'xxx' and 'yyy' refer to a language code, for example 'en-es'; 'English' to 'Spanish'.
    29 KB (4,687 words) - 16:28, 5 June 2020
  • ...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. Here are the steps to access the data:
    2 KB (296 words) - 21:12, 13 January 2018
  • ...//d3js.org/ D3.js] tool that depicts all Apertium [[list of language pairs|language pairs]] in an interactive graph initially developed sometime before the [[G === Updating language data by scraping ===
    5 KB (702 words) - 01:34, 9 December 2018
  • === Language pairs === .../github.com/apertium/apertium-urd-hin?files=1 apertium-urd-hin] Linguistic data for the Apertium Urdu-Hindi machine translator
    6 KB (806 words) - 00:45, 7 December 2018
