Search results
Jump to navigation
Jump to search
Page title matches
- ...f the big language data sets. You do not want to add to or modify language data, you want to use it. <span style="color:darkorange;">'''Data may be outdated'''</span>, use only for system assessment. See the main sec3 KB (445 words) - 12:38, 24 April 2017
- ...]. The instructions are very different. This page is for existing language data. ...mar or HFST. If that happens, follow instructions under [[Install language data by compiling#Missing dependencies | missing dependencies]].5 KB (843 words) - 19:44, 2 March 2023
Page text matches
- * an Apertium language pair Make a folder called data-en-es. We are going to keep all the generated files there.15 KB (2,206 words) - 13:58, 7 October 2014
- * A language pair (e.g. apertium-br-fr) ** The language pair should have the following two modes:12 KB (1,634 words) - 18:26, 26 September 2016
- Your language pair should be fully set up in the direction that you're training for, and * an Apertium language pair14 KB (2,181 words) - 19:01, 17 August 2018
- This is a guide on how to add linguistic data directly to an existing language pair in Apertium. It gets a bit technical – if you just want to notify us ...t-of-speech tagger, which is in charge of the disambiguation of the source language text.50 KB (7,915 words) - 00:04, 10 March 2019
- * Train a target side language model (http://hermes.fbk.eu/people/bertoldi/teaching/lab_2010-2011/img/irst * The language pair must support the pretransfer and multi modes. See apertium-sh-mk/modes4 KB (503 words) - 19:01, 17 August 2018
- ...iew to the kind of data and resources that can be useful in building a new language pair for Apertium, and how to go about building them if they do not already Each Apertium language pair requires 3 dictionary files. For instance, for the English-Afrikaans13 KB (2,099 words) - 12:04, 6 October 2014
- ...language pair XX-YY by adding 50 words to its vocabulary || Add words to language pair XX-YY and test that the new vocabulary works. [[/Add words|Read more]] ...language pair || Add or correct a structural transfer rule to an existing language pair and test that it works. [[/Add transfer rule|Read more]]... || [[User68 KB (10,323 words) - 15:37, 25 October 2014
- ...converted or expanded in the [[incubator]]. Consider doing or improving a language pair (see [[incubator]], [[nursery]] and [[staging]] for pairs that need wo == Language Data ==22 KB (3,123 words) - 20:07, 21 March 2023
- ...nguage (<code>SL</code>) will be trained using information from the target language (<code>TL</code>). ==Language pair==11 KB (1,470 words) - 08:16, 8 October 2014
- ...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started. ...translation systems. The only thing you need to do is write the data. The data consists, on a basic level, of three dictionaries and a few rules (to deal19 KB (3,164 words) - 20:58, 2 April 2021
- |title=Add recursive transfer support to a language pair that doesn't support it |description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfe32 KB (4,862 words) - 06:23, 5 December 2019
- ...rtium.org page uses an installation which currently only runs ''released'' language pairs (also available from https://apertium.org/apy if you prefer). However $ curl -G --data "lang=kir&modes=morph&q=алдым" https://beta.apertium.org/apy/analyse37 KB (5,132 words) - 16:36, 5 June 2020
- ...]. The instructions are very different. This page is for existing language data. ...mar or HFST. If that happens, follow instructions under [[Install language data by compiling#Missing dependencies | missing dependencies]].5 KB (843 words) - 19:44, 2 March 2023
- ...f the big language data sets. You do not want to add to or modify language data, you want to use it. <span style="color:darkorange;">'''Data may be outdated'''</span>, use only for system assessment. See the main sec3 KB (445 words) - 12:38, 24 April 2017
- ...chine translation to understand the general meaning of the text in foreign language. The other approach is instead that of "dissemination" in which the MT is a ...(coding and decoding), data (linguistic data) and support tools to convert data and make them compatible with the engine. Even if most RBMT systems are pri21 KB (3,171 words) - 14:34, 3 April 2017
- ...on Ubuntu/Debian, using the Voikko plugins and Giellatekno/Divvun language data. ==Install the language data==4 KB (596 words) - 21:02, 2 April 2021
- ...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. ===Complete guide to accessing the data===3 KB (561 words) - 17:58, 14 January 2018
- * https://apertium.org is the official site, and offers all the released language pairs ...Apertium platform, and also offers a simple web interface to the released language pairs6 KB (851 words) - 21:45, 26 July 2022
- ...epository scheme. (Originally, all monolingual language data was found in language pairs, meaning that there was a lot of duplication.) If you feel something ...hat constitutes a minimally-useful language package; generally, however, a language package should have over 60% coverage on a variety of corpora and should pr15 KB (1,783 words) - 22:33, 1 February 2019
- ...to be translated. For example, HTML tags must not be translated in another language, but only the text of the Web page. ...e same software are used for every language pairs. It is the format of the data to be translated which will take to use a particular deformatter.58 KB (8,365 words) - 20:16, 26 June 2018
- ====When running configure script for language pair data==== ====Workaround when language pairs need updated configure.ac's====20 KB (3,153 words) - 08:13, 24 May 2019
- '''Apertium New Language Pair HOWTO''' ...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started.36 KB (5,933 words) - 16:14, 22 February 2021
- ...tion of each module with more precision. They may also introduce technical language which linguists and/or computer coders would use. The technical description References to 'xxx' and 'yyy' refer to a language code, for example 'en-es'; 'English' to 'Spanish'.29 KB (4,687 words) - 16:28, 5 June 2020
- ...ngsnes (ed.) Bauta: Janne Bondi Johannessen in memoriam, Oslo Studies in Language 11(2), 2020. 489–501. (ISSN 1890-9639 / ISBN 978-82-91398-12-9) ...system/files/swj1419.pdf The apertium bilingual dictionaries on the web of data]. Semantic Web, 9(2), 231-240.33 KB (4,418 words) - 11:52, 29 December 2021
- ...Iberian peninsula, but is now being used to translate between more distant language pairs. ...ngineering ([http://www.prompsit.com http://www.prompsit.com]). Linguistic data are being developed by Transducens, the Seminario26 KB (3,122 words) - 06:25, 27 May 2021
- [[Target-language tagger training|In English]] ...t changez les variables <code>DATA</code> et <code>DIRECTION</code>. <code>DATA</code> doit pointer vers le répertoire contenant les données de la paire12 KB (1,625 words) - 08:20, 8 October 2014
- ...uide on how to use a development version of Apertium to make a change in a language pair. ...ou should try this to make sure things work before you move on to whatever language pair you plan on working on.10 KB (1,626 words) - 17:46, 13 January 2020
- ...http://wiki.apertium.org/wiki/Mandarin_Chinese#In_Apertium some linguistic data in Apertium]. ...fers to the most commonly spoken form of Chinese that is the sole official language of China and Taiwan. It is also known as Putonghua or Standard Chinese ([[W16 KB (2,148 words) - 03:28, 16 December 2015
- ...mpire, as did all Romance languages. There are currently 4 released French language pairs ...the sixth most spoken language in the world and is the second most studied language worldwide.15 KB (2,081 words) - 07:14, 12 August 2020
- ...MT based on corpora: adding new languages is very easy. To create a new language pair, in fact, it is not necessary to include corpora with millions of word ...airs can be added by creating dictionaries and rules containing linguistic data in XML format.15 KB (2,339 words) - 00:41, 4 June 2018
- Owing to the different syntactic structure of the phrases in each language, some Although the details of the modules and the linguistic data is presented in58 KB (8,964 words) - 11:11, 14 May 2016
- DATA=/home/philip/Apertium/gsoc2013/monolingual/data ...atterns-frac-maxent.py $DATA/setimes.sh-mk.freq $DATA/setimes.sh-mk.ambig $DATA/setimes.sh-mk.annotated > events 2>ngrams3 KB (520 words) - 21:25, 14 February 2014
- ...//d3js.org/ D3.js] tool that depicts all Apertium [[list of language pairs|language pairs]] in an interactive graph initially developed sometime before the [[G === Updating language data by scraping ===5 KB (702 words) - 01:34, 9 December 2018
- ...appear at the beginning of a sentence. The unique thing about the persian language though, is that they use prepositions which is quite uncommon in many SOV l ...designed a Two-sided morphology analyst of nouns and adjectives in Persian language, using Xerox Finite State Technology as giving input word (adjective or nou16 KB (2,597 words) - 20:58, 12 January 2013
- The number of language pairs in development for Apertium is increasing, and so is the complexity o language pairs. With better tools, more people will be able to develop language pairs.29 KB (4,382 words) - 07:53, 6 October 2019
- === Language pairs === .../github.com/apertium/apertium-urd-hin?files=1 apertium-urd-hin] Linguistic data for the Apertium Urdu-Hindi machine translator6 KB (806 words) - 00:45, 7 December 2018
- ...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. Here are the steps to access the data:2 KB (296 words) - 21:12, 13 January 2018
- ...ind that are incorrectly translated, to getting involved in creating a new language pair or programming on tools or user interfaces. Here are some question fre Our language agnostic tools are native and written in [https://en.wikipedia.org/wiki/C++7 KB (1,139 words) - 06:27, 27 May 2021
- ...he implementation of the algorithms must be free/open-source, but also the data themselves. Nowadays, there are many machine translation packages of this t ...morphologically rich languages, which even with large corpora suffer from data sparseness.6 KB (905 words) - 17:26, 18 October 2010
- ...-supervised.make this one] from en-eo. You will need modify it to fit your language pair. This usually means editing the first few lines. ===Tagger data directory===3 KB (537 words) - 13:44, 18 June 2014
- ...of the main five data files in any language pair (see also: [[Apertium New Language Pair HOWTO]]). ....dix'' where ''apertium-A-B'' is the name of the [[List of language pairs| language pair]]. For example file ''apertium-af-nl.af-nl.dix'' is the bilingual dict7 KB (1,244 words) - 16:41, 17 March 2018
- *'''langpair''': language pair to use for translation curl -G --data "langpair=eng|spa&q=run" http://localhost:2737/dictionaryLookup5 KB (712 words) - 21:27, 16 August 2016
- * Apertium language pairs .../engine of Apertium installed (including the requirement lttoolbox, but no language pairs yet).9 KB (1,367 words) - 09:17, 26 May 2021
- |Language You will need to install NLTK and NLTK data. Unfortunately, they both only support Python versions 2.6-2.7. If you are14 KB (2,232 words) - 12:51, 26 September 2018
- ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.12 KB (1,464 words) - 12:00, 31 January 2012
- ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.11 KB (1,519 words) - 06:51, 11 May 2013
- ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.11 KB (1,519 words) - 18:27, 16 October 2015
- ...ll these language pairs. This means that the data can be re-used by other language projects (e.g. in developing spelling or grammar checkers, thesauri, etc). This project was accepted as part of our "adopt a language pair" idea12 KB (1,917 words) - 15:54, 12 September 2009
- ...probably try this to make sure things work before you move on to whatever language pair you plan on working on. Note that some existing language pairs have external dependencies, like HFST or Constraint Grammar. The [[In10 KB (1,715 words) - 12:29, 28 May 2018
- ...tended to show how you can make an "indirect" contribution, by documenting language resources, helping us to build bilingual test sets, translating, promoting, ...first language, and translate them to the other. A translation in a third language may be useful in enlisting help, but is not required.9 KB (1,494 words) - 05:58, 18 March 2015