Search results

Jump to navigation Jump to search

Page title matches

Install language data using packaging
...f the big language data sets. You do not want to add to or modify language data, you want to use it. <span style="color:darkorange;">'''Data may be outdated'''</span>, use only for system assessment. See the main sec

3 KB (445 words) - 12:38, 24 April 2017
Install language data by compiling
...]. The instructions are very different. This page is for existing language data. ...mar or HFST. If that happens, follow instructions under [[Install language data by compiling#Missing dependencies | missing dependencies]].

5 KB (843 words) - 19:44, 2 March 2023

Page text matches

Generating lexical-selection rules from a parallel corpus
* an Apertium language pair Make a folder called data-en-es. We are going to keep all the generated files there.

15 KB (2,206 words) - 13:58, 7 October 2014
Generating lexical-selection rules from monolingual corpora
* A language pair (e.g. apertium-br-fr) ** The language pair should have the following two modes:

12 KB (1,634 words) - 18:26, 26 September 2016
Learning rules from parallel and non-parallel corpora
Your language pair should be fully set up in the direction that you're training for, and * an Apertium language pair

14 KB (2,181 words) - 19:01, 17 August 2018
Talk:Task ideas for Google Code-in
...SVN made it so this script which is very handy for downloading an Apertium language/pair doesn't fetch the newest packages anymore. This also means that beta.a |description=Currently, some Apertium pairs/language modules use CI but it's very inconsistent and doesn't come by default. Aper

397 KB (52,731 words) - 11:22, 10 December 2019
Running the monolingual rule learning
* Train a target side language model (http://hermes.fbk.eu/people/bertoldi/teaching/lab_2010-2011/img/irst * The language pair must support the pretransfer and multi modes. See apertium-sh-mk/modes

4 KB (503 words) - 19:01, 17 August 2018
Contributing to an existing pair
This is a guide on how to add linguistic data directly to an existing language pair in Apertium. It gets a bit technical – if you just want to notify us ...t-of-speech tagger, which is in charge of the disambiguation of the source language text.

50 KB (7,915 words) - 00:04, 10 March 2019
Using linguistic resources
...iew to the kind of data and resources that can be useful in building a new language pair for Apertium, and how to go about building them if they do not already Each Apertium language pair requires 3 dictionary files. For instance, for the English-Afrikaans

13 KB (2,112 words) - 12:11, 26 May 2023
User:Srbhr/GSOC 2020 Proposal: Automatic PostEditing
== Title: Automatic Post-Editing/Improving Language Pairs by Mining Post-Edits == ...stha University, New Delhi. I'm interested in Machine Learning and Natural Language Processing, and always seek to find ways to improve stuff based on them. I

23 KB (3,704 words) - 17:29, 30 March 2020
Talk:Ideas for Google Summer of Code
...'Rationale''':<br>One of the most frustrating things when developing a new language pair is that you have to get the tags ''just right'' in order to be able to ...adequate translation. For instance, you might want to know in an ergative language if an absolutive is subject or object while translating. A shallow function

71 KB (10,374 words) - 21:14, 18 January 2021
User:Ksnmi/Application
...interested in Language, and machine translation is a part of handling the language change. I have been working with understanding both theoretically as well a ...the kind of projects whose implementation will help translation on all the language pairs on apertium at the end.

32 KB (5,064 words) - 09:19, 21 March 2014
Target-language tagger training
...nguage (<code>SL</code>) will be trained using information from the target language (<code>TL</code>). ==Language pair==

11 KB (1,470 words) - 08:16, 8 October 2014
User:Shash42/GSoC 2020 Proposal: Bilingual Dictionary Discovery
I am interested in Natural Language Processing, Deep Learning, and Applied Math. I am fascinated by computation ...s inherently targeted at low-resource, similar yet mutually unintelligible language-pairs. It is a dream organization for me because I get to contribute to a c

24 KB (3,699 words) - 22:49, 31 March 2020
Ideas for Google Summer of Code
...converted or expanded in the [[incubator]]. Consider doing or improving a language pair (see [[incubator]], [[nursery]] and [[staging]] for pairs that need wo == Language Data ==

23 KB (3,198 words) - 09:15, 4 March 2024
User:RomanZegarski/GSoC2011 proposal
...between languages and creating rules making possible to translate from one language to another one is intriguing process. ...allows to pass information regardless of language in which was created and language known by person retrieving it. Even if translation isn't perfect it gives a

7 KB (1,021 words) - 19:30, 7 April 2011
User:Deltamachine/proposal2018
<li>Theory of Language (Phonetics, Morphology, Syntax, Semantics)</li> <li>Language Diversity and Typology</li>

16 KB (2,445 words) - 09:19, 26 March 2018
Task ideas for Google Code-in (2013)
...language pair XX-YY by adding 50 words to its vocabulary || Add words to language pair XX-YY and test that the new vocabulary works. [[/Add words|Read more]] ...language pair || Add or correct a structural transfer rule to an existing language pair and test that it works. [[/Add transfer rule|Read more]]... || [[User

68 KB (10,323 words) - 15:37, 25 October 2014
User:Mjaskowski
I know as well how much time does one need to learn yet another language. I can only imagine problems arising if one wants to learn a language which is not as popular as languages I have mentioned above (for example, t

19 KB (3,209 words) - 18:45, 9 April 2010
Languages Of Russia
...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. ===Complete guide to accessing the data===

3 KB (561 words) - 17:58, 14 January 2018
Install language data using packaging
...f the big language data sets. You do not want to add to or modify language data, you want to use it. <span style="color:darkorange;">'''Data may be outdated'''</span>, use only for system assessment. See the main sec

3 KB (445 words) - 12:38, 24 April 2017
The quick and dirty guide to making a new language pair
...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started. ...translation systems. The only thing you need to do is write the data. The data consists, on a basic level, of three dictionaries and a few rules (to deal

19 KB (3,164 words) - 20:58, 2 April 2021
User talk:Muki987
The Chinese police, according to official data, 1,317 people detained. `According to official data the Chinese police detained 1,317 people.'

85 KB (13,901 words) - 20:42, 19 June 2009
Install language data by compiling
...]. The instructions are very different. This page is for existing language data. ...mar or HFST. If that happens, follow instructions under [[Install language data by compiling#Missing dependencies | missing dependencies]].

5 KB (843 words) - 19:44, 2 March 2023
Using Giellatekno Divvun spellers with LibreOffice-Voikko on Debian
...on Ubuntu/Debian, using the Voikko plugins and Giellatekno/Divvun language data. ==Install the language data==

4 KB (596 words) - 21:02, 2 April 2021
Task ideas for Google Code-in
|title=Add recursive transfer support to a language pair that doesn't support it |description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfe

32 KB (4,862 words) - 06:23, 5 December 2019
Installation
* https://apertium.org is the official site, and offers all the released language pairs ...Apertium platform, and also offers a simple web interface to the released language pairs

6 KB (848 words) - 12:51, 1 April 2024
Apertium-apy
...rtium.org page uses an installation which currently only runs ''released'' language pairs (also available from https://apertium.org/apy if you prefer). However $ curl -G --data "lang=kir&modes=morph&q=алдым" https://beta.apertium.org/apy/analyse

37 KB (5,132 words) - 16:36, 5 June 2020
Grfro3d/proposal apertium cat-srd and ita-srd
...chine translation to understand the general meaning of the text in foreign language. The other approach is instead that of "dissemination" in which the MT is a ...(coding and decoding), data (linguistic data) and support tools to convert data and make them compatible with the engine. Even if most RBMT systems are pri

21 KB (3,171 words) - 14:34, 3 April 2017
User:Rcrowther
</ref> and language data on your system (developers may also want to consider their operating enviro ==== For translators: Install language data/dictionaries/pairs from repositories ====

4 KB (643 words) - 12:55, 24 April 2017
Entraînement d'un tagueur de langue cible
[[Target-language tagger training|In English]] ...t changez les variables <code>DATA</code> et <code>DIRECTION</code>. <code>DATA</code> doit pointer vers le répertoire contenant les données de la paire

12 KB (1,625 words) - 08:20, 8 October 2014
Languages
...epository scheme. (Originally, all monolingual language data was found in language pairs, meaning that there was a lot of duplication.) If you feel something ...hat constitutes a minimally-useful language package; generally, however, a language package should have over 60% coverage on a variety of corpora and should pr

15 KB (1,783 words) - 22:33, 1 February 2019
Installation troubleshooting
====When running configure script for language pair data==== ====Workaround when language pairs need updated configure.ac's====

20 KB (3,153 words) - 08:13, 24 May 2019
User:Khannatanmai/GSoC2020Proposal Trimming
...ence if one learns to create good tools for MT, they learn most of Natural Language Processing. A tool which is rule-based and open source really helps the community with language pairs that are resource- poor and gives them free translations for their ne

30 KB (4,918 words) - 16:55, 31 March 2020
Running the MaxEnt rule learning
DATA=/home/philip/Apertium/gsoc2013/monolingual/data ...atterns-frac-maxent.py $DATA/setimes.sh-mk.freq $DATA/setimes.sh-mk.ambig $DATA/setimes.sh-mk.annotated > events 2>ngrams

3 KB (520 words) - 21:25, 14 February 2014
User:Shraier/GSoC2012-Application1
...translation makes you notice these little differences by making the cross-language variation explicit. Since I have been involved in the translation area for ...of the field to refine the results of the automatically produced data. All data is organized in the XML files, which are humanly readable and editable. I b

14 KB (2,289 words) - 11:27, 6 April 2012
User:Khannatanmai/GSoC2019Proposal
...ence if one learns to create good tools for MT, they learn most of Natural Language Processing. A tool which is rule-based and open source really helps the community with language pairs that are resource- poor and gives them free translations for their ne

26 KB (4,048 words) - 18:50, 18 March 2020
Transfer rules examples
...to be translated. For example, HTML tags must not be translated in another language, but only the text of the Web page. ...e same software are used for every language pairs. It is the format of the data to be translated which will take to use a particular deformatter.

58 KB (8,365 words) - 20:16, 26 June 2018
Documentation of Matxin 1.0
Owing to the different syntactic structure of the phrases in each language, some Although the details of the modules and the linguistic data is presented in

58 KB (8,964 words) - 11:11, 14 May 2016
Flyer
...Iberian peninsula, but is now being used to translate between more distant language pairs. ...ngineering ([http://www.prompsit.com http://www.prompsit.com]). Linguistic data are being developed by Transducens, the Seminario

26 KB (3,122 words) - 06:25, 27 May 2021
User:Shraier/GSoC2012-Application2
...translation makes you notice these little differences by making the cross-language variation explicit. Since I have been involved in the translation area for ...of the field to refine the results of the automatically produced data. All data is organized in the XML files, which are humanly readable and editable. I b

14 KB (2,245 words) - 11:33, 6 April 2012
Publications
...ngsnes (ed.) Bauta: Janne Bondi Johannessen in memoriam, Oslo Studies in Language 11(2), 2020. 489–501. (ISSN 1890-9639 / ISBN 978-82-91398-12-9) ...system/files/swj1419.pdf The apertium bilingual dictionaries on the web of data]. Semantic Web, 9(2), 231-240.

33 KB (4,418 words) - 11:52, 29 December 2021
Workflow reference
...tion of each module with more precision. They may also introduce technical language which linguists and/or computer coders would use. The technical description References to 'xxx' and 'yyy' refer to a language code, for example 'en-es'; 'English' to 'Spanish'.

29 KB (4,687 words) - 16:28, 5 June 2020
User:Aha/GsocApplication
...translation, being a sub-field of NLP, enables to explore the grammar of a language and deal with it from a computational perspective. I really like the idea o ...vated community like Apertium's it is possible to accomplish such numerous language-pair translation. The project supports both widely spoken languages and min

11 KB (1,672 words) - 20:56, 9 April 2010
Finding numbers of speakers from the Russian census
...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. Here are the steps to access the data:

2 KB (296 words) - 21:12, 13 January 2018
User:Deltamachine/proposal2017
<li>Theory of Language (Phonetics, Morphology, Syntax, Semantics)</li> <li>Language Diversity and Typology</li>

13 KB (2,187 words) - 09:56, 23 March 2018
User:Saswata Bose/GSoC2024Proposal
* Apertium allows one, as a language lover, to work very closely on a language both from a linguistic and a computational perspective. I, being a Research * ''Interested Task: '' Add a new variety to an existing language

9 KB (1,397 words) - 15:31, 2 April 2024
Pairviewer
...//d3js.org/ D3.js] tool that depicts all Apertium [[list of language pairs|language pairs]] in an interactive graph initially developed sometime before the [[G === Updating language data by scraping ===

5 KB (702 words) - 01:34, 9 December 2018
Hindi
=== Language pairs === .../github.com/apertium/apertium-urd-hin?files=1 apertium-urd-hin] Linguistic data for the Apertium Urdu-Hindi machine translator

6 KB (806 words) - 00:45, 7 December 2018
Apertium New Language Pair HOWTO
'''Apertium New Language Pair HOWTO''' ...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started.

36 KB (5,933 words) - 16:14, 22 February 2021
User:Gang Chen/GSoC 2013 Application: "Sliding Window PoS Tagger"
...language is a very complicated system, and translating a sentence from one language to another has always been a challenging task. Meanwhile, the translation n ...are approximations to the real structural rules in a language system, many language pairs have proved a high translation quality. So hopefully, we can expect t

21 KB (3,340 words) - 10:56, 28 May 2018
User:Mfoat/GSoC 2012 Application
...periments and develops all the possible methods, tools and technologies of data processing in natural languages (NLP). Therefore, it can be viewed as some ...pporting mainly the European languages; first of all they are focused on a language pair including English. Besides, those systems are not open, which makes it

10 KB (1,535 words) - 09:26, 5 April 2012

Retrieved from "https://wiki.apertium.org/wiki/Special:Search"

Navigation menu