Search results

Ideas for Google Summer of Code/automatic-postediting
== Improving language pairs by mining MediaWiki Content Translation postedits == ...and bidix entries to improve the performance of an Apertium language pair. Data is available from Wikimedia content translation through an [API https://www

3 KB (383 words) - 19:56, 24 March 2020
Ideas for Google Summer of Code/Apertium Occitan French
...language, as Apertium offers the only machine translation system for this language pair. The idea is to make Occitan output easier to postedit and French outp ...guage data], [https://github.com/apertium/apertium-fra the French language data], and [https://github.com/apertium/apertium-oci-fra the Apertium Occitan-F

2 KB (213 words) - 19:48, 24 March 2020
User:Arinkverma
</li><li>'''Mathematics and graphing Language''': SciLab and Matlab </li><li>'''Database language''': SQL

4 KB (512 words) - 11:34, 11 April 2013
Altay
=== Altai Language Resources === Crúbadán language data for Southern Altai. Kevin Scannell. 2015. The Crúbadán Project. oai:cruba

2 KB (217 words) - 06:57, 5 December 2017
User:Francis Tyers/Experiments
* <s>why when we add more data, do the results get worse ? </s> * run br-fr test with huge data.

16 KB (1,524 words) - 10:49, 22 November 2012
Error: A new ambiguity class was found
(in this example, I use eng as language resp. eng-deu as pair) the file ./eng-tagger-data/eng.dic for some reasons is empty (has a file size of 0).

1 KB (165 words) - 14:16, 28 August 2016
Freeling
...in some cases data or tools from Freeling could be useful to apertium, and data from apertium could be useful to Freeling. Also, to install the data, I had to change the lines in freeling/data/Makefile.am that looked like

5 KB (720 words) - 02:20, 10 March 2018
Fisl13
...Everything in Apertium is free/open source: engine, data for more than 29 language pairs and tools to translate at a speed of more than 20,000 words per secon === Useful data ===

1 KB (175 words) - 14:19, 25 July 2012
User:Nikita Medyankin/GSoC 2016 WTR Proposal
...to be an improvement to the architecture of Apertium and would benefit all language pairs. ...u need to read some documentation, wikis, sites, and the like in a foreign language. Any improvements to the translation quality we can think of will make the

10 KB (1,671 words) - 18:37, 24 March 2016
Iranian languages
...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair. ...dictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.

22 KB (2,532 words) - 11:36, 30 July 2018
Dravidian languages
...e>[http://www.ethnologue.com/subgroups/dravidian dra]</code>) constitute a language family of about 70 languages spoken primarily in South Asia. The four most ...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair.

19 KB (2,201 words) - 09:21, 9 December 2019
User:Kanjbaba
...master’s degree at the University of Helsinki, where I majored in Finnish language and culture but also focused on other Finno-Ugric languages, particularly N ...nline forums and channels without having to use Russian as an intermediate language. This would promote the use of the languages in wider domains and prevent f

10 KB (1,561 words) - 15:46, 27 March 2018
Turkic languages
...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair. ...ictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.

35 KB (3,577 words) - 15:24, 1 October 2021
Apertium
...y aimed at related-language pairs but expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides * a language-independent machine translation engine

776 bytes (114 words) - 19:07, 12 September 2018
Tartu Apertium Course/Session 8
...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato

12 KB (1,683 words) - 11:00, 30 October 2015
Unigram tagger
...ll the unigram models from “A set of open-source tools for Turkish natural language processing.”<ref name="trmorph-tools">http://coltekin.net/cagri/papers/tr ...tuff.”<ref name="prerequisites">[[Installation#If you want to add language data / do more advanced stuff]]</ref>

20 KB (3,229 words) - 20:06, 12 March 2018
User:Nstsj/Proposal
The main problem of such languages is that they lack written (and annotated) data, thus stopping us from applying most of ML-methods (for example, neural net ...ple at Apertium are doing a lot of good work making low-resourced-language data available and I'd like to contribute to that.

5 KB (776 words) - 19:50, 21 May 2019
Курсы машинного перевода для языков России/Session 8
...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato

12 KB (1,679 words) - 12:00, 31 January 2012
Google Summer of Code/Application 2012
...m project develops a free/open-source platform for machine translation and language technology. We try to focus our efforts on lesser-resourced and marginalise ...ped around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but independent free-software developers also play a huge rol

11 KB (1,680 words) - 12:22, 20 June 2019
Helsinki Apertium Workshop/Session 8
...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato

12 KB (1,683 words) - 08:42, 10 May 2013

Search results

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools