Search results

Jump to navigation Jump to search
  • == Improving language pairs by mining MediaWiki Content Translation postedits == ...and bidix entries to improve the performance of an Apertium language pair. Data is available from Wikimedia content translation through an [API https://www
    3 KB (383 words) - 19:56, 24 March 2020
  • ...language, as Apertium offers the only machine translation system for this language pair. The idea is to make Occitan output easier to postedit and French outp ...guage data], [https://github.com/apertium/apertium-fra the French language data], and [https://github.com/apertium/apertium-oci-fra the Apertium Occitan-F
    2 KB (213 words) - 19:48, 24 March 2020
  • </li><li>'''Mathematics and graphing Language''': SciLab and Matlab </li><li>'''Database language''': SQL
    4 KB (512 words) - 11:34, 11 April 2013
  • === Altai Language Resources === Crúbadán language data for Southern Altai. Kevin Scannell. 2015. The Crúbadán Project. oai:cruba
    2 KB (217 words) - 06:57, 5 December 2017
  • * <s>why when we add more data, do the results get worse ? </s> * run br-fr test with huge data.
    16 KB (1,524 words) - 10:49, 22 November 2012
  • (in this example, I use eng as language resp. eng-deu as pair) the file ./eng-tagger-data/eng.dic for some reasons is empty (has a file size of 0).
    1 KB (165 words) - 14:16, 28 August 2016
  • ...in some cases data or tools from Freeling could be useful to apertium, and data from apertium could be useful to Freeling. Also, to install the data, I had to change the lines in freeling/data/Makefile.am that looked like
    5 KB (720 words) - 02:20, 10 March 2018
  • ...Everything in Apertium is free/open source: engine, data for more than 29 language pairs and tools to translate at a speed of more than 20,000 words per secon === Useful data ===
    1 KB (175 words) - 14:19, 25 July 2012
  • ...to be an improvement to the architecture of Apertium and would benefit all language pairs. ...u need to read some documentation, wikis, sites, and the like in a foreign language. Any improvements to the translation quality we can think of will make the
    10 KB (1,671 words) - 18:37, 24 March 2016
  • ...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair. ...dictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.
    22 KB (2,532 words) - 11:36, 30 July 2018
  • ...e>[http://www.ethnologue.com/subgroups/dravidian dra]</code>) constitute a language family of about 70 languages spoken primarily in South Asia. The four most ...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair.
    19 KB (2,201 words) - 09:21, 9 December 2019
  • ...master’s degree at the University of Helsinki, where I majored in Finnish language and culture but also focused on other Finno-Ugric languages, particularly N ...nline forums and channels without having to use Russian as an intermediate language. This would promote the use of the languages in wider domains and prevent f
    10 KB (1,561 words) - 15:46, 27 March 2018
  • ...ter plan involves generating independent finite-state transducers for each language, and then making individual dictionaries and transfer rules for every pair. ...ictionary for the pair X→Y. Below is listed development progress for each language's transducers and dictionary pairs.
    35 KB (3,577 words) - 15:24, 1 October 2021
  • ...y aimed at related-language pairs but expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides * a language-independent machine translation engine
    776 bytes (114 words) - 19:07, 12 September 2018
  • ...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato
    12 KB (1,683 words) - 11:00, 30 October 2015
  • ...ll the unigram models from “A set of open-source tools for Turkish natural language processing.”<ref name="trmorph-tools">http://coltekin.net/cagri/papers/tr ...tuff.”<ref name="prerequisites">[[Installation#If you want to add language data / do more advanced stuff]]</ref>
    20 KB (3,229 words) - 20:06, 12 March 2018
  • The main problem of such languages is that they lack written (and annotated) data, thus stopping us from applying most of ML-methods (for example, neural net ...ple at Apertium are doing a lot of good work making low-resourced-language data available and I'd like to contribute to that.
    5 KB (776 words) - 19:50, 21 May 2019
  • ...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato
    12 KB (1,679 words) - 12:00, 31 January 2012
  • ...m project develops a free/open-source platform for machine translation and language technology. We try to focus our efforts on lesser-resourced and marginalise ...ped around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but independent free-software developers also play a huge rol
    11 KB (1,680 words) - 12:22, 20 June 2019
  • ...on-months (four people, 18 months) to develop (both engine, and linguistic data). It was widely used, with thousands of requests per day. ...sh State to rewrite the code as open-source, and to convert the linguistic data. After one person year, the first version of the Spanish--Catalan translato
    12 KB (1,683 words) - 08:42, 10 May 2013

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)