Search results

Jump to navigation Jump to search
  • Websites referencing Apertium categorised by language of the website. News about Apertium categorised by language of report.
    13 KB (1,689 words) - 21:42, 28 February 2021
  • ...ium-init to bootstrap a new language pair (optionally with new monolingual data packages as well). ...is script in your working directory where you will be downloading language data. You can get the script from https://apertium.org/apertium-init
    5 KB (824 words) - 15:30, 20 April 2021
  • ...re to be done and different potential ways to apply NLP techniques to help language learners). ...ne translation and NLP is that a rule based system can explain itself to a language learner (some statistical/ML approaches can learn rules - such hybrid syste
    2 KB (288 words) - 18:21, 22 August 2016
  • ...m project develops a free/open-source platform for machine translation and language technology. We try and focus our efforts on lesser-resourced and marginalis ...eloped around the world, both in universities and companies (e.g. Prompsit Language Engineering) and by a growing numbers independent free-software developers.
    6 KB (1,057 words) - 15:34, 28 October 2013
  • ...e machine translation engine and has been expanded to treat more divergent language pairs. It is well-designed and allows everyone to contribute to it. This en Second, the linguistic data files are encoded in XML-based formats. XML files are easy to understand, w
    7 KB (1,097 words) - 02:39, 21 March 2014
  • !rowspan=2| Language ==Existing language pairs==
    5 KB (538 words) - 15:52, 11 April 2015
  • == GSoC application: apertium hbs-eng, adopting a language pair == One of my majors is Linguistics, the other is English Language and Literature. Other than the simple fact that machine translation gives q
    6 KB (987 words) - 15:28, 16 May 2014
  • ...lation fascinates me. The core problem that translation of a text from one language to other can’t be solved by simple substitution of words, catches my inte I plan to “Adopt an unreleased language pair”, or to be precise, three language pairs: mar-hin, guj-hin, mar-guj. Mar-hin and guj-hin pairs are in incubato
    11 KB (1,617 words) - 11:06, 29 April 2017
  • ...ipedia:Indonesian language]]) is an Austronesian language and the official language of Indonesia. Since it is a register of [[Malay]], it is also often general In [[Apertium]], there is a language pair of [[Indonesian and Malaysian]] already in the [[Trunk|trunk category]
    5 KB (629 words) - 13:08, 21 December 2019
  • | width=320 | '''[[Apertium New Language Pair HOWTO]]''' | [[Become a language pair developer for Apertium]]
    13 KB (1,601 words) - 23:31, 23 July 2021
  • ...and everyone, and of course we don't have time, or inclination, to learn a language, just for a work, just for a e-mail answer … A machine translation become ...an. Indeed, we tell the computer to mimic the human in its own domain, the language.
    10 KB (1,635 words) - 09:42, 8 April 2011
  • If you're working on language data, <code>sudo</code> is pretty much only for running package managers like <c ...exception is <code>sudo make install</code>, but when working on language data you should never have to do this.
    856 bytes (144 words) - 12:52, 3 May 2018
  • ..."State-of-the-art Morphological Analayser for Uzbek language and improved language pairs uz-kk, uz-ky, uz-tr". After discussions with mentors, the best path t ...rtium-tur-uzb) translation pair, Southeast European Times(SETimes) website data collection in Turkish was used(around 3.7M tokens).
    5 KB (722 words) - 16:16, 5 September 2020
  • ...ings of the 9th International Workshop on Finite State Methods and Natural Language Processing, pages 39--47.</ref> and understand what it does. ! Language
    34 KB (5,431 words) - 16:27, 29 October 2013
  • ...rs independent free-software developers. There are currently 40 published language pairs within the project (including a number of "firsts" — for example Sp ...ommunication) often occurs at this age, and if we can show them that their language is useful, and other people care, and there is no barrier for its use in th
    6 KB (987 words) - 10:21, 7 November 2014
  • The study of natural language processing is fascinating to me, and machine learning is a remarkably pract ...at the output of the system becomes intelligible, valid text in the target language.
    4 KB (575 words) - 10:03, 16 April 2017
  • ...s a computational linguist, it would be great to apply my knowledge in the language theory to machine translation.<br/> ...o useful: machine learning gets successful when we have access to tones of data, but that is not what we have dealing with dead or minority languages. That
    6 KB (925 words) - 16:09, 27 March 2018
  • ...e official web site – it serves only the ''released'' (stable) versions of language pairs ** This is the official "beta" site – it serves the latest work in all language pairs (so things may work better, but also may have weird bugs). You can al
    3 KB (457 words) - 07:42, 18 June 2021
  • ...ecifies the parameters and data files specific to that language pair. Each language pair can contain a number of modes; most of these are used for debugging ea ...b server. We use apertium-nn-nb as an example, but it should work with any language pair; the modules lt-proc/cg-proc/apertium-{tagger,pretransfer,transfer,int
    13 KB (2,039 words) - 11:56, 3 June 2022
  • ...ding period &mdash; and for documentation. Anyone thinking of working on a language pair should make sure that they read about [[testvoc]] and other quality co ...all]] Apertium and a language pair; read through the [[:Category:HOWTO|new language pair HOWTO]]. This might even give you some more ideas!
    9 KB (1,509 words) - 23:51, 27 February 2023
  • ...nasi. I love learning new languages, and I have a keen interest in Natural Language Processing and Linguistics. I have also contributed to Apertium previously ...to translation is very interesting and more interpretable compared to the data-hungry, uninterpretable black boxes that modern-day machine learning-based
    6 KB (918 words) - 06:00, 2 April 2024
  • ** Language pages [[French]], [[Spanish]], [[Nahuatl]], [[Dutch]] * Language pairs:
    2 KB (218 words) - 16:46, 9 December 2012
  • ...textbook distinction in language, isn't it? When you start exploring real data the boundaries fade very fast and everything looks much more complicated.
    22 KB (2,150 words) - 20:21, 24 April 2013
  • ...statistical parser, which in turn can serve different purposes of natural language processing. For creating a good treebank, manual annotation and/or disambig ...interface allows to work with CoNLL-U and CG3 formats, and to convert the data between the formats. It also allows to either upload or paste corpora in pl
    6 KB (930 words) - 15:59, 29 August 2017
  • ...d of existing trained models. Successful tries are saved into new training data.<ref>https://static.googleusercontent.com/media/research.google.com/en//pub ...butions can also be found [https://github.com/tesseract-ocr/tesseract/wiki/Data-Files-Contributions here].
    2 KB (305 words) - 14:36, 28 October 2018
  • ...rning engineer. My role was developing sentiment analysis model for Arabic language. ...urses, I had to use python/ R and Tableau to perform analysis on different data-sets.
    8 KB (1,258 words) - 15:30, 27 April 2020
  • * Converting another language pair | Complete, fully documented system with full ruleset for at least one language pair
    14 KB (2,141 words) - 21:26, 13 August 2019
  • ...er]] or [[CG]] files. It creates fully working Makefiles and stub language data, so you can compile and test straight away (assuming you've [[Installation|
    744 bytes (108 words) - 20:38, 13 January 2021
  • ...thub. What this actually means is that you can set an apertium language or language pair on github to automatically build and test on each commit. You only nee This is an example for a monolingual data using hfst (from [apertium-fin]):
    2 KB (249 words) - 06:26, 27 May 2021
  • File:Pet1.png
    ...h he/she is going to provide Input Text data and also needs to specify the language into which the Translation needs to be done. After that the user will enter the text data in the box located at the Left hand side of the page and clicks on the "Tra
    (1,280 × 800 (96 KB)) - 18:52, 2 April 2010
  • Apertium language data for Iraqi Turkmen. [[Category:Language data]]
    1 KB (144 words) - 20:07, 15 July 2021
  • ...temen kan maken. Het enige wat je zelf moet doen, is de data schrijven. De data bestaat uit 3 belangrijke delen, de woordenboeken, en enkele regels (woordv ...ems van de oorspronkelijke taal(source language='sl')of de doeltaal(target language='tl') kan kiezen en veranderen.
    36 KB (5,761 words) - 14:34, 4 December 2011
  • ...temen kan maken. Het enige wat je zelf moet doen, is de data schrijven. De data bestaat uit 3 belangrijke delen, de woordenboeken, en enkele regels (woordv ...ems van de oorspronkelijke taal(source language='sl')of de doeltaal(target language='tl') kan kiezen en veranderen.
    36 KB (5,767 words) - 07:07, 16 February 2015
  • ...ion is a very complex problem that depends on almost all fields of natural language processing. As such, it is a very "enabling" field, and can benefit from th ...ings of the 9th International Workshop on Finite State Methods and Natural Language Processing, pages 39--47.</ref>. However, the library currently used to par
    10 KB (1,561 words) - 15:22, 28 May 2013
  • While training can be done directly in the language directory, it is a better idea to train the tagger with copies of the files ...e the training directory (replace <code>lang</code> with the corresponding language code).
    4 KB (651 words) - 13:36, 23 August 2017
  • {{Language Kashmiri is an Indo-Aryan language spoken in the Kashmir Valley and regions around it that were historically a
    6 KB (811 words) - 10:42, 2 July 2018
  • == Proposal: Bringing 4 language pairs up to release quality == ...stvoc and lexical selection that will result in a valid text in the target language.
    4 KB (614 words) - 13:00, 7 April 2019
  • ..., transfer rules, scripting, corpora. The objective is to make an Apertium language pair state-of-the-art, or close to state-of-the-art in terms of translation ...ge pair of your choice in Apertium and install it. (see [[Install language data by compiling]])
    2 KB (383 words) - 19:46, 2 March 2023
  • | 64 || Apertium-tolk should give proper warning when no linguistic data is installed || 2008-03-31 || Wynand Winte ...rg/cgi-bin/bugzilla/index.cgi here]. Please feel to report your bug in any language you are comfortable with.
    12 KB (1,254 words) - 22:08, 7 March 2018
  • | clip || - || N/A || part &rarr; value || Obtains the part in the only language there is (inter/post-chunk) and pushes the value onto the stack ...|| - || link-to || part, pos &rarr; value || Obtains the 'part' in source language in position 'pos' and pushes the 'value' onto the stack. An optional operan
    14 KB (2,020 words) - 13:58, 7 October 2014
  • ...;13:00&nbsp; || &nbsp; '''Practical''': Installing Apertium and creating a language pair ....sf.net/p/apertium/svn/branches/courses/helsinki_2013/slides/session7a.pdf Data consistency, quality] and [https://svn.code.sf.net/p/apertium/svn/branches/
    8 KB (720 words) - 15:18, 20 March 2015
  • # Most language pairs don't need to specify anything else for install-data-local: install-data-local: install-modes
    4 KB (612 words) - 13:09, 18 February 2015
  • install-data-local: Most language pairs have lines like
    3 KB (482 words) - 15:54, 24 March 2014
  • This page contains data for CLD2 coverage. If need help to obtain CLD2 coverage of a certain language, contact [[User:Wei2912]].
    75 KB (7,440 words) - 17:12, 8 August 2014
  • Where LANGUAGE_PAIR is language pair (e.g. en-eo) wget http://sunsite.unc.edu/pub/Linux/system/keyboards/console-data-1999.08.29.tar.gz
    2 KB (281 words) - 02:58, 9 March 2018
  • ** Select a language ** Use the Apertium morphological analyser to analyse the test data
    1 KB (213 words) - 21:13, 18 March 2019
  • ...s, data, and other system resources with applications, software tools, and data of the Unix-like environment. Therefore it is possible to launch Windows ap Now you're ready to download and build language pairs and use them under Cygwin's shell.
    12 KB (1,883 words) - 22:06, 7 March 2018
  • ...is it possible to achieve pretty good results having very small amount of data (like in case of Breton) ...ad of the original syntax module in kmr-eng pipeline. The testpack for two language pairs was built. All code was cleaned up, some docstrings were written. Als
    6 KB (833 words) - 12:56, 22 August 2017
  • * répertoire es-tagger-data : Contient les données nécessaires pour le tagger espagnol (corpus, etc.) * répertoire ca-tagger-data : Contient les données nécessaires pour le tagger catalan (corpus, etc.)
    54 KB (8,480 words) - 18:55, 10 April 2017
  • If you want to work on Apertium language pairs or tools, some knowledge of the Unix shell / command-line scripting w ...hell/ shell scripting] and [https://hacker-tools.github.io/data-wrangling/ Data wrangling] are relevant and succinct
    746 bytes (101 words) - 09:20, 8 February 2019

View (previous 50 | next 50) (20 | 50 | 100 | 250 | 500)