Search results

  • ...grammatical and other transformations between the two languages involved, and lexical data for the part-of-speech tagger, which is in charge of the disam ...s, and keep only the bilingual data (lexical selection, bilingual transfer and transfer rules) in the "pair" directory.
    50 KB (7,915 words) - 00:04, 10 March 2019
  • ...nd ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development. ...un's [http://www.lama.univ-savoie.fr/~humayoun/UrduMorph/ Urdu Morphology] and convert to lttoolbox format. ||align=center| 8–10 || [[User:Francis T
    187 KB (21,006 words) - 22:14, 12 November 2012
  • ...-generated statistics regarding our morphological dictionaries (analysers) and bilingual dictionaries. Any changes may be overwritten. Counts are not de ...use [http://apertiumtrad.tuxfamily.org/genlistedic.php?lang=eng this link] and take the source code of what you get to paste it as the content of this wik
    125 KB (7,186 words) - 19:11, 11 November 2018
  • * Text and web page translations for most of the language pairs that reached at least ...ing new ones. Remember in that case to register yourself on the page [[Language and pair maintainer]].
    41 KB (5,016 words) - 16:08, 6 February 2024
  • .../www.dlsi.ua.es/ http://www.dlsi.ua.es/]) within the Universitat d'Alacant and also by the company Prompsit Language Engineering ([http://www.prompsit.com ...mber of companies including Prompsit Language Engineering, Imaxin|software and Eleka Ingenieritza Linguistikoa, as well as by independent free software de
    26 KB (3,122 words) - 06:25, 27 May 2021
  • I’m a sociolinguist working on language maintenance and shift. I'm very interested in creating resources for minoritised languages. *Because its community is strongly committed to under-resourced and minoritised/marginalised languages.
    16 KB (2,285 words) - 06:46, 12 April 2019
  • ...ease for nine Romance languages: [[Aragonese]] (an), [[Asturian]] (ast), [[Catalan]] (ca), [[French]] (fr), [[Galician]] (gl), [[Italian]] (it), [[Occitan]] ( ...able pairs for [[English]] and monodirectional stable pairs for [[Basque]] and [[Esperanto]].
    5 KB (589 words) - 12:07, 9 December 2019
  • * [[Afrikaans and English]]<nowiki>*</nowiki> * [[Afrikaans and Dutch]]<nowiki>*</nowiki>
    3 KB (360 words) - 11:07, 23 March 2018
  • ...of Linguistics through the courses “Theory and techniques of translating” and “Applied linguistics” at the University of Cagliari. ...e of the engine MT, but also to be able to intervene on the code to modify and change the rules.
    21 KB (3,171 words) - 14:34, 3 April 2017
  • ...pertium-cat]]) and as a component of several pairs which translate to/from Catalan. ...center | <code>[[apertium-arg-cat]]</code> || align=center | Aragonese <-> Catalan || align=center | 12 Sep 2016
    4 KB (548 words) - 14:23, 5 October 2019
  • ...ly a few person months to develop, taking advantage of existing resources, and the extensible nature of the engine. ...ion system, a lot can be said for realistic expectations, careful planning and effective contributions by volunteer developers.
    12 KB (1,679 words) - 12:00, 31 January 2012
  • ...ly a few person months to develop, taking advantage of existing resources, and the extensible nature of the engine. ...ion system, a lot can be said for realistic expectations, careful planning and effective contributions by volunteer developers.
    12 KB (1,683 words) - 08:42, 10 May 2013
  • ...ly a few person months to develop, taking advantage of existing resources, and the extensible nature of the engine. ...ion system, a lot can be said for realistic expectations, careful planning and effective contributions by volunteer developers.
    12 KB (1,683 words) - 11:00, 30 October 2015
  • The following 51 pairs have released versions and are considered to be stable: *Spanish-Catalan ([http://www.apertium.org/?id=translatetext&dir=spa-cat spa-cat])
    8 KB (1,034 words) - 04:09, 13 October 2023
  • This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includes information on the ...grade the Romanian-Catalan language pair to the monolingual package system and develop it to bring it to release quality. In addition, four other language
    7 KB (1,071 words) - 10:48, 14 August 2018
  • ...age:Language_pair_packages_screenshot1.png|thumb|350px|right|The English ⇆ Spanish package running as a standalone Java application. The same file could be us ...ired), and they can work on practically any platform (Linux, OS X, Windows and even Android!).
    11 KB (1,497 words) - 08:23, 7 April 2020
  • ...(<code>es</code>), [[French]], [[Galician]], [[Portuguese]], [[Romanian]] and [[Italian]]. The languages are related with varying levels of mutual intel ..., [[Neapolitan]], [[Piedmontese]], [[Romansh]], [[Sicilian]], [[Venetian]] and [[Walloon]].
    18 KB (2,312 words) - 18:25, 18 September 2016
  • ...e who are interested in using the dictionaries as analysers or generators, and not as part of a language pair. ...ox XML dictionary file <code>apertium-bn-en.bn.dix</code> into an analyser and save it as <code>bn.analyser.bin</code>:
    5 KB (682 words) - 20:36, 19 March 2021
  • ...se the work for integrating the missing entries in Apertium morphological and bilingual dictionaries. == Catalan ==
    76 KB (7,700 words) - 17:38, 2 December 2012
  • ...common form of this is Pronominal Anaphora, where the anaphor is a pronoun and the antecedent is a noun. '''''Jessica''' is in the sixth grade and this is '''her''' father.''
    20 KB (3,107 words) - 21:13, 24 June 2022
  • ...ien]] (<code>ast</code>), l'[[Aragonese|aragonais]] (<code>an</code>), l'[[Spanish|espagnol]] (<code>es</code>), le [[French|français]] (<code>fr</code>), le * [[English and Portuguese|Anglais et portugais]]
    2 KB (213 words) - 12:37, 7 October 2014
  • ...rules from https://github.com/apertium/apertium-en-es/tree/ambiguous-rules and Apertium core with modified transfer module from https://svn.code.sf.net/p/ |English - Spanish
    11 KB (1,215 words) - 14:43, 17 April 2018
  • ...ra and a dictionary containing rules to conjugate verbs and to match nouns and adjectives. ...no one better than a translator trained in the use of MT to prepare texts and to correct their outputs, since s/he will be aware, even before the transla
    15 KB (2,339 words) - 00:41, 4 June 2018
  • ...to update “<tt>lang-test</tt>”, go into the “<tt>lang-test</tt>” directory and run the following: * <pre>** MEDIUM ACCURACY</pre> The coverage is greater than 50% and lower than or equal to 80%.
    75 KB (7,440 words) - 17:12, 8 August 2014
  • ...n and language technology. We try to focus our efforts on lesser-resourced and marginalised languages, but also work with larger languages. ...xiliary tools is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but also independent free-s
    13 KB (2,013 words) - 12:21, 20 June 2019
  • ...and language technology. We try and focus our efforts on lesser-resourced and marginalised languages, but also work with larger languages. ...xiliary tools is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but also independent free-s
    11 KB (1,802 words) - 19:51, 12 April 2021
  • ...uick and easy deployment with any web server, it is relatively lightweight and user-friendly. It is currently located on Github at [https://github.com/ape Using Html-tools is as easy as cloning from git, changing the configuration and running make.
    17 KB (2,558 words) - 05:25, 24 October 2019
  • ...ed training for machine translation purposes, but much quicker to produce, and with less effort (you don't have to manually tag a corpus). In this descrip ...an ←→ Catalan (<code>apertium-oc-ca</code>) language pair in the Occitan → Catalan (<code>oc-ca</code>) direction. You will need to substitute values that ref
    11 KB (1,470 words) - 08:16, 8 October 2014
  • ...e same between languages, even if both languages express both definiteness and indefiniteness by means of articles. ...ary arrest of Aung San Suu Kyi is illegal as '''-the''' international laws and '''-the''' same Burmese laws.
    3 KB (388 words) - 11:29, 24 March 2012
  • * Trosterud, Trond and Antonsen, Lene (2020). [https://journals.uio.no/osla/article/view/8514/7453 * Cherivirala, S., Chiplunkar, S., Washington, J. and Unhammer, K. (2018). [https://www.aclweb.org/anthology/W18-2207.pdf Apertiu
    33 KB (4,418 words) - 11:52, 29 December 2021
  • ..., a language package should have over 60% coverage on a variety of corpora and should probably have at least 2500 stems to be considered minimally useful. ...guages apertium-languages]. You may also want to browse all the languages and pairs using the [https://apertium.github.io/apertium-on-github/source-brows
    15 KB (1,783 words) - 22:33, 1 February 2019
  • ...same analysis, but only one should be generated; e.g. if you want "colour" and "color" to have the same analysis <nowiki><code>colour<n><sg></code></nowik ...tries are only included when you compile it "rl", from Catalan to Spanish (and LR entries are ignored).
    2 KB (368 words) - 23:00, 4 April 2019
  • .... This page intends to collect a comparison of tagging systems in Apertium and give some ideas of what could be done to improve them. The scripts to generate these results are written in Python and available from SVN, /branches/apertium-tagger/experiments/ : https://svn.co
    16 KB (1,448 words) - 16:50, 22 August 2017
  • ...what is called an 'engine'. It only talks in computer code, or takes input and output in forms very close to computer code. ...s. Building in Apertium needs an interest and an enthusiasm for languages, and a will to edit; with that, you are ready to go.
    17 KB (2,835 words) - 16:16, 24 January 2017
  • ...Tyers, for the duration of the project, following the timing and deadlines of the Google Summer of Code program. ...rdinian. It is a collaboration between the Autonomous University of Barcelona and Prompsit, funded by Google via the program ''Google Summer of Code''.
    7 KB (1,110 words) - 11:34, 23 August 2016
  • ...ht want to tell them what your skills are, what you think you are good at, and ask if they have any ideas for how you might get started. That's where Aper Each of our mentors has a list of languages which they speak (in bold) and which they understand, so you can communicate in the language which you pre
    9 KB (1,164 words) - 15:01, 1 April 2021
  • ;Catalan ;Catalan
    13 KB (1,689 words) - 21:42, 28 February 2021
  • * https://apertium.org is the official site, and offers all the released language pairs ...sit is a company heavily involved in development of the Apertium platform, and also offers a simple web interface to the released language pairs
    6 KB (848 words) - 12:51, 1 April 2024
  • Some universities in the Catalan-speaking regions have their own Apertium-powered translation portals: [http ...ductor/ Softcatalà], [http://www.softvalencia.org/traductor/ Softvalencià] and [http://www.softaragones.org/traductor/ Softaragonés] also provide Apertiu
    3 KB (339 words) - 12:09, 5 November 2018
  • ...contiguous multiword expressions, for example, taking 'liggja ekki fyrir' and reordering it as 'liggja# fyrir ekki'. In many languages, such as English, Norwegian and Icelandic, there are discontiguous multiwords, e.g. phrasal verbs, that we
    4 KB (632 words) - 12:33, 4 March 2016
  • == Download and install Apertium DixTools == ...lang-pair-text''': the name of the language pair (Spanish-Catalan, English-Catalan, etc.)
    4 KB (471 words) - 05:03, 8 March 2018
  • * angla-esperanto: [[English and Esperanto/Outstanding tests]] * franca-esperanto: [[French and Esperanto/Outstanding tests]]
    2 KB (286 words) - 20:28, 10 February 2011
  • ...h and Catalan, which are ambiguous between impersonal, direct object and reflexive. One way of fixing this would be to look at a parallel corpus and learn which constructions are more frequently translated in one way or anot
    767 bytes (122 words) - 11:44, 24 March 2012
  • ...or example, consider the equivalence <code>tiempo:temps</code> in Spanish--Catalan. In the bilingual dictionary we would need three entries: This is because in the Catalan monolingual dictionary, the paradigm for ''temps'' has only a single entry:
    2 KB (284 words) - 07:32, 14 January 2016
  • [[Category:Spanish and Catalan|*]] [[Category:Spanish]]
    5 KB (717 words) - 13:10, 8 March 2011
  • ...translation above the level of being able to distinguish nouns from verbs (and prepositions etc.) ...evel, of three dictionaries and a few rules (to deal with word re-ordering and other grammatical stuff).
    36 KB (5,933 words) - 16:14, 22 February 2021
  • ...Italian) or apertium eng-cat(English to Catalan), both of which are known and stable. You should try this to make sure things work before you move on to Using the same terminal, you can easily download and add the language pairs you want using a command like:
    10 KB (1,626 words) - 17:46, 13 January 2020
  • # translate from the command line, and then use your favorite subtitling application (e.g. Gaupol or Jubler) to [[ Read on for more information on method 2 and 3.
    6 KB (996 words) - 18:40, 21 April 2018
  • ...tell the whole story – see [[Evaluation]] for some discussion of strengths and limitations of WER/BLEU. * [[wikipedia:Word error rate|Word Error Rate]] (WER) and Position-independent Word Error Rate (PWER) are measures of post-edition ef
    9 KB (1,233 words) - 09:10, 21 November 2021
  • ...as a standalone analyser/generator ([[English#Apertium-eng|apertium-eng]]) and as a component of several pairs which translate to/from English. ...ign=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016
    10 KB (1,275 words) - 22:00, 1 May 2018
  • .../apertium/modes.dtd modes.dtd]) which specify which programs should be run and in what order. Normally each linguistic package has one of these files whic ...ent (eg. ca-es-anmor which only performs morphological analysis on Catalan and nothing else).
    6 KB (824 words) - 22:30, 24 January 2022
  • ...answering a series of multiple-choice questions. The word is the question, and the possible analyses are the answers. Unknown words can be thought of as q ...g for a word is important, as translation depends on it. For instance, the Spanish word ''canto'' can be a verb or a noun. When translating to English, they h
    2 KB (294 words) - 09:47, 12 November 2017
  • Some ideas for expanding and promoting Apertium, like a scratchpad or something. * On Spanish-French, Catalan-French
    3 KB (408 words) - 15:35, 26 September 2016
  • ...//developer.qt.nokia.com/doc/qt-4.8/linguist-ts-file-format.html TS-files] and uses the Apertium web service. But if you want to do it with a local instal ...with translations from the &lt;source&gt; language (typically English) to Catalan:
    893 bytes (138 words) - 09:12, 30 May 2014
  • ...b.com/apertium/apertium-cat-cos apertium-cat-cos] - [[Catalan and Corsican|Catalan-Corsican translation pair]] * [https://github.com/apertium/apertium-spa-cos apertium-spa-cos] - Spanish-Corsican translation pair
    1 KB (176 words) - 01:37, 22 May 2019
  • (tens of thousands of examples needed) and the training time is also long. ...gh our results are not directly comparable because we used the PTB POS set and the Penn2Malt dependency labels,
    4 KB (593 words) - 19:03, 29 November 2016
  • ...panded to deal with more divergent language pairs (such as English-Catalan and even Basque→English). The platform provides ...necessary to build a machine translation system for a given language pair and
    10 KB (1,500 words) - 16:23, 18 February 2016
  • <li>Mediawiki: 2059 sentences - source, Apertium translated and postedited by humans (only bel -> rus)</li> <li>Tatoeba: 1762 sentences: source, target and both ways Apertium translated (bel -> rus, rus -> bel)</li>
    7 KB (1,033 words) - 15:27, 15 August 2018
  • ...s a standalone analyser/generator ([[Romanian#Apertium-ron|apertium-ron]]) and as a component of several pairs which translate to/from Romanian. * '''[[apertium-es-ro|Spanish-Romanian]]''' (trunk)
    7 KB (889 words) - 09:53, 28 November 2018
  • Maltese and Hebrew # <s>Look at how attached/clitic pronouns can be treated (Spanish, Catalan, Italian have similar requirements)</s>
    8 KB (1,159 words) - 04:41, 9 March 2018
  • ...on tests]], [[Catalan and Esperanto]], [[Spanish and Esperanto]], [[French and Esperanto]]. Mostly I'm interested in Danish-Esperanto and English to Esperanto.
    4 KB (628 words) - 13:57, 6 September 2010
  • First, find the bilingual dictionary you want and convert it to text format: Now go and edit the file <code>en-cats.txt</code> and add the equivalent Apertium tag to each line, e.g.
    4 KB (755 words) - 11:52, 26 September 2016
  • [[Category:Catalan and Esperanto]] [[Category:Spanish and Esperanto]]
    4 KB (626 words) - 14:34, 7 September 2010
  • ...uage]]) is a Romance language primarily spoken in Portugal, Brazil, Angola and Mozambique. * [http://www.apertium.org/?id=translatetext&dir=es-pt/ Spanish ⇆ Portuguese]
    2 KB (302 words) - 16:23, 26 December 2017
  • ...lity. The results will not be as bad as typical, paste in text in one box, and paste in translation because we leave out the second stage of analysis. Here is an example of translating from English to Spanish through Catalan.
    2 KB (218 words) - 07:46, 8 October 2014
  • Assuming you have apertium and your language pair installed, this is how you do it: # Download tmx-tools and enter the downloaded directory:
    1 KB (153 words) - 02:43, 10 March 2018