Search results

Jump to navigation Jump to search
  • ...hese dictionaries can be found in the [[:Category:Documentation in English|documentation]] pages. | English || eng || apertium-eng.eng.dix || languages
    125 KB (7,186 words) - 19:11, 11 November 2018
  • ...pertium-eng]]) and as a component of several pairs which translate to/from English. In [[trunk]]:
    10 KB (1,275 words) - 22:00, 1 May 2018
  • This page gives a list of language pairs presently being developed in the Apertium project. ...n the original language, that it has a translation, but there is a problem in grammar rules or to generate properly the word translated. But for the majo
    41 KB (5,016 words) - 16:08, 6 February 2024
  • ...tool is able to run the modes, presenting the results from various stages in the translation workflow through a GUI). References to 'xxx' and 'yyy' refer to a language code, for example 'en-es'; 'English' to 'Spanish'.
    29 KB (4,687 words) - 16:28, 5 June 2020
  • ...ny ideas for how you might get started. That's where Apertium mentors come in! ...ich they speak (in bold) and which they understand, so you can communicate in the language which you prefer.
    9 KB (1,164 words) - 15:01, 1 April 2021
  • ...rules. In operation, shallow-transfer is distinguished from deep-transfer in that it doesn't do full syntactic parsing, the rules are typically operatio ...e xxx: this contains the rules of how words in language xxx are inflected. In our example this will be called: <code>apertium-hbs.hbs.dix</code>
    36 KB (5,933 words) - 16:14, 22 February 2021
  • This is the task ideas page for Google Code-in 2011(http://www.google-melange.com/gci/homepage/google/gci2011); here you c <b>For current GCI ideas, see [[Ideas for Google Code-in]]</b>
    187 KB (21,006 words) - 22:14, 12 November 2012
  • =English= The Apertium engine is being developed in the Transducens research group ([http://transducens.dlsi.ua.es http://trans
    26 KB (3,122 words) - 06:25, 27 May 2021
  • ...tten in English, see [[:Category:Documentation in English|documentation in English]]. ...pages anglaises a été traduite. Voir [[:Category:Documentation en français|documentation en français]].
    8 KB (1,034 words) - 04:09, 13 October 2023
  • ...ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowl The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum
    32 KB (4,862 words) - 06:23, 5 December 2019
  • This is the task ideas page for Google Code-in 2013 (http://www.google-melange.com/gci/homepage/google/gci2013), here you '''For current GCI task ideas, see [[Task ideas for Google Code-in]]'''
    68 KB (10,323 words) - 15:37, 25 October 2014
  • ...n systems that can produce even a decent sentence-based translation of the English-Chinese language-pair. ...ered language pair in the future. Technical rules and definitions are left in their Chinese forms to assist any future Chinese linguists/developers worki
    12 KB (1,637 words) - 16:03, 21 January 2016
  • This page serves as a summary of all the work done in the English-Catalan pair during Google Summer of Code 2017. For a more detailed workpla Work has affected several areas of the English-Catalan pair. You can see a full list of commits and modified files [https:
    5 KB (887 words) - 22:24, 31 August 2017
  • [[Documentation (français)|En français]] ...rtium.org/w/images/d/d0/Apertium2-documentation.pdf Apertium 2.0: Official documentation] (222 pages)'''
    5 KB (599 words) - 19:04, 17 February 2020
  • ...-eus|English to Basque]] system uses a wrapper around the Stanford parser. In this tutorial we're going to be using [[Constraint Grammar]] (CG) to do dep ...iguating sentences in Turkish. You'll give it a sentence like "Dün benim için aldığın birayı içeceğim." and get some output like:
    53 KB (8,811 words) - 04:05, 21 January 2017
  • ...slating between [[English]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-eng-cat trunk]. '''Transfer rules (English to Catalan):''' 221 (T1X), 60 (T2X), 38 (T3X)
    7 KB (973 words) - 21:38, 9 May 2018
  • ...led an 'engine'. It only talks in computer code, or takes input and output in forms very close to computer code. ...uter coding. How much you need to know depends on your interests. Building in Apertium needs an interest and an enthusiasm for languages, and a will to e
    17 KB (2,835 words) - 16:16, 24 January 2017
  • The incubator can be found in the 'incubator' column in https://apertium.github.io/apertium-on-github/source-browser.html. It house ...on work in the Incubator. Below you can put resources which will be useful in the construction. Try and mark them for licence, or at least free/non-free.
    10 KB (1,336 words) - 20:40, 11 December 2019
  • ...ends to give a step-by-step walk-through of how to create a new translator in the [[Matxin]] platform. ...(for the moment these can be found in <code>apertium-tools/freeling</code> in [[Using SVN|apertium SVN]]), if you're planning on using FreeLing for the a
    26 KB (4,167 words) - 13:05, 11 May 2016
  • =English to Kurmanji= ...ng-kmr|a group shall be considered to form a particular social group where in particular:.|komek divê wek komeke civakiye taybet bête dîtin gava bi ta
    61 KB (10,648 words) - 15:50, 2 October 2016
  • ...enerate the target-language inflected forms. For this, we use lt-proc, but in generation (-g), not analysis mode. ...a machine translation system that translates a Serbo-Croatian noun into an English noun. Obviously this isn't very useful, but we'll get onto the more complex
    35 KB (5,602 words) - 07:14, 16 February 2015
  • If you do not have these packages, please install them in the following order: ** For English, you also need the "corpora" package.
    14 KB (2,232 words) - 12:51, 26 September 2018
  • =English to Sorani= ...ng-ckb|a group shall be considered to form a particular social group where in particular:.|}}
    73 KB (5,523 words) - 19:26, 2 October 2016
  • ...ives a brief overview to the kind of data and resources that can be useful in building a new language pair for Apertium, and how to go about building the ...Apertium language pair requires 3 dictionary files. For instance, for the English-Afrikaans pair, these would be:
    13 KB (2,112 words) - 12:11, 26 May 2023
  • |<code>[[apertium-eng-cat]] </code>||align=center| 2006 || English ⇆ Catalan || [[User:Marcriera|Marc Riera]] || {{release-tc|1.0.0}} |<code>[[apertium-cy-en]] </code>||align=center| 2008 || Welsh ⇆ English || [[User:Donnek|Donnek]], [[User:Francis Tyers|Francis Tyers]]
    13 KB (964 words) - 08:20, 31 December 2020
  • ...tributed infrastructure (for example, to collaborate easier with engineers in an offshore country), and so on).]] ...d through a Web Service interface) to implement real-time translation (bot in input and output) of instant messages.]]
    24 KB (3,572 words) - 07:37, 8 March 2018
  • ...t build procedures, this page is principally for people who are interested in using the dictionaries as analysers or generators, and not as part of a lan ...some options to use values of the "alt", "v", "vl" or "vr" dix attributes in compilation:
    5 KB (682 words) - 20:36, 19 March 2021
  • very important to '''add words and rules in decreasing frequency''', starting The reason that words should be added in order of frequency is quite intuitive:
    16 KB (2,566 words) - 21:36, 15 March 2020
  • ...ontributing]]. If you get stuck while going through this guide, please get in touch on IRC or the mailing list (see [[Contact]] for how to do that). ...nguages involved, and lexical data for the part-of-speech tagger, which is in charge of the disambiguation of the source language text.
    50 KB (7,915 words) - 00:04, 10 March 2019
  • * Write documentation on the current status of transfer rules * Reorganise proper nouns in bidix
    5 KB (506 words) - 14:56, 28 August 2017
  • ==== Examples in English ==== ...he noun phrase ''this cat''. Where the word ''this'' is used as a modifier in Apertium we call it a determiner, and where it is used to substitute a noun
    6 KB (1,045 words) - 11:45, 7 October 2014
  • ...f there is no preposition before, but there is an article in Catalan, put "in". ...mmunity", or is (b) an error in the English (probably non-native writer) " In violation of the international law, they were tortured and executed"
    3 KB (388 words) - 11:29, 24 March 2012
  • The term '''multiword''' includes simple words that have spaces in them, words with separable parts, contractions and compounds of several lem ...ode>''' simply inserts a blank; use it if you want a word that has a space in it, but only inflection at the end
    21 KB (3,521 words) - 05:17, 10 December 2017
  • ...tps://link.springer.com/article/10.1007/s10590-021-09260-6 Recent advances in Apertium, a free/open-source rule-based machine translation platform for lo ...Vangsnes (ed.) Bauta: Janne Bondi Johannessen in memoriam, Oslo Studies in Language 11(2), 2020. 489–501. (ISSN 1890-9639 / ISBN 978-82-91398-12-9)
    33 KB (4,418 words) - 11:52, 29 December 2021
  • ...s interpunkcio k.t.p. Ni ne havas nestatn parton tie ĉi, kvankam ni vidos ĝin pli poste.
    34 KB (5,392 words) - 07:13, 16 February 2015
  • This page is a translation from Spanish into English of most of the documentation of [[Matxin]] with some minor ...on can be found [http://matxin.svn.sourceforge.net/viewvc/matxin/trunk/doc/documentation-es.pdf here]. This page does not describe how to install or use Matxin, for
    58 KB (8,964 words) - 11:11, 14 May 2016
  • ...here with a name like <code><lang>.crp.txt</code>. Make sure the corpus is in raw text format. Once you have your corpus in there you need a Makefile that specifies how to generate the probability fi
    7 KB (1,177 words) - 08:34, 8 October 2014
  • * Capability in APY to bypass captcha code (for testing) * Expansion of documentation on suggestions feature
    3 KB (369 words) - 17:06, 1 July 2016
  • This page provides general documentation on the transfer rules used in the English-Catalan pair. The tables are generated from the data and comments found wit == English → Catalan ==
    45 KB (7,840 words) - 10:56, 18 September 2017
  • ...le Apertium tags and parsing UD-tree into a dataset for Kazakh, Breton and English UD dependency treebanks</s> ...ing scripts for converting UD-treebanks (dev and test) of needed languages in Apertium stream format (converted treebanks will be useful for evaluating t
    6 KB (833 words) - 12:56, 22 August 2017
  • This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includ ...monolingual package system and develop it to bring it to release quality. In addition, four other language pairs have been upgraded to the monolingual p
    7 KB (1,071 words) - 10:48, 14 August 2018
  • ...dictionaries that allows a certain level of parameterisation of paradigms in [[monodix | monodixes]]. Metadixes are converted to the standard <code>.dix Metadixes are currently used in some language pairs, such as English-Catalan and Occitan-Catalan.
    5 KB (744 words) - 08:26, 25 April 2016
  • ...er hundreds, of thousands of words) to 'train' the automatic taggers found in some Apertium language pairs. Getting the right tag for a word is important This is why we have many hand-tagging tasks in the Google Code-In.
    13 KB (2,076 words) - 12:13, 26 September 2016
  • We will look at Esperanto <-> English and try to translate the sentence "La libro estas blua" to "The book is blu ...can also be added, removed or made into 'pointers' that points to the tags in the enclosing chunk.
    15 KB (2,237 words) - 12:41, 21 August 2015
  • English - upper-intermediate(IELTS 6.0 - 2015) Turkish - intermediate(5 years of studying in Kazakh-Turkish school)
    8 KB (1,094 words) - 13:10, 14 April 2019
  • ...this full example for English-Galician]</ref> (A-C) language pair '''from English-Spanish'''<ref>[http://apertium.svn.sourceforge.net/viewvc/apertium/trunk/a To get an '''English-Galician (A-C)''' language pair from '''English-Spanish (A-B)''' and '''Spanish-Galician (B-C)''', you need the following d
    6 KB (689 words) - 22:58, 25 October 2018
  • [[Contributing to an existing pair|In English]] ...entrées des mots dans les dictionnaires, consultez la section 3.1.2 de la documentation du système.
    54 KB (8,480 words) - 18:55, 10 April 2017
  • [[Matxin New Language Pair HOWTO|In English]] ...onnaires et grammaires. La plate-forme Matxin est décrite en détail dans [[Documentation de Matxin]] et sur la [http://matxin.sourceforge.net page Matxin de sourcef
    29 KB (4,618 words) - 07:18, 8 October 2014
  • '''These are the key steps one has to take in order to create a new language pair:''' # Decide whether each language in your pair is best implemented with '''[[#Language Properties|lttoolbox or H
    19 KB (3,164 words) - 20:58, 2 April 2021
  • ...''what'' you want or ''how'' you want to do it. You need to do both -- but in different places (but it's quite intuitive). ...if it is only singular, or only plural, then this information needs to go in the [[bilingual dictionary]].
    28 KB (4,478 words) - 12:25, 17 March 2022
  • Here you will find the documentation for the Anaphora Resolution module created during the Google Summer of Code Anaphora Resolution is the process of resolving references to earlier items in discourse.
    20 KB (3,107 words) - 21:13, 24 June 2022
  • ...t be shy to get creative in your use of resources, but be wary of mistakes in various resources (even seemingly reputable ones!) or over-all low-quality ...h pages include [[English and Esperanto/Outstanding tests]] and [[Welsh to English/Regression tests]]. Like with these pages, you should probably plan to kee
    10 KB (1,615 words) - 07:43, 20 December 2015
  • Shallow transfer means there is no parse trees (which are used in "deep transfer"). By chunking (in three-stages): First we reorder words in the chunk, then we reorder chunks.
    4 KB (584 words) - 06:55, 8 October 2014
  • ...lish is not. So for example: I see in English would be translated as vidim in Serbo-Croatian. ...dictionary, we need to add both the verb, and the personal pronouns in the English morpohlogical dictionary. We'll go through both of these.
    26 KB (4,259 words) - 07:00, 16 February 2015
  • ...considered one lexical unit (all of it is within the same <code>^…$</code> in the output of the analyser), while during transfer they are considered sepa ...not in the dictionary, e.g. if <i>vier</i> and <i>letter</i> have analyses in the dictionary that give the non-final hidden tag, and <i>woorden</i> with
    4 KB (654 words) - 08:31, 6 April 2021
  • ::Use this if you want to do English--<something> (funny alignments for non-English pairs) ...heast European Times &mdash; http://opus.lingfil.uu.se/SETIMES.php &mdash; English,Turkish,Bulgarian,Macedonian,Serbo-Croatian,Albanian,Greek,Romanian &mdash;
    5 KB (746 words) - 20:36, 25 January 2020
  • ...e related to [[Apertium]]. It allows for deeper transfer than can be found in Apertium. This page describes how to install the system, see [[Matxin#Documentation]] below for how to create or maintain language pairs.
    1 KB (164 words) - 20:29, 7 May 2016
  • ...ctionary is to model the rules that govern the internal structure of words in a language. ...stood by the speaker reflect specific patterns and regularities in the way in which words are formed from smaller units and how those smaller units inter
    15 KB (2,200 words) - 12:04, 6 October 2014
  • This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, ...es have the same grammatical features (most don't have spatial distinction in articles for example).
    38 KB (4,492 words) - 15:36, 9 May 2024
  • ...''/home/kevin/downloads/linguistics'', but it can be anything you choose. In this note, this choice will be referred to as ''[homedir]''. Obviously, replace ''[homedir]'' here and in subsequent instances by your real path.
    5 KB (808 words) - 02:48, 9 March 2018
  • ...code. For example, you may wish to create a dictionary for a language not in Apertium repositories. If you invent a new language, or identify a new dial ...ect of the city of London called 'Cockney Rhyming Slang' is a variation on English, mainly substituting nouns. Cockney Rhyming Slang is well-known, but is/was
    4 KB (620 words) - 12:36, 24 January 2017
  • ...ing levels of mutual intelligibility. Many of these languages are included in Apertium already. Romance languages that are not yet covered in [[Apertium]] include [[Aromanian]], [[Arpitan]], [[Corsican]], [[Friulan]],
    18 KB (2,312 words) - 18:25, 18 September 2016
  • ...used by the tool ''apertium-transfer''. Writing tags used for [[chunking]] in a 3-stage transfer is not approached there. ...e text not to be translated. For example, HTML tags must not be translated in another language, but only the text of the Web page.
    58 KB (8,365 words) - 20:16, 26 June 2018
  • Currently transfer is the bottleneck in Apertium, processing here takes 95% CPU. ...is because the transfer file is being interpreted (tree walking of the XML in the transfer t1x file) instead of being compiled into machine code.
    16 KB (2,276 words) - 10:47, 9 December 2014
  • [[Romance languages|In English]] * [[English and Portuguese|Anglais et portugais]]
    2 KB (213 words) - 12:37, 7 October 2014
  • E-Mail: tejasvi.chebrolu@research.iiit.ac.in == '''Why is it that I am interested in Apertium?'''==
    9 KB (1,391 words) - 16:41, 31 March 2020
  • languages than the ones dealt with in the first stages of the engine, are more cases, and more critical, in which a source language word can
    11 KB (1,814 words) - 03:22, 9 March 2019
  • ...s kept in [https://github.com/apertium GitHub]. To browse the repositories in a more compact fashion, ...ode>[[incubator]], [[nursery]], [[staging]]</code> &mdash; these hold work-in-progress modules (mostly language pairs) that have not yet made it into <co
    8 KB (1,173 words) - 20:16, 10 March 2018
  • ...n choose between. Before, we had only fixed sets, e.g. British vs American English, which required compiling a new pipeline for each set of preferences. These ...e variation is possible and useful, systematising it, and then enabling it in the language pair by turning hard restrictions into ambiguity and selectors
    2 KB (239 words) - 09:33, 4 March 2024
  • [[Documentation|In English]] ''La [[Documentation|version anglaise de cette page]] mentionne les pages de ce wiki disponibles
    2 KB (329 words) - 13:41, 7 October 2014
  • [[Metadix|In English]] == A partir de la documentation ==
    5 KB (833 words) - 12:13, 7 October 2014
  • * apertium/lttoolbox/hfst, see [[Installation]], in particular the ''prerequisites'' parts. (You most likely don't need to go a * [[apertium-init]].py – put this script in your working directory where you will be downloading language data. You can
    5 KB (824 words) - 15:30, 20 April 2021
  • ...tep guide on how to use a development version of Apertium to make a change in a language pair. ...nguage pair, e.g. apertium-spa-ita(Spanish to Italian) or apertium eng-cat(English to Catalan), both of which are known and stable. You should try this to mak
    10 KB (1,626 words) - 17:46, 13 January 2020
  • ...r [[lexical selection]]. It documents a few approaches, with example rules in the [[constraint-based lexical selection]] format. ...in your source language (e.g. English) which has more than one translation in your target language (e.g. Spanish). For example
    11 KB (1,638 words) - 13:21, 7 December 2015
  • Once your dictionaries are of a reasonable size, say perhaps 3,000 lemmata in total, it is worth training the tagger. To do this, you'll need a couple of Another option for stripping Wikipedia, which will probably result in a higher quality corpus, is as follows. First download the Wikipedia extrac
    7 KB (1,058 words) - 07:37, 4 July 2016
  • ...for lttoolbox, apertium, and the language pairs, the installation is done in <code>/usr/local/bin</code> and <code>/usr/local/share</code>. If you wish tools will be installed in <code>/usr/bin</code> and man pages in <code>/usr/share/man</code> .
    5 KB (780 words) - 11:48, 15 June 2018
  • ...ia article [http://en.wikipedia.org/wiki/Separable_verb here]. These exist in most Germanic languages (Afrikaans, Danish, Dutch, German, Swedish, Norwegi For example, in Afrikaans, the verb "to announce" is "aankondig". The usage is as follows:
    8 KB (1,265 words) - 08:11, 2 January 2016
  • ...[[Wikipedia:Catalan language]]) is a [[Romance language]]. It is available in Apertium as a standalone analyser/generator ([[Catalan#Apertium-cat|apertiu In [[trunk]]:
    4 KB (548 words) - 14:23, 5 October 2019
  • ls /usr/lib/jvm/ if in doubt as to what the exact version on your system is; it may be different t Note where lttoolbox.pc gets placed, in case [[Installing_Apertium_3.0#When_running_configure_script_for_Aperitum|a
    7 KB (973 words) - 02:52, 20 May 2021
  • <spectie> you make templates from the paradigms in the danish analyser lemma in a config file, e.g. for nouns n.*.sg.nom
    2 KB (388 words) - 15:36, 26 January 2020
  • .../viewvc/apertium/trunk/apertium-transfer-tools/example/ example] directory in the source package, which should be the first place to look for information lt-proc (orthographic correction mode - unused in this example):
    6 KB (1,022 words) - 08:15, 8 October 2014
  • * A bilingual dictionary in Apertium format, e.g. [https://svn.code.sf.net/p/apertium/svn/trunk/apertiu ...tem where everything goes via English, so it's best to just start from the English dictionary which will be most complete. You can get a list of words as foll
    4 KB (755 words) - 11:52, 26 September 2016
  • In this page we describe how to morphologically disambiguate (tag) text so tha ...languages this is ok, for translating between less related languages (e.g. English--anything) it causes problems.
    2 KB (228 words) - 12:16, 26 September 2016
  • The idea of this project is to take advantage of all possible resources in creating MT systems for marginalised languages. The general idea is to use This idea is kind of like the TMX support in Apertium, only it goes at the end of the pipeline.
    5 KB (802 words) - 07:04, 10 May 2012
  • * http://aplica.prompsit.com/ – Prompsit is a company heavily involved in development of the Apertium platform, and also offers a simple web interfac ...versity of Tromsø works on Saami language pairs; this site runs the latest in-development version of Northern Saami→Norwegian Bokmål
    6 KB (848 words) - 12:51, 1 April 2024
  • Here are the main tips to help you in writing your GSOC application with Apertium. ...still interested, but we'll try to find a subset of it which is achievable in the time scale available.
    9 KB (1,509 words) - 23:51, 27 February 2023
  • == Why is it you are interested in machine translation? == ...ciolinguist working on language maintenance and shift. I'm very interested in creating resources for minoritised languages.
    16 KB (2,285 words) - 06:46, 12 April 2019
  • The same input word in a lexical processing module can be written differently regarding letter cas # The whole word is in lower case.
    5 KB (801 words) - 07:13, 8 October 2014
  • ...ically one file, a <code>.dix</code> file. This defines both how morphemes in the language are joined together, ''morphotactics'', and how changes happen These two phenomena are treated in the same file.
    19 KB (3,440 words) - 12:10, 26 September 2016
  • ...and other handy features. It is installable from the regular repositories in Ubuntu, Arch Linux, etc. ...ension for Gaupol in incubator. To install, simply check it out and put it in your Gaupol extensions directory. E.g. to install for just your user:
    6 KB (996 words) - 18:40, 21 April 2018
  • ...tallation is down; we're using https://sourceforge.net/p/apertium/tickets/ in the meanwhile''' found these in my feed reader:
    12 KB (1,254 words) - 22:08, 7 March 2018
  • * sıkıştığınız zaman soru sorabilmeniz için bir [[IRC]] programı In the area where it says SVN_MODULE, replace this with the the name of the sv
    10 KB (1,715 words) - 12:29, 28 May 2018
  • ...choose between a series of fine-grained senses if all these senses result in the same final translation. This page has some links to pages about lexical selection in Apertium.
    4 KB (625 words) - 08:36, 29 April 2015
  • ...more than one part-of-speech by [[morphological analysis]]. For example in English, the word "trap" can be both a singular noun ("It's a trap!") or a verb ("I ...verview of constraints (as in [[constraint grammar]]) and restrictions (as in <code>apertium-tagger</code>) as well.
    13 KB (1,833 words) - 11:51, 2 March 2018
  • To figure out what's going wrong in a translation, you have to understand how apertium (and the specific langua ...put stem in both of your monolingual dictionaries and that they're matched in the bilingual dictionary
    5 KB (903 words) - 11:57, 26 September 2016
  • # Install all dependencies by running the following in the terminal: # Install all dependencies by running the following in the terminal:
    4 KB (665 words) - 11:57, 18 November 2022
  • ...still not under the official Apertium organization. If you are interested in transferring this tool to the organization, see [[Migrating tools to GitHub ...ate all incoming messages from buddy1 to Spanish (assuming they will be in English).
    8 KB (1,263 words) - 02:18, 9 March 2018
  • ...itive, which is fair enough if you're not used to thinking of dictionaries in a particular way. This page hopes to be a '''basic''' introduction to how t ...g to, step by step, define a dictionary that will analyse and generate the English word "beer" and its plural form, "beers". The first section defines the alp
    11 KB (1,851 words) - 07:42, 16 February 2015
  • [[List of symbols|In English]] · [[Список символов|по-русски]] Note: Tous les balises XML sont expliqués en profondeur dans la [[documentation]] (pdf), voir aussi les fichiers dix.dtd/dix.rng dans [https://svn.code.sf.
    18 KB (2,089 words) - 11:18, 30 July 2016
  • ...terchunk is a class of interchunk (see "advanced transfer" in the official documentation). Interchunk transfer uses 3 steps and cascaded interchunk uses more than 3 Chunking is based on source language patterns. It is used in all language pairs.
    4 KB (669 words) - 14:12, 6 July 2012
  • [[Apertium on Mac OS X (Local)|In English]] See the Autoconf documentation.
    4 KB (610 words) - 10:57, 6 October 2014
  • [[Apertium on Mandriva|In English]] Pas encore de documentation spécifique, mais au vu des documentation des autres distributions, il faudrait avoir installé '''svn''' et peut êt
    264 bytes (35 words) - 11:15, 6 October 2014
  • [[Translating subtitles|In English]] * La documentation des outils de conversion sur [http://translate.sourceforge.nettoolkit/sub2p
    7 KB (1,019 words) - 10:42, 6 October 2014
  • [[Multiwords|In English]] ...]] et [https://wiki.apertium.org/w/images/d/d0/Apertium2-documentation.pdf documentation] (esp. sec.3.1.2.6).
    11 KB (1,823 words) - 16:45, 3 September 2018
  • ...tch usage, Pology is recommended; if you want Apertium to give suggestions in a po-editor, try Virtaal. in your ~/.bashrc (exchanging POLOGYDIR for the directory you checked it out t
    5 KB (803 words) - 13:03, 25 November 2019
  • == Changes in Apertium implementation mode == ...e current Apertium implementation, any project element is directly located in a first level subdirectory of "apertium" (https://github.com/apertium/) and
    8 KB (1,327 words) - 21:34, 17 February 2019
  • ...your choice to quickly find the errors. Open the original dictionary file in an editor and correct the errors you have found. ...led correctly. This can be done by filtering the list, excluding all words in a list of correctly spelled words.
    6 KB (1,009 words) - 13:14, 15 March 2015
  • ...still not under the official Apertium organization. If you are interested in transferring this tool to the organization, see [[Migrating tools to GitHub ...late all incoming messages from user1 to Spanish (assuming they will be in English).
    9 KB (1,465 words) - 02:19, 9 March 2018
  • ...e, the Spanish word ''canto'' can be a verb or a noun. When translating to English, they have different translations: * [verb] Los domingos ''canto'' en el coro → On Sundays I ''sing'' in the choir.
    4 KB (763 words) - 12:14, 26 September 2016
  • In some language pairs, syntactic function labels are used to improve disambig ...>@+FMAINV @←SUBJ</code> to <code>@→SUBJ @+FMAINV</code> as is the order in English.
    7 KB (922 words) - 08:16, 8 October 2014
  • This a general list. Also applies to Mac OS and building in a *nix environment under Windows. which is used in many examples on the wiki (but is not necessary)
    2 KB (271 words) - 05:35, 8 March 2018
  • ...w repository scheme. (Originally, all monolingual language data was found in language pairs, meaning that there was a lot of duplication.) If you feel ...ator] languages until they're minimally useful, at which point they can go in apertium-languages. There is no fixed criterion for what constitutes a min
    15 KB (1,783 words) - 22:33, 1 February 2019
  • ...results will not be as bad as typical, paste in text in one box, and paste in translation because we leave out the second stage of analysis. Here is an example of translating from English to Spanish through Catalan.
    2 KB (218 words) - 07:46, 8 October 2014
  • ...language]]. It is spoken primarily in Romania and Moldova. It is available in Apertium as a standalone analyser/generator ([[Romanian#Apertium-ron|aperti ...on.cfm?id=1596842 Girju, Roxana. "The syntax and semantics of prepositions in the task of automatic interpretation of nominal phrases and compounds: A cr
    7 KB (889 words) - 09:53, 28 November 2018
  • [[Chunking|In English]] * [[English and Esperanto]]
    4 KB (706 words) - 07:05, 8 October 2014
  • [[Chunking: A full example|In English]] * [[English and Esperanto]]
    16 KB (2,435 words) - 07:05, 8 October 2014
  • [[Unsupervised tagger training|In English]] R: Pour la paire [[English and Esperanto|anglais et espéranto]] on avait approximativement 13000 entr
    8 KB (1,298 words) - 08:43, 8 October 2014
  • ...dictionaries that are standard between dictionaries in a pair are not kept in one file, but several (for example symbol definitions). In some pairs, e.g. Catalan, Portuguese, there is support for generating a par
    3 KB (461 words) - 15:31, 26 September 2016
  • ...want to read [http://www.omegawiki.org/Help:OmegaWiki_database_layout the documentation for the OmegaWiki database] and [http://www.omegawiki.org/DefinedMeaning th 2 rows in set (0.00 sec)
    7 KB (862 words) - 11:49, 14 July 2011
  • ...such an orthographical issue is the "a" vs "an" difference in English. The english generator will output <code>~a</code>, and the post-generation FST changes ...tput will include the approximate line number of each rule that applies as in
    2 KB (311 words) - 18:14, 8 July 2022
  • ...Tagalog-Nenets dependency treebank via the pivot languages of Russian and English These pairs are ideal for development due to the closeness of the languages in question, or historical connection. Some are closer than others, but all ar
    3 KB (408 words) - 15:35, 26 September 2016
  • There is an init file in git that will give your Emacs some useful Apertium-related packages and set The rest of this page gives some documentation of the various modes.
    16 KB (2,637 words) - 12:46, 23 March 2022
  • ...tion, while <code>apertium-rehtml</code> de-encapsulates (restores) it, as in the following example: ...entities, <code>html-noent</code> keeps non-ASCII as-is; the difference is in the reformatter
    13 KB (1,781 words) - 09:49, 6 October 2014
  • ...an happen if you have another apertium-related PPA in your system, so look in /etc/apt/sources.list.d/ and check that only apertium-nightly.list is there The clue here is the "..". Autotools is looking for an auxiliary file in the parent directory of this one. This will happen if you check out one pro
    20 KB (3,153 words) - 08:13, 24 May 2019
  • Some languages have differences in lexis and grammar, but are still The languages are so similar that duplicating the work in many
    14 KB (2,355 words) - 10:33, 1 November 2022
  • ...lem with how formatting / superblanks interacts with word/chunk reordering in Apertium. ...bold has shifted from source word "bar" to the target word that was "foo" in the input.
    10 KB (1,676 words) - 18:32, 21 June 2020
  • This page describes the stream format used in the Apertium machine translation platform. Reserved characters should only appear escaped in the input stream unless they are part of a lexical unit, chunk or superblan
    6 KB (797 words) - 07:12, 29 March 2022
  • ...o produce, and with less effort (you don't have to manually tag a corpus). In this description, a part-of-speech tagger for the source language (<code>SL ...ger for the Occitan ←→ Catalan (<code>apertium-oc-ca</code>) language pair in the Occitan → Catalan (<code>oc-ca</code>) direction. You will need to su
    11 KB (1,470 words) - 08:16, 8 October 2014
  • [[Morphological dictionary|In English]] if S not in state_output_pairs:
    16 KB (2,423 words) - 12:07, 7 October 2014
  • ...ingual dictionary in a way which shows that there is a decision to be made in the transfer rules where the lexical information of the word doesn't direct == in Apertium ==
    2 KB (334 words) - 07:45, 8 October 2014
  • A tutorial built in to the editor can be opened by running <code>vimtutor</code> from a termina Vim starts in 'normal mode', and in order to type any text, you must first switch to 'insert mode'.
    2 KB (359 words) - 16:00, 2 January 2021
  • You will need to install <code>cmake</code>. To install it in Debian/Ubuntu just type <code>apt-get install cmake</code>. You will need to install <code>libicu-dev</code>. To install it in Debian/Ubuntu just type<code> apt-get install libicu-dev</code>.
    7 KB (1,116 words) - 20:57, 2 April 2021
  • | * || In that position or further in that direction If you want to refer to a subreading in a pattern you can use <code>1/N</code>
    8 KB (1,130 words) - 09:47, 23 March 2022
  • ...ng accuracy). This page intends to collect a comparison of tagging systems in Apertium and give some ideas of what could be done to improve them. The scripts to generate these results are written in Python and available from SVN, /branches/apertium-tagger/experiments/ : htt
    16 KB (1,448 words) - 16:50, 22 August 2017
  • apertium-nn-nb is now in a fairly usable state for translating both ending in -lig in nb typically end in -leg in nn) and checking whether
    12 KB (1,886 words) - 12:20, 20 June 2019
  • *[[Prerequisites for nix|*nix (in general)]] You should also put those lines in your <code>~/.bashrc</code> so you don't have to paste them into every term
    5 KB (821 words) - 02:55, 27 July 2022
  • * <code>forvaltningsordbok.nob</code>: A list of sentences in Norwegian * <code>forvaltningsordbok.sme</code>: Translations of the previous sentences in North Sámi
    5 KB (680 words) - 11:53, 26 September 2016
  • ...nstall the actual speller data into a non-standard directory; we'll put it in a special folder inside our home folder which is always searched. You can also search for <code>apertium</code> in case you want a language covered there:
    4 KB (596 words) - 21:02, 2 April 2021
  • ..."building the package", "testing", "uploading to SourceForge" and "tagging in git". You should now have a file in the source directory called <code>apertium-foo-bar-1.0.0.tar.gz</code>.
    3 KB (447 words) - 09:59, 26 April 2021
  • ...solved for writing a twol rule. This is the symbol that gets implemented in the morphology, and constitutes the first part of any rule. ...some restricted condition, then ''x'' is the input symbol, since it occurs in the must unrestricted environment, and ''y'' is a conditioned output symbol
    3 KB (513 words) - 12:26, 26 September 2016
  • In natural languages, we can have many different types of '''ambiguity''', e.g * During the financial crisis, '''banks''' received $1.2 trillion in loans from the Government (noun)
    4 KB (620 words) - 18:06, 26 September 2016
  • [[Apertium on Ubuntu or Debian|In English]] ...s préférées depuis Synaptic. Il y a une [http://doc.ubuntu-fr.org/synaptic documentation sympathique sur Synaptic ici]. Celà devrait également installer les dépe
    4 KB (696 words) - 11:18, 6 October 2014
  • [[File:pairviewer.png|right|thumb|350px|The Pairviewer in action.]] ...nteractive graph initially developed sometime before the [[GCI|Google Code-In]] 2013. Its source code can be [https://github.com/apertium/pairviewer foun
    5 KB (702 words) - 01:34, 9 December 2018
  • ...', you'll have to run it through an analyser. If you have a Spanish corpus in corpus.spa.bz2, then you can get a list of lemmas by analysing it with [[ap for lu in streamparser.parse_file(sys.stdin):
    2 KB (269 words) - 10:10, 9 August 2018
  • ...packaged pairs, see [[Installation]] on how to install pairs that are only in SVN.) if it's started. (Press q to get your terminal back if it put you in less-mode.)
    5 KB (640 words) - 21:02, 2 April 2021
  • ...ach linguistic package has one of these files which specifies various ways in which you can use the data to perform translations. In order to get some statistical information about translations made using Ape
    6 KB (824 words) - 22:30, 24 January 2022
  • We use <code>%&gt;</code> as a morpheme boundary indicator in lexc. == Syntax highlighting and folding in vim ==
    2 KB (281 words) - 11:48, 26 September 2016
  • ...e, consider the equivalence <code>tiempo:temps</code> in Spanish--Catalan. In the bilingual dictionary we would need three entries: This is because in the Catalan monolingual dictionary, the paradigm for ''temps'' has only a s
    2 KB (284 words) - 07:32, 14 January 2016
  • ...packaged pairs, see [[Installation]] on how to install pairs that are only in SVN.) if it's started. (Press q to get your terminal back if it put you in less-mode.)
    5 KB (653 words) - 21:00, 2 April 2021
  • '''GIZA++''' is a program for aligning words and sequences of words in sentence aligned corpora. If you have parallel corpus you can use GIZA++ to and wait... You can watch the log in <code>dictionary.log</code>... but the training is likely to take upwards o
    4 KB (589 words) - 11:51, 29 April 2015
  • ...al consonant mutations are currently analysed, disambiguated and generated in Apertium. It uses as an example the <code>apertium-ga-gd</code> language pa ...based on the type of mutation exhibited by the following noun, for example in the following two phrases:<ref>Note that this is not really relevant for Ir
    6 KB (1,000 words) - 07:12, 8 October 2014
  • This page documents how to count stems in dix files. There's a ready-made script that does it available at [http://s We want to import xml data first in the tree form.
    2 KB (396 words) - 02:41, 10 March 2018
  • ...s or phrases you find that are incorrectly translated, to getting involved in creating a new language pair or programming on tools or user interfaces. He ...en.wikipedia.org/wiki/C++ C++]. The various development helpers are mostly in [https://python.org/ Python].
    7 KB (1,139 words) - 06:27, 27 May 2021
  • ...ms to give an overview of the ''quality'' of various translators available in the Apertium platform. These measures are used in the table below:
    9 KB (1,233 words) - 09:10, 21 November 2021
  • (in this example, I use eng as language resp. eng-deu as pair) Word 'one' not found in the dictionary.
    1 KB (165 words) - 14:16, 28 August 2016
  • ...SourceForge repository and does not exist on GitHub. If you are interested in migrating this tool to GitHub, see [[Migrating tools to GitHub]].''' We could also weight these numbers by number of unique forms in the pardef; if the verb pardef has 6 unique forms and then noun only 3, the
    4 KB (583 words) - 15:18, 10 January 2022
  • ...<code>Local</code> inside your home directory (your home directory will be in <code>/Users/<your user name></code>). This is slightly more complicated th Also, put this in your ~/.bashrc so you don't have to paste it in every time you open a new terminal:
    2 KB (305 words) - 19:35, 12 May 2019
  • ...out:'' Insert the url to the repository, and click on "OK". The repository in the image is the whole source tree (see [[using SVN]]), if you want to chec Image:Tortoise svn commit 1.JPG|''Commit:'' Right click in the directory where you have the repository or module checked out and click
    2 KB (329 words) - 22:04, 7 March 2018
  • ...nce languages]], with '''&lt;j/&gt;''' in the monodix, or compound nominal in [[North Germanic languages]]) are split into two at the '''+''' sign: Multiwords with inner inflection (using the '''&lt;g/&gt;''' in monodix) get the uninflected part, the stuff after the '''#''' sign, moved
    1 KB (220 words) - 14:37, 7 October 2014
  • ...counting only the '''unique''' entries (by stem and continuation lexicon) in each one. # After processing the entire file, lookup the Root lexicon in the stored data and iterate through its pointers
    2 KB (250 words) - 02:42, 10 March 2018
  • ...n experts may still do "pre-post-editing"</ref> and native-speaker fluency in the target language.<ref>http://www.mt-archive.info/10/AMTA-2014-W2-Sanchez ...ng.chi13.pdf The efficacy of human post-editing for language translation]. In Proceedings of CHI. 2013, 439-448.</ref> compared to unassisted translation
    2 KB (266 words) - 11:32, 9 February 2015
  • * '''apertium-desmediawiki/-remediawiki''' in the apertium package is installed by default, but only handles the basics * '''apertium-mediawiki''' -- a newer work-in-progress format handler that handles links better (see below)
    2 KB (240 words) - 10:32, 6 October 2014
  • [[Building dictionaries|In English]] * [http://ogden.basic-english.org/words.html Anglais de base Ogden] (850 mots)
    11 KB (1,852 words) - 07:04, 8 October 2014
  • ...tml-tools. The 13th Conference of The Association for Machine Translation in the Americas: Workshop on Technologies for MT of Low Resource Languages (Lo
    4 KB (435 words) - 16:26, 1 July 2018
  • and install libvoikko from git as shown in [[/Manual_compilation#Libvoikko]] to get it to notice your dictionaries. Or * Click on the fourth tab in status bar to change language
    3 KB (387 words) - 12:21, 26 September 2016
  • You may also have to put <pre>export LD_LIBRARY_PATH=/usr/local/lib</pre> in your ~/.bashrc * Click on the fourth tab in status bar to change language
    4 KB (492 words) - 02:54, 10 March 2018
  • ...d out with the case as it is in the dictionary, not with the case as it is in the superficial form. The <code>-e</code> option is for dynamic [[decompoun [[Category:Documentation]]
    4 KB (647 words) - 07:45, 8 October 2014
  • Copy one of the existing <code>.json</code> files in <code>trunk/apertium-tools/apertium-html-tools/assets/strings/</code>, and * <code>x</code> is the language code you are localising in.
    984 bytes (147 words) - 11:57, 26 September 2016
  • [[Tagger training|In English]] ...ings of TMI, The Tenth Conference on Theoretical and Methodological Issues in Machine Translation'', pp. 135&mdash;144
    8 KB (1,173 words) - 08:20, 8 October 2014
  • In this example, [https://svn.code.sf.net/p/apertium/svn/trunk/apertium-en-ca Text is tagged with PoS (Part of Speech) tags. This helps in translation of text as during translation, words may be translated to other
    2 KB (263 words) - 17:08, 26 September 2016
  • ...ough a language pair only released for one direction might only be trimmed in that direction.</ref> ...gual dictionary entry, it will be output as the analysed lemma with an '@' in front, e.g. "children" will be output as "@child", and worse: A multiword l
    4 KB (679 words) - 16:06, 3 May 2020
  • ...giving some success stories of Apertium language pairs that were developed in whole, or partly with volunteer help. [[Category:Documentation]]
    279 bytes (38 words) - 12:11, 26 September 2016
  • ...x, and may be of interest to Apertium developers and visitors of this Wiki in general. [[Category: Documentation]]
    458 bytes (71 words) - 08:34, 8 October 2014
  • .../*$^/*$^/*$</nowiki></code> and on and on, then you probably have newlines in your <code><nowiki><alphabet></nowiki></code>. Try at first just using an e [[Category:Documentation]]
    405 bytes (55 words) - 12:04, 26 September 2016
  • search for libicu / libicu-dev / icu in your favorite package manager or follow the procedure from [http://beta.vis [[Category:Documentation]]
    1 KB (148 words) - 07:28, 8 October 2014
  • (The traffic can get high, so make a filter in your mail client.) [[Category:Documentation]]
    246 bytes (37 words) - 07:43, 3 December 2015
  • Web stuff, including wordpress is in <code>/var/www/</code>. [[Category:Documentation]]
    689 bytes (99 words) - 11:48, 24 March 2012
  • In the Ubuntu for Windows terminal: [[Category:Documentation]]
    2 KB (279 words) - 20:52, 2 April 2021
  • [[Making a release|In English]] bug #3 in Bugzilla.
    3 KB (527 words) - 07:18, 8 October 2014
  • [[Lexical selection|In English]] * [[Lexical selection in target language]]
    1 KB (210 words) - 07:18, 8 October 2014
  • [[Apertium going SOA|In English]] ...serveurs Apertium peuvent être exposées par un service de traduction dans in service existant orienté infrastructure IT; dans ce cas, l'acteur (un méd
    8 KB (1,197 words) - 14:51, 7 October 2014
  • [[Target-language tagger training|In English]] Calculated parameter will be saved in files with the name 'corpus.N.prob'
    12 KB (1,625 words) - 08:20, 8 October 2014
  • ...st <code>touch</code> these files, they may be empty. Or you can fill them in. [[Category:Documentation]]
    972 bytes (144 words) - 12:09, 26 September 2016
  • You can do transliteration in lttoolbox using a <code>.dix</code> file and <code>lt-proc -t</code>. This [[Category:Documentation]]
    2 KB (426 words) - 12:19, 26 September 2016
  • ...Denmark. Jacob Nordfalk: Maŝintradukado ([http://vimeo.com/54075259 video in Esperanto]) [[Category:Documentation]]
    2 KB (214 words) - 11:51, 26 September 2016
  • [[List of language pairs|In English]] ...de [http://wiki.apertium.org/wiki/Category:Documentation_en_fran%C3%A7ais documentation en français] qui décrivent en détail le fonctionnement d'Apertium.
    40 KB (5,154 words) - 08:38, 11 May 2018
  • [[Compiling dictionaries|In English]] [[Category:Documentation]]
    2 KB (304 words) - 07:05, 8 October 2014
  • [[Using SVN with TortoiseSVN|In English]] [[Category:Documentation]]
    2 KB (344 words) - 08:43, 8 October 2014
  • [[Apertium-pretransfer|In English]] [[Category:Documentation]]
    2 KB (258 words) - 14:51, 7 October 2014
  • [[ICU|In English]] [[Category:Documentation]]
    1 KB (141 words) - 07:18, 8 October 2014
  • [[Category:Documentation]] [[Category:Documentation in English]]
    1 KB (202 words) - 21:24, 13 March 2017
  • [[Part-of-speech tagging|In English]] [[Category:Documentation]]
    14 KB (1,992 words) - 07:52, 8 October 2014
  • [[Initial consonant mutation|In English]] [[Category:Documentation]]
    7 KB (1,126 words) - 07:18, 8 October 2014
  • [[Category:Documentation]] [[Category:Documentation in English]]
    707 bytes (103 words) - 04:57, 27 April 2018
  • [[Using SVN|In English]] [[Category:Documentation]]
    7 KB (1,102 words) - 08:43, 8 October 2014
  • [[Modes|In English]] [[Category:Documentation]]
    3 KB (390 words) - 12:02, 6 April 2017
  • [[Category:Documentation]] [[Category:Documentation in English]]
    567 bytes (82 words) - 17:17, 26 September 2016
  • [[Syntactic labels|In English]] ...ises uniformes utilisées dans plusieurs projets [http://giellatekno.uit.no/english.html Giellatekno]. Il n'est pas nécessaire d'implémenter toutes les analy
    7 KB (983 words) - 08:20, 8 October 2014
  • [[Letter case handling|In English]] [[Category:Documentation]]
    5 KB (884 words) - 07:18, 8 October 2014
  • [[Apertium New Language Pair HOWTO|In English]] ...étapes de l'installation. Pour plus d'informations, veuillez consulter la documentation sur le site du projet Apertium.
    18 KB (2,869 words) - 07:17, 28 November 2015
  • [[Finding_errors_in_dictionaries|In English]] [[Category:Documentation]]
    7 KB (1,144 words) - 14:00, 15 March 2015
  • [[Placeholder attributes|In English]] [[Category:Documentation]]
    2 KB (332 words) - 07:52, 8 October 2014
  • [[Translating man pages|In English]] [http://new.linuxfocus.org/Francais/November2003/article309.shtml Documentation sur le format des pages de man]
    5 KB (877 words) - 11:49, 15 June 2018
  • [[Category:Documentation]] [[Category:Documentation in English]]
    464 bytes (53 words) - 11:49, 24 March 2012
  • [[Category:Documentation]] [[Category:Documentation in English]]
    411 bytes (60 words) - 12:38, 24 January 2017
  • [[Calculating coverage|In English]] [[Category:Documentation]]
    4 KB (621 words) - 07:05, 8 October 2014
  • {{Otherlang|Speeding up monodix creation|In English}} [[Category:Documentation]]
    3 KB (445 words) - 15:35, 26 January 2020
  • [[Listing Apertium element using command-line|In English]] [[Category:Documentation]]
    9 KB (1,525 words) - 21:32, 17 February 2019
  • [[Category:Documentation]] [[Category:Documentation in English]]
    1 KB (160 words) - 07:45, 8 October 2014
  • [[Constraint Grammar|In English]] * [http://beta.visl.sdu.dk/cg3.html VISL CG-3 Development Information] + documentation and downloads
    3 KB (368 words) - 07:05, 8 October 2014
  • [[Analysing Finnish text|In English]] [[Category:Documentation]]
    4 KB (538 words) - 14:51, 7 October 2014
  • [[Apertium and Constraint Grammar|In English]] [[Category:Documentation]]
    8 KB (1,211 words) - 23:02, 4 April 2021
  • [[Category:Documentation]] [[Category:Documentation in English]]
    1 KB (229 words) - 19:35, 12 May 2019
  • [[Preparing to use apertium-transfer-tools|In English]] [[Category:Documentation]]
    7 KB (1,170 words) - 08:20, 8 October 2014
  • [[A long introduction to transfer rules|In English]] [[Category:Documentation]]
    27 KB (4,440 words) - 14:51, 7 October 2014
  • {{Otherlang|Frequently Asked Questions|In English}} [[Category:Documentation]]
    4 KB (565 words) - 06:27, 27 May 2021
  • [[Translating gettext|In English]] ...nt le texte déjà traduit. Voir docs/user/lingo.docbook pour une excellente documentation.
    4 KB (656 words) - 09:57, 6 October 2014
  • [[Mixed modes|In English]] [[Category:Documentation]]
    2 KB (238 words) - 07:51, 8 October 2014
  • [[Preparing data for Moses factored training using Apertium|In English]] [[Category:Documentation]]
    5 KB (699 words) - 07:52, 8 October 2014
  • [[Translating wikimedia|In English]] .../code.google.com/p/apertium-mediawiki/ , ces pages comportent encore de la documentation.
    2 KB (249 words) - 10:37, 6 October 2014
  • [[Ubuntu NLP repository|In English]] [[Category: Documentation]]
    490 bytes (82 words) - 08:42, 8 October 2014
  • [[Pivot translation|In English]] [[Category:Documentation]]
    2 KB (236 words) - 07:52, 8 October 2014
  • ...ation location. A location without special characters (eg. a space or non-English characters) is recommended. ...ygwin setup will give you the option to create desktop icon and a shortcut in the Start Menu. The installer will create a Start Menu shortcut for Apertiu
    9 KB (1,367 words) - 09:17, 26 May 2021
  • ;1. The girl with the telescope shouted at the boy who saw the dog in the field. ...rl] [with] [the telescope] [shouted] [at] [the boy] [who] [saw] [the dog] [in] [the field]
    7 KB (1,079 words) - 13:58, 7 October 2014
  • Now e.g. the clean-corpus and train-model scripts referred to below will be in ~/smt/mosesdecoder/scripts/training/clean-corpus-n.perl ...h the whole process step by step, you can use the Makefile script provided in the [[#Makefile|last section]] of this page.
    15 KB (2,206 words) - 13:58, 7 October 2014
  • In these lines, the mnemonic (starting on the first column) must be kept uncha ...for lttoolbox, apertium, and the language pairs, the installation is done in <code>/usr/local/bin</code> and <code>/usr/local/share</code>. If you wish
    5 KB (789 words) - 12:16, 15 June 2018
  • ...ose which are not used. It works on both normal entries as well as entries in paradigms. ...heck entries in both directions. This tool has been successfully used on [[English and Esperanto]] to make the eo-en direction. Contact me (--[[User:Jacob Nor
    5 KB (882 words) - 13:33, 6 December 2019
  • ...y have problems getting Apertium to run on Windows or OS X, or are limited in bandwidth, [[User:Tino_Didriksen|Tino Didriksen]] has put together a ready- ...#apertium as nick ap-vbox (appending -2 or -3 on conflict), preconfigured in UTF-8 mode
    6 KB (868 words) - 12:55, 1 April 2024
  • Note: If you are using regular expressions (e.g. in <code>is-en</code> for proper name recognition) it might be an idea to disa for i in `cat $1 | sed 's/\t/;/g' | sed 's/ /_/g'`; do
    9 KB (1,470 words) - 11:28, 24 March 2012
  • * [[apertium-mlt-ara]] in trunk: https://svn.code.sf.net/p/apertium/svn/trunk/apertium-mlt-ara * [[apertium-ara-heb]] in incubator: https://svn.code.sf.net/p/apertium/svn/incubator/apertium-ara-he
    3 KB (437 words) - 10:23, 21 November 2021
  • ...uage pair is installed, so Apertium can find it, whatever directory you're in. The <code>-d .</code> means "use the language data in this directory".
    2 KB (368 words) - 06:02, 24 April 2017
  • ;English ...urce A website that hopes to speak the language of freely available data], in The Guardian.
    13 KB (1,689 words) - 21:42, 28 February 2021
  • ...emma-paradigm pairs for Apertium [[monodix|monodices]] easy. It is similar in principle to the "expanded" format of [[lttoolbox]], but with the advantage So for example in English we might see some noun inflection represented as:
    1 KB (204 words) - 09:55, 6 October 2014
  • ...ction''' is the task of choosing the best translation of a polysemous word in a sentence. One way of doing this is to use the sentence context to determi ...rom a paper dictionary for some words in Catalan and their translations in English. Note the proper name ''Estats Units'', multiword ''L'estament mèdic'', po
    1 KB (225 words) - 11:40, 24 March 2012
  • ...m-OmegaT_screenshot1.png|thumb|350px|right|Sample project translating from English to Spanish]] '''Apertium-OmegaT''' is a plug-in for [http://www.omegat.org/en/omegat.html OmegaT] built on top of [[lttoolb
    3 KB (413 words) - 07:43, 10 May 2018
  • ...the whole process step by step, you can use the Makefile scripts provided in the [[#Makefiles|last section]] of this page. We're going to do the example with EuroParl and the English to Spanish pair in Apertium.
    12 KB (1,634 words) - 18:26, 26 September 2016
  • ''&lt;!-- element in bilingual dictionary es-ca --> ''&lt;!-- element in bilingual dictionary ca-fr -->
    2 KB (256 words) - 09:28, 6 October 2014
  • ...word (we assume we already know the part of speech). This can be important in machine translation between less-closely related languages. The problem was :Little John was looking for his toy box. Finally he found it. The box was in the pen.
    2 KB (292 words) - 10:38, 7 September 2012
  • ...special way (don't know how to do that now :P), but keeping the entry for documentation ...to english, not english to bengali, besides we need corresponding entries in en monodix
    31 KB (5,625 words) - 04:28, 19 August 2011
  • ...slating from Spanish to English and have a word which we want to translate in two ways depending on context. If it is followed by the name of a person, o ...e is used in [[apertium-sme-nob]] and [[apertium-is-en]], see files ending in <code>.lex</code>
    7 KB (1,057 words) - 21:07, 1 December 2013
  • ...kage) to get cheap bilingual dictionaries from any language pair available in [http://www.omegawiki.org OmegaWiki] database. # install it locally in your computer (some detailed instructions in [http://www.omegawiki.org/Development www.omegawiki.org/Development])
    2 KB (202 words) - 00:55, 24 January 2018
  • ...stood by the speaker reflect specific patterns and regularities in the way in which words are formed from smaller units and how those smaller units inter [[Category:Documentation in English]]
    787 bytes (118 words) - 01:29, 13 April 2020
  • Click on “View in the Mac App Store”, and once the Mac App Store opens, install XCode. Once ...h has the list of all stable language packs, followed by the language code in brackets. Enter the language code you want to install into the script, with
    5 KB (759 words) - 19:35, 12 May 2019
  • '''Evaluation''' can give you some idea as to how well a language pair works in practice. There are many ways to evaluate, and the test chosen should depen .../apertium-eval-translator apertium-eval-translator.pl] is a script written in that calculates the word error rate (WER) and the position-independent word
    6 KB (981 words) - 09:13, 21 November 2021
  • * '''sl''': source language (for example, in morphological and bilingual dictionaries) * '''tl''': target language (for example, in bilingual dictionaries)
    8 KB (902 words) - 09:19, 6 October 2014
  • ...s.php?ID=5863 yaourt] installed, installing a language pair like Esperanto-English is simply a matter of This will install both apertium, lttoolbox and apertium-eo-en in the /usr prefix. You can of course use your favourite AUR installation meth
    1 KB (184 words) - 02:50, 9 March 2018
  • ...inantly spoken by the Tamil people of India and Sri Lanka. It is currently in Apertium-Incubator. In [[incubator]]:
    718 bytes (84 words) - 20:31, 25 January 2020
  • * <s>When would one call the bilingual dictionary? Apertium Level 2 calls it in the first stage.</s> ::The English → Urdu translation system linked [[Specific resources per language#Urdu|h
    2 KB (236 words) - 21:21, 2 October 2013
  • {{{Otherlang|North Germanic languages|In English}}} | Islandais || || || [[Icelandic and English]]
    5 KB (725 words) - 23:26, 15 March 2020
  • [[Dictionary coverage|In English]] L'outil a été utilisé sur la paire [[English and Esperanto|anglais et esperanto]] pour faire la direction eo-en. Contact
    5 KB (855 words) - 08:55, 6 October 2014

View (previous 250 | next 250) (20 | 50 | 100 | 250 | 500)