Search results

Jump to navigation Jump to search
  • ...tion of each module with more precision. They may also introduce technical language which linguists and/or computer coders would use. The technical description References to 'xxx' and 'yyy' refer to a language code, for example 'en-es'; 'English' to 'Spanish'.
    29 KB (4,687 words) - 16:28, 5 June 2020
  • ...translation, being a sub-field of NLP, enables to explore the grammar of a language and deal with it from a computational perspective. I really like the idea o ...vated community like Apertium's it is possible to accomplish such numerous language-pair translation. The project supports both widely spoken languages and min
    11 KB (1,672 words) - 20:56, 9 April 2010
  • ...of any language in Russia in areas smaller than the Federal Subjects. The data is in Russian and comes from the official 2010 Russian Census website. Here are the steps to access the data:
    2 KB (296 words) - 21:12, 13 January 2018
  • <li>Theory of Language (Phonetics, Morphology, Syntax, Semantics)</li> <li>Language Diversity and Typology</li>
    13 KB (2,187 words) - 09:56, 23 March 2018
  • * Apertium allows one, as a language lover, to work very closely on a language both from a linguistic and a computational perspective. I, being a Research * ''Interested Task: '' Add a new variety to an existing language
    9 KB (1,397 words) - 15:31, 2 April 2024
  • ...//d3js.org/ D3.js] tool that depicts all Apertium [[list of language pairs|language pairs]] in an interactive graph initially developed sometime before the [[G === Updating language data by scraping ===
    5 KB (702 words) - 01:34, 9 December 2018
  • === Language pairs === .../github.com/apertium/apertium-urd-hin?files=1 apertium-urd-hin] Linguistic data for the Apertium Urdu-Hindi machine translator
    6 KB (806 words) - 00:45, 7 December 2018
  • '''Apertium New Language Pair HOWTO''' ...rtium machine translation system from scratch. You can check the [[list of language pairs]] that have already been started.
    36 KB (5,933 words) - 16:14, 22 February 2021
  • ...language is a very complicated system, and translating a sentence from one language to another has always been a challenging task. Meanwhile, the translation n ...are approximations to the real structural rules in a language system, many language pairs have proved a high translation quality. So hopefully, we can expect t
    21 KB (3,340 words) - 10:56, 28 May 2018
  • ...periments and develops all the possible methods, tools and technologies of data processing in natural languages (NLP). Therefore, it can be viewed as some ...pporting mainly the European languages; first of all they are focused on a language pair including English. Besides, those systems are not open, which makes it
    10 KB (1,535 words) - 09:26, 5 April 2012
  • I am master student majoring in Natural Language Processing, and I like many tasks of this area. The machine translation is ...the third case (she'll or shell) the translation is ambiguous. The Trigram Language Model can help to resolve the ambiguity. I think that there are few similar
    14 KB (2,151 words) - 13:35, 21 March 2014
  • I am master student majoring in Natural Language Processing, and I like many tasks of this area. The machine translation is ...the third case (she'll or shell) the translation is ambiguous. The Trigram Language Model can help to resolve the ambiguity. I think that there are few similar
    14 KB (2,151 words) - 16:14, 21 March 2014
  • The number of language pairs in development for Apertium is increasing, and so is the complexity o language pairs. With better tools, more people will be able to develop language pairs.
    29 KB (4,382 words) - 07:53, 6 October 2019
  • <b>Native Language:</b> Bodo ...carried out. Rule-based translation system provides opportunities to write data and linguistic rules of languages.
    9 KB (1,205 words) - 04:13, 2 April 2024
  • ...he implementation of the algorithms must be free/open-source, but also the data themselves. Nowadays, there are many machine translation packages of this t ...morphologically rich languages, which even with large corpora suffer from data sparseness.
    6 KB (905 words) - 17:26, 18 October 2010
  • ...-supervised.make this one] from en-eo. You will need modify it to fit your language pair. This usually means editing the first few lines. ===Tagger data directory===
    3 KB (537 words) - 13:44, 18 June 2014
  • '''GSOC 2019 : Light alternative format for all XML files in an Apertium language pair'''[http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code#Light ...expression to its referring entity. This is an important task for natural language.
    9 KB (1,494 words) - 00:52, 20 April 2019
  • * linguistic data for the engine * tools for creating/learning/managing/serving linguistic data
    5 KB (733 words) - 03:00, 10 August 2015
  • |Language You will need to install NLTK and NLTK data. Unfortunately, they both only support Python versions 2.6-2.7. If you are
    14 KB (2,232 words) - 12:51, 26 September 2018
  • ...esource languages gives the speakers of those languages access to valuable data and can help in several domains, such as education, news, judiciary, etc. T ...ecause of the level of understanding it provides instead of simply blaming data for poor results, it actually shows that it can perform better for low reso
    27 KB (4,091 words) - 19:32, 4 June 2020
  • <li> Morphology, Syntax, Semantics, Typology/Language Diversity </li> ...ngines. I think rule-based translation very promising if we provide enough data and an effective analysis </p>
    9 KB (1,413 words) - 11:49, 6 May 2018
  • 2.2. Data Stream without format. ...s code1 and code2. The note about regression tests would be removed if the language pair has none of course.
    7 KB (799 words) - 06:26, 27 May 2021
  • ...uide on how to use a development version of Apertium to make a change in a language pair. ...ou should try this to make sure things work before you move on to whatever language pair you plan on working on.
    10 KB (1,626 words) - 17:46, 13 January 2020
  • ...http://wiki.apertium.org/wiki/Mandarin_Chinese#In_Apertium some linguistic data in Apertium]. ...fers to the most commonly spoken form of Chinese that is the sole official language of China and Taiwan. It is also known as Putonghua or Standard Chinese ([[W
    16 KB (2,148 words) - 03:28, 16 December 2015
  • ...mpire, as did all Romance languages. There are currently 4 released French language pairs ...the sixth most spoken language in the world and is the second most studied language worldwide.
    15 KB (2,081 words) - 07:14, 12 August 2020
  • ...hich are mostly commercial. But Apertium is an open-source and uses single language-independent specification, to allow for the ease of contributing to Apertiu ...n work properly and Suggestion styling if people choose an unlikely source language
    13 KB (2,022 words) - 18:40, 27 March 2018
  • ...MT based on corpora: adding new languages ​​is very easy. To create a new language pair, in fact, it is not necessary to include corpora with millions of word ...airs can be added by creating dictionaries and rules containing linguistic data in XML format.
    15 KB (2,339 words) - 00:41, 4 June 2018
  • ...translation makes you notice these little differences by making the cross-language variation explicit. ...of the field to refine the results of the automatically produced data. All data is organized in the XML which is humanly readable and editable. I believe i
    11 KB (1,778 words) - 17:20, 26 April 2011
  • ...orced to limit our knowledge due to localization issues. We use different language,characters to express our selves in day to day life. To tackle these issues ...a programmer, and last and foremost I am interested on learning linguistic data processing where Apertium is great application to start with.
    6 KB (977 words) - 07:49, 20 April 2010
  • ...ind that are incorrectly translated, to getting involved in creating a new language pair or programming on tools or user interfaces. Here are some question fre Our language agnostic tools are native and written in [https://en.wikipedia.org/wiki/C++
    7 KB (1,139 words) - 06:27, 27 May 2021
  • ...are basically for Anel, Aizhan and Assem who have started to develop this language pair... And Aida too... === Download apertium, lttoolbox and eng-kaz data from SVN ===
    20 KB (2,856 words) - 06:26, 27 May 2021
  • ...ll these language pairs. This means that the data can be re-used by other language projects (e.g. in developing spelling or grammar checkers, thesauri, etc). This project was accepted as part of our "adopt a language pair" idea
    12 KB (1,917 words) - 15:54, 12 September 2009
  • program. My courses so far include formal languages, data structures and ...t system used internally at SuSE, I was working on the workflow definition language and the core workflow engine.<br />http://swamp.sf.net <br />
    15 KB (2,372 words) - 19:57, 8 April 2010
  • == Linguistic data == * At least one state-of-the-art language pair (wrt. Google) using all available modules.
    3 KB (447 words) - 12:12, 27 June 2020
  • *'''langpair''': language pair to use for translation curl -G --data "langpair=eng|spa&q=run" http://localhost:2737/dictionaryLookup
    5 KB (712 words) - 21:27, 16 August 2016
  • ...ollected a sample of 2000 tweets to analyze the common patterns, some chat data, and also made a literature survey to check for types of non-standard input sample data : https://docs.google.com/document/d/1fGFO6V-lKcvqgzaQRfxEfLWGXF6AxqTIODKda
    5 KB (817 words) - 21:23, 14 March 2014
  • ...appear at the beginning of a sentence. The unique thing about the persian language though, is that they use prepositions which is quite uncommon in many SOV l ...designed a Two-sided morphology analyst of nouns and adjectives in Persian language, using Xerox Finite State Technology as giving input word (adjective or nou
    16 KB (2,597 words) - 20:58, 12 January 2013
  • ...ect (2009) titled '''''Conversion of Anubadok: Creating an English Bengali Language Pair''''' under Apertium. The project was a great experience for me. I had ...ing offered by Apertium will have far reaching effect in the local Bengali Language adoption and localization of open source softwares.
    16 KB (2,533 words) - 01:16, 10 April 2010
  • ...achine translation can aid a lot of these problems and breaking the &ldquo;language barrier&rdquo; across not just the country and the globe and connect people The project would bring a lot of developers at ease. Python is a high-level language with a lot of features that make it easier to grasp for developers. Python
    15 KB (2,338 words) - 14:45, 27 March 2018
  • 2. en-es language pair(for experiment) https://svn.code.sf.net/p/apertium/svn/branches/aper 3. es-ca language pair(for experiment) https://svn.code.sf.net/p/apertium/svn/branches/aper
    14 KB (1,896 words) - 08:43, 7 October 2013
  • * Apertium language pairs .../engine of Apertium installed (including the requirement lttoolbox, but no language pairs yet).
    9 KB (1,367 words) - 09:17, 26 May 2021
  • = Google Summer of Code 2012 Application - adopting a new language pair fr-ro = ...next year's final thesis I will be working on a project related to natural language processing and pattern recognition.
    8 KB (1,170 words) - 00:36, 13 April 2012
  • ...en source project; it is also a very welcoming family of collaborators and language enthusiasts. After successfully participating in GSoC 2017 with Apertium an I am interested in upgrading several language pairs to ease future development and bring one of them (Romanian-Catalan) t
    11 KB (1,500 words) - 15:44, 30 April 2018
  • ...of the main five data files in any language pair (see also: [[Apertium New Language Pair HOWTO]]). ....dix'' where ''apertium-A-B'' is the name of the [[List of language pairs| language pair]]. For example file ''apertium-af-nl.af-nl.dix'' is the bilingual dict
    7 KB (1,244 words) - 16:41, 17 March 2018
  • ...getting new contributors to Apertium and to helping spread our passion for language technology. ...of other things, live in our '''[[subversion|svn repo]]'''. The language data is found in the following places:
    7 KB (1,091 words) - 19:54, 12 April 2021
  • ...olving the antecedent of the anaphors in text becomes essential in several language pairs. ...ge it to the correct anaphor''' using a macro in the transfer rules of the language pair. (t1x)
    20 KB (3,107 words) - 21:13, 24 June 2022
  • ...e bilingual dictionary, collocations (n-grams) are extracted from a source language corpus. * Translations are scored on a target language corpus. -- The target language model training corpora would need to be preprocessed in some cases, to, for
    51 KB (7,047 words) - 08:49, 9 June 2011
  • ...nders , specially for Indian Languages because we still do not have enough data ...oreign languages. I am specially interested in MT systems where the source language is English and the target languages are Indian Languages. It is impossible
    6 KB (923 words) - 17:57, 3 April 2010
  • First, make a directory called <code><lang>-tagger-data</code>. Put your corpus into there with a name like <code><lang>.crp.txt</c ...cifies how to generate the probability file. You can grab one from another language package. For <code>apertium-en-af</code> I took the Makefile from <code>ape
    7 KB (1,177 words) - 08:34, 8 October 2014
  • '''Track:''' Data Science Dynamic Language Interpreter implementation
    8 KB (1,094 words) - 13:10, 14 April 2019
  • ...the stream, and in the future one can add any amount of information in the language models or the translation modules. Later you can see how this formalism loo ...ynamic. All current pipes will continue to work as-is, unmodified. All old data and files remain valid.'''
    24 KB (4,167 words) - 09:20, 17 July 2020
  • ==Install language module== A language module supporting spelling may be installed, either from our repository, or
    3 KB (387 words) - 12:21, 26 September 2016
  • ...ion of machine translation. The tasks consist of sentences in the original language, reference translation with keywords omitted and the machine translation of ...various { gap } in order to discover phenomena and patterns in the natural language.
    9 KB (1,368 words) - 09:04, 23 April 2015
  • ...y in Egypt. Recently I have been granted a scholarship to study masters in data science at Innopolis University in Russia.<br /> ...subjects I loved the most were artificial intelligence, machine learning, data mining and deep learning, and that's because of the great potential in the
    18 KB (2,903 words) - 22:18, 8 April 2019
  • You should now have the core/engine of Apertium installed (but no language pairs yet). == Apertium Language Pairs Installation ==
    5 KB (822 words) - 04:50, 23 November 2011
  • ...icient digital data, parallel, corpora, etc. to work well for alternative, data-hungry approaches. ...loper can track it down and fix it, instead of simply blaming bad training data.
    9 KB (1,296 words) - 17:42, 4 April 2023
  • ...modular, documented, open platform for machine translation and other human language processing tasks * To favour the interchange and reuse of existing linguistic data.
    15 KB (2,462 words) - 16:57, 31 January 2019
  • ...: Undergraduate Researcher at Jadavpur University specialising in Natural Language Processing<br> Professional Interests : Natural language Processing ,Computational Linguistics , Sentiment Analysis , Statistical a
    16 KB (2,215 words) - 14:49, 31 March 2020
  • ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.
    12 KB (1,464 words) - 12:00, 31 January 2012
  • ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.
    11 KB (1,519 words) - 06:51, 11 May 2013
  • ...duce translations which are less fluent, but more preserving of the source language meaning. ...er and number between a determiner and head noun will remain in the target language output.
    11 KB (1,519 words) - 18:27, 16 October 2015
  • More convincing if you have a language pair on the computer somewhere :) ...this should work for both packaged and compiled Apertium. Without language data you can't see a translation, but you can see the help. Try,
    2 KB (368 words) - 06:02, 24 April 2017
  • I am interested in NLP, data mining, math as well system design. I love to do competitive challenges as ...stic diversity, simply because endangered languages don’t offer sufficient data.
    11 KB (1,787 words) - 17:11, 13 April 2021
  • ...probably try this to make sure things work before you move on to whatever language pair you plan on working on. Note that some existing language pairs have external dependencies, like HFST or Constraint Grammar. The [[In
    10 KB (1,715 words) - 12:29, 28 May 2018
  • ...tended to show how you can make an "indirect" contribution, by documenting language resources, helping us to build bilingual test sets, translating, promoting, ...first language, and translate them to the other. A translation in a third language may be useful in enlisting help, but is not required.
    9 KB (1,494 words) - 05:58, 18 March 2015
  • ...effort and can be accessed in a place where there are no speakers of this language. The challenge in itself is very interesting and its application can be see ...t from the past 6 months. I could relate to the project- “Bring a released language pair up to state-of-the-art quality (Hindi-English)”. I have been working
    10 KB (1,482 words) - 22:05, 21 May 2014
  • ...ed translation, morphological analysis, natural language processing, human language technologies ...Spanish–Catalan) but which has been expanded to deal with more divergent language pairs (such as English-Catalan and even Basque→English). The platform pro
    10 KB (1,500 words) - 16:23, 18 February 2016
  • ...probably just search for, tick off and install Apertium and your favorite language pairs in Synaptic. There's a friendly [https://help.ubuntu.com/community/Sy Step 2: '''Download apertium, lttoolbox and language pairs from SVN.'''
    3 KB (475 words) - 16:28, 27 April 2017
  • '''apertium-get''' is a little script to fetch and compile language data, with monolingual dependencies, from Github. ...d and compiled by just going to the directory where you want your language data to be, and running
    2 KB (317 words) - 20:45, 23 March 2019
  • =Apertium Summer of Code application: 
New Maltese-Hebrew language pair= ...and inter-connected world, the spread of information is still limited by a language barrier.<br/>
    13 KB (2,014 words) - 20:05, 4 June 2011
  • File:Altai-alphabet-pronunciation-writing-system omniglot-com.png
    ==== Altai Language Resources ==== Crúbadán language data for Southern Altai. Kevin Scannell. 2015. The Crúbadán Project. oai:cruba
    (588 × 481 (23 KB)) - 07:30, 5 December 2017
  • ==== Data preparation ==== There were three attempts to extract postediting operations for each language pair: with threshold = 0.8 and -m, -M = (1, 3).
    7 KB (1,033 words) - 15:27, 15 August 2018
  • <li>- 4: preprocessing : dictionary data needs some changes to be used in a graph, this step prepares it for further ...recommends what languages will be the most efficient to enrich particular language pair</li>
    19 KB (2,541 words) - 15:44, 12 August 2018
  • ...tium is a free/open-source platform for rule-based machine translation and language technology which is aimed providing support for lesser-resourced and margin ...t lets the user to input a URL, choose a source language and a destination language and translate the webpage. This feature has been successfully completed as
    16 KB (2,280 words) - 01:40, 8 March 2018
  • ...ence if one learns to create good tools for MT, they learn most of Natural Language Processing. A tool which is rule-based and open source really helps the community with language pairs that are resource- poor and gives them free translations for their ne
    6 KB (984 words) - 15:20, 21 March 2020
  • ...ence if one learns to create good tools for MT, they learn most of Natural Language Processing. A tool which is rule-based and open source really helps the community with language pairs that are resource- poor and gives them free translations for their ne
    7 KB (1,078 words) - 07:16, 28 March 2020
  • ...d was exposed to different languages. This led to me being fascinated with language translation and I wanted to contribute to help in making communication easi I am going to work on “ Adopt an unreleased language pair: Hindi - Telugu”. I want to get the pair released in both the direct
    9 KB (1,387 words) - 16:27, 31 March 2020
  • ...d was exposed to different languages. This led to me being fascinated with language translation and I wanted to contribute to help in making communication easi I am going to work on “ Adopt an unreleased language pair: Hindi - Telugu”. I want to get the pair released in both the direct
    9 KB (1,391 words) - 16:41, 31 March 2020
  • == Language data packages == If you've installed tools with install-nightly.sh, you can install language data with
    4 KB (665 words) - 11:57, 18 November 2022
  • == Morphological Analyzer of Braj Language == ...language, and I am interested in making of morphological analyzer for Braj Language. And this rise my more interest in NLP, Machine Learning etc.
    6 KB (870 words) - 18:37, 25 March 2019
  • == Morphological Analyzer of Braj Language == ...language, and I am interested in making of morphological analyzer for Braj Language. And this rise my more interest in NLP, Machine Learning etc.
    6 KB (873 words) - 18:25, 27 March 2019
  • == Morphological Analyzer of Braj Language == ...language, and I am interested in making of morphological analyzer for Braj Language. And this rise my more interest in NLP, Machine Learning etc.
    6 KB (870 words) - 18:42, 25 March 2019
  • ...translation system that is far from perfection has much to offer society. Language barriers are as big an issue today as they have ever been in hindering frui ...or creating new Apertium language pairs. And more immediately, the sme-nob language pair in the incubator will no longer require pipeline hacks to coerce the c
    16 KB (2,502 words) - 19:03, 8 April 2010
  • ...um project is a project which works on open-source machine translation and language technology. We try and focus our efforts on lesser-resourced and marginalis ...versitat d'Alacant] (Alacant, Spain) and [http://www.prompsit.com Prompsit Language Engineering].
    10 KB (1,543 words) - 19:50, 12 April 2021
  • ...f language pairs that may be used to infer new entries for existing or new language pairs using graphs. ...a graph and relevant information is stated about them. The cloud of linked data is intended to be navigated by software agents primarily. In the case of Ap
    3 KB (452 words) - 19:50, 24 March 2020
  • Some sort of language model that takes a list of LUs and dependency relations and determines the Given a syntactic parser for one language and a fairly small parallel corpus it seems like it should be possible to l
    6 KB (972 words) - 18:06, 23 December 2022
  • ...ments and most people have to face the challenge of communicating in other language that their own. So the question is, who is not interested in having access ...resources (I provide) to analyze Chinese and generate Spanish (well, this language is already in Apertium). Then, I will work with an statistical system + pos
    11 KB (1,666 words) - 05:22, 13 May 2013
  • ...oject goal is to create a machine translation package for Sicilian-Spanish language pair on the base of Apertium’s machine translation system. This project i ...he Sicilian dictionary was the abundance of spelling forms in the Sicilian language. For instance, one Sicilian verb with the meaning 'to join' can have the fo
    9 KB (1,370 words) - 13:58, 23 August 2016
  • '''Develop a prototype MT system for Kazakh - Uzbek language pair''' ...ntribute to the platform by extending the list of language pairs my native language - Uzbek has so far.
    6 KB (854 words) - 15:35, 20 April 2021
  • ...tasks is the construction of efficient lexical processors from linguistic data. ...xical forms involves drawing correspondences between a lexical form in one language, and the translation in another. This final operation is crucial in constru
    18 KB (2,967 words) - 19:24, 11 December 2012
  • Qualification: Major in Natural Language Processing 2015: Awardee of graduates’ competition „Natural Language Processing” (''a competition for students hold by National Research Unive
    11 KB (1,652 words) - 15:56, 24 March 2016
  • ...language particularly suitable for various reasons. First, because it is a language in process of standardization, so both the linguistic resources (written do ...he near future, it will be possible to operate in the translation of other language pairs as Sardinian-Catalan and Sardinian-Spanish.
    7 KB (1,110 words) - 11:34, 23 August 2016
  • ...ovide an accurate, universal automated translation engine and accompanying language-specific datasets. ...venience and necessity required by increasingly globalised communication. Language extinction is occurring all around the world, including my native Australia
    9 KB (1,302 words) - 04:11, 8 April 2011
  • ====Rewrite language.php file as an abstract script, and interface modules for Apertium, Aspell "language.php" has been separate in 2 parts : environment management and translation
    32 KB (4,699 words) - 16:32, 19 August 2011
  • ...ulture.Following the principles of preserving culture and heritage through language, Apertium connects both ancient and modern through advances in machine tran ...plays an important role out here.Meeting different people with different language makes it difficult to communicate but it is overcome'd by the translation t
    9 KB (1,415 words) - 00:29, 29 March 2020
  • ...declarative language. A good intro would be to look through [[Apertium New Language Pair HOWTO]], see also [[Contributing to an existing pair]]. If the pair ha #* If there is no translation, translate it into the languages of your language pair first.
    6 KB (1,024 words) - 15:22, 20 April 2021
  • I have an urge to improve open source language translation with Apertium. ...a very noble goal, which is bringing languages with low resource language data to life by linking them with machine translation of high resource languages
    4 KB (630 words) - 11:39, 8 April 2019
  • ...elps people from whole world to understand and get information in foreign language very quickly and easy. Building machine translation systems is very interes I interested in task “Adopt an unreleased language pair”, as a language pair I choose is Kazakh-English, in Kazakh-English translation direction. I
    9 KB (1,088 words) - 19:28, 24 March 2016
  • ...rs independent free-software developers. There are currently 40 published language pairs within the project (including a number of "firsts" — for example Sp natural language processing, machine translation, grammar, python, c++, linguistics, languag
    7 KB (1,111 words) - 10:10, 15 November 2015
  • ...rently one of more successful translation endeavors—and while it lacks the data and traffic that is available to Google Translate, it stands out from corpo ...s; and (3) providing support for discontiguous multiwords in some existing language pairs. See work plan for details.
    9 KB (1,382 words) - 20:53, 3 April 2017

View (previous 100 | next 100) (20 | 50 | 100 | 250 | 500)