Search results

Jump to navigation Jump to search
  • ...er hundreds, of thousands of words) to 'train' the automatic taggers found in some Apertium language pairs. Getting the right tag for a word is important This is why we have many hand-tagging tasks in the Google Code-In.
    2 KB (294 words) - 09:47, 12 November 2017
  • ...e an Apertium language pair state-of-the-art, or close to state-of-the-art in terms of translation quality. This will involve improving coverage to 95-98 * Find a language pair of your choice in Apertium and install it. (see [[Install language data by compiling]])
    2 KB (383 words) - 19:46, 2 March 2023
  • ...ing) purposes. The evaluation described would measure how helpful they are in the task. Starting from files containing sentences in the source language and reference translations, generate tests for human ev
    1 KB (207 words) - 09:03, 23 April 2015
  • At the moment it is not possible to define cyclical paths in [[lttoolbox]]'s XML-based transducer format. The idea of this project is to .../code> element]] in any analysis, meaning there can be no <code>#</code>'s in the actual cycles.
    1 KB (158 words) - 07:23, 2 September 2014
  • ...roviding suggestions. Our [[lttoolbox]] based transducers should be usable in the same way. Additionally, we have the beginnings of a spell checking inte * create clean Makefile rules for speller compilation that are usable in our monolingual modules
    2 KB (282 words) - 13:47, 17 March 2016
  • This Tutorial is prepared by Jatin Luthra in Google Code-In. In case you don't have slapt-get installed already do the following in order to install it:
    1 KB (200 words) - 02:56, 9 March 2018
  • ...most frequent unknown words''' (words in the source document which are not in the dictionary). See below for information about how to do this. Note: th ...nolingual dictionary''' (the appropriate <code>.dix</code> or <code>.lexc</code> file) so that they are not unknown anymore. Make sure to categorise stems
    2 KB (299 words) - 19:44, 30 December 2019
  • Daniel Huang, Google Code In 2012 <br /> ...w.azeri.org/Azeri/az_learn/az_socio/socio_index.html Learning Azerbaijani in Social Context]
    5 KB (601 words) - 11:58, 25 October 2019
  • ...orrect translation in the relevant context. You'll want to write 10 rules in all. [[Category:Tasks for Google Code-in|Add lexical-selection rules]]
    1 KB (199 words) - 21:39, 15 December 2019
  • ...c.) detect the 50 most frequent unknown words (source words which are not in the dictionaries of the language pair). [[Category:Tasks for Google Code-in|Add words]]
    2 KB (271 words) - 05:34, 17 December 2015
  • The objective of this task is to take (or make) a dictionary in text format and extract the ''paradigm sketches'' from it. By this we mean ...them into word categories. The suffixes and categories should be described in the dictionary.
    2 KB (151 words) - 23:20, 14 November 2013
  • ...t). e.g. 1-best would have only the morphological analyses which are found in the 1-best parse tree. ...the morphological analysis of a word from the PGF library and print it out in Apertium format. (Build the library from source; the one on hackage has a f
    2 KB (243 words) - 11:24, 13 March 2015
  • ** Put in template .lsx files * Add about 6 entries in each direction
    343 bytes (51 words) - 18:40, 29 January 2018
  • ...ub.com/apertium/apertium-python apertium] via a <code>setup.py</code> file in a Windows environment. *# Make <tt>show()</tt> return a list of tuples (in, out, in, out)
    1 KB (191 words) - 21:12, 19 March 2019
  • ...you know the the target language, and the target language has good support in LanguageTool (Catalan is one that has support from both Apertium and Langua ...words, you might need to add a multiword so that they translate correctly in that context
    1 KB (186 words) - 09:43, 4 November 2014
  • The equivalent in [[speling format]] would be: Where <code>n.f</code> means "noun, feminine" (this information will also typically be on the Wik
    2 KB (214 words) - 12:10, 26 May 2023
  • ...ord units, for example ''dirección general'' and ''zračna luka''. Although in the Romance languages it is not a big problem, as soon as you start to get Default translation of "ambulantni" would be "ambulatory", but in this case we want to translate "ambulantnih pacijenata" as "outpatients"
    2 KB (256 words) - 23:56, 5 April 2013
  • ...panish--Breton. This task would be to develop it to work with any language in our SVN and fix problems with processing and displaying non-standard HTML. * Make it optionally read in dictionaries in [[lttoolbox]] format.
    2 KB (248 words) - 19:31, 24 February 2014
  • ...errors (the translation is not adequate because the part-of-speech tagger in Apertium has selected the wrong morphological analysis for a word that had ...nd write 5 constraint grammar rules that select the desired part of speech in the relevant context(s);
    1 KB (193 words) - 14:27, 29 October 2013
  • ...rors''' (the translation is not adequate because the part-of-speech tagger in Apertium has selected the wrong morphological analysis for a word that had ...rite 10 constraint grammar rules''' that select the desired part of speech in the relevant context(s);
    1 KB (156 words) - 02:19, 21 October 2018

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)