Search results

Jump to navigation Jump to search

Page title matches

  • The language detection in the [[simple-html]] interface currently uses a 2.9M javascript file. The ob ===Implement language detection in apertium-apy===
    1 KB (205 words) - 20:49, 13 November 2013
  • ...of a sentence using GNU Bison. The output could be text, or a a <code>dot</code> file using GraphViz. * [https://svn.code.sf.net/p/apertium/svn/branches/transfer4 An example grammar]
    375 bytes (59 words) - 16:11, 14 November 2013
  • Here are some top-tips for working with Apertium in the [[Google Code-in]]: ...GitHub] account. We use [[git]] and GitHub for collaboratively developing code.
    518 bytes (88 words) - 08:11, 16 September 2018
  • ...ing (local agreement, gender, number, etc. is inadequate, local word order in a phrase is inadequate, there is a word too much or a word missing, etc.). ...text in L₂ through the pair and find a consistent error in the output text in L₁ that isn't grammatical.
    1 KB (208 words) - 21:39, 15 December 2019
  • ...tokenise sentences in South and East Asian languages into words. Sentences in these languages are usually not written with spaces to show word boundaries ...l the possible ways of splitting up the sentence into words that are found in the dictionary:
    3 KB (394 words) - 01:37, 17 June 2023
  • ...North Sámi--Norwegian Bokmål and Kazakh-Tatar among others), and many more in development. ;Why would you organisation like to participate in Google Code-in 2014?*
    6 KB (987 words) - 10:21, 7 November 2014
  • ...numbers, by inserting the special symbol &lt;n&gt; in place of the number in the transducer; at runtime, when this symbol is encountered, numbers are co ...project is to extend lt-tmxproc to include the regular expressions support in lttoolbox.
    2 KB (332 words) - 19:55, 24 March 2020
  • ;Why does your organisation want to participate in Google Code-in 2018? ...that focuses a lot on marginalised languages. GCI gives us a chance to get in touch with the next generation of speakers, and to show them how they can h
    3 KB (443 words) - 11:20, 11 September 2018
  • ...c.) detect the 50 most frequent unknown words (source words which are not in the dictionaries of the language pair). [[Category:Tasks for Google Code-in|Add words]]
    2 KB (271 words) - 05:34, 17 December 2015
  • ...e variation is possible and useful, systematising it, and then enabling it in the language pair by turning hard restrictions into ambiguity and selectors * initial documentation of possible preferences in a pair of your choice (which doesn't already have preferences enabled)
    2 KB (239 words) - 09:33, 4 March 2024
  • ...s page for Google Code-in 2013 (http://www.google-melange.com/gci/homepage/google/gci2013), here you can find ideas on interesting tasks that will improve yo '''For current GCI task ideas, see [[Task ideas for Google Code-in]]'''
    68 KB (10,323 words) - 15:37, 25 October 2014
  • ...ptimisations to the lexical selection module. The lexical selection module in Apertium is currently a prototype. There are many optimisations that could * Do proper processing of tags in all scripts.
    1 KB (186 words) - 18:06, 22 March 2013
  • ...o are able to mentor Apertium-related tasks are eligible to be Google Code-In mentors for Apertium. This can include: ...ive community, but if you don't have any experience with Apertium tools or code, please don't bother asking to be a mentor after it's already been announce
    2 KB (345 words) - 13:19, 14 November 2019
  • ! Part-of-speech !! Code | Noun || <code>n</code>
    3 KB (286 words) - 22:00, 8 December 2019
  • At the moment it is not possible to define cyclical paths in [[lttoolbox]]'s XML-based transducer format. The idea of this project is to .../code> element]] in any analysis, meaning there can be no <code>#</code>'s in the actual cycles.
    1 KB (158 words) - 07:23, 2 September 2014
  • In this page we describe how to morphologically disambiguate (tag) text so tha Example of a tagger error in the English tagger for the sentence "Where do you come from?":
    2 KB (228 words) - 12:16, 26 September 2016
  • ...most frequent unknown words''' (words in the source document which are not in the dictionary). See below for information about how to do this. Note: th ...nolingual dictionary''' (the appropriate <code>.dix</code> or <code>.lexc</code> file) so that they are not unknown anymore. Make sure to categorise stems
    2 KB (299 words) - 19:44, 30 December 2019
  • ...requency. The lemma of a word is it's "base form" (the form you might find in a dictionary) ...t. Work from top to bottom. After each asterisk '<code><nowiki>*</nowiki></code>' you should replace the surface form with the lemma.
    2 KB (207 words) - 16:21, 14 November 2013
  • Flag diacritics are a method used in the [[HFST]] tools to allow the writer of a transducer to exclude impossibl Some work on [[Flag diacritics]] has already been made in [[lttoolbox-java]].
    1 KB (176 words) - 06:40, 20 October 2014
  • [[Category:Tasks for Google Code-in|Unigram tagger]]
    255 bytes (43 words) - 14:56, 26 October 2014

Page text matches

  • ...as page for Google Code-in 2011(http://www.google-melange.com/gci/homepage/google/gci2011); here you can find ideas on interesting tasks that will improve yo <b>For current GCI ideas, see [[Ideas for Google Code-in]]</b>
    187 KB (21,006 words) - 22:14, 12 November 2012
  • ...s page for Google Code-in 2013 (http://www.google-melange.com/gci/homepage/google/gci2013), here you can find ideas on interesting tasks that will improve yo '''For current GCI task ideas, see [[Task ideas for Google Code-in]]'''
    68 KB (10,323 words) - 15:37, 25 October 2014
  • ...ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowl The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum
    32 KB (4,862 words) - 06:23, 5 December 2019
  • ...s page for Google Code-In 2012 (http://www.google-melange.com/gci/homepage/google/gci2012), here you can find ideas on interesting tasks that will improve yo '''For current GCI task ideas, see [[Task ideas for Google Code-in]]'''
    14 KB (2,007 words) - 03:06, 27 October 2013
  • ...system. Finally, most of them are not available for most of the languages in the world, as they rely heavily on resources that are not available for the == Why does your org want to participate in Google Summer of Code? ==
    8 KB (1,230 words) - 06:02, 5 February 2019
  • ...system. Finally, most of them are not available for most of the languages in the world, as they rely heavily on resources that are available for only a == Why does your org want to participate in Google Summer of Code? ==
    8 KB (1,248 words) - 15:51, 17 February 2021
  • ...North Sámi–Norwegian Bokmål and Kazakh–Tatar among others), and many more in development. ;Why would you organisation like to participate in Google Code-in 2015?*
    7 KB (1,111 words) - 10:10, 15 November 2015
  • (Second draft in by [[User:Francis Tyers|Francis Tyers]] 15:34, 28 October 2013 (UTC)) ...ation engine and auxiliary tools is being developed around the world, both in universities and companies (e.g. Prompsit Language Engineering) and by a gr
    6 KB (1,057 words) - 15:34, 28 October 2013
  • ...North Sámi--Norwegian Bokmål and Kazakh-Tatar among others), and many more in development. ;Why would you organisation like to participate in Google Code-in 2014?*
    6 KB (987 words) - 10:21, 7 November 2014
  • plugin are in [https://github.com/goavki/phenny/blob/master/modules/git.py modules/git.py receiving updates about commits from sites like GitHub and Bitbucket. In
    8 KB (1,370 words) - 21:21, 22 November 2018
  • == Why is it you are interested in machine translation? == ...ciolinguist working on language maintenance and shift. I'm very interested in creating resources for minoritised languages.
    16 KB (2,285 words) - 06:46, 12 April 2019
  • ..., we might still be interested if we can turn it into something achievable in 3 months. ...ourself familiar with testvoc and other quality controls, and factor those in. If you know of any breaks or absences beforehand, mention them and plan ar
    10 KB (1,500 words) - 16:23, 18 February 2016
  • ;Google+ URL ...the downdown above, please summarise your involvement in Google Summer of Code and the successes and challenges of your participation. Please also list yo
    8 KB (1,240 words) - 12:03, 20 February 2015
  • ...system. Finally, most of them are not available for most of the languages in the world, as they rely heavily on resources that are available for only a === Why does your org want to participate in Google Summer of Code? ===
    10 KB (1,480 words) - 07:00, 23 February 2021
  • ...e Summer of Code. The ideas page can be found [[Ideas for Google Summer of Code|here]]. ...g language data, translation engine and auxiliary tools is being developed in several universities and companies around the world, with the principal par
    10 KB (1,543 words) - 19:50, 12 April 2021
  • ...on engine and auxiliary tools is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but also i ...Occitan, Breton—French, and Basque—Spanish among others), and several more in development.
    3 KB (424 words) - 19:24, 29 October 2010
  • ...on engine and auxiliary tools is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but indepe ...ton&mdash;French, and Basque&mdash;Spanish among others), and several more in development.
    11 KB (1,680 words) - 12:22, 20 June 2019
  • ...st-google-summer.html The Apertium Project's First Google Summer of Code], Google Open Source Blog ...urce A website that hopes to speak the language of freely available data], in The Guardian.
    13 KB (1,689 words) - 21:42, 28 February 2021
  • ;Why is your organisation applying to participate in Google Summer of Code 2014? What do you hope to gain by participating?* * Apertium likes Google Summer of Code: it is a programme that supports open-source as much as we do!
    7 KB (1,212 words) - 20:10, 4 February 2014
  • ...on engine and auxiliary tools is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), but indepe ...Occitan, Breton—French, and Basque—Spanish among others), and several more in development.
    9 KB (1,376 words) - 15:24, 22 March 2013

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)