Task ideas for Google Code-in

From Apertium
Revision as of 21:08, 19 October 2010 by Francis Tyers (talk | contribs)
Jump to navigation Jump to search

This is the task ideas page for Google Code-in, here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.

Task list

Area Difficulty Title Description People
CODE 2. Medium Cross a language pair Take two language pairs, use apertium-crossdics and clean up the resulting bilingual dictionary. For instance, build a dictionary for Occitan-French from Occitan-Catalan and Catalan-French. Francis Tyers
code 1. Hard Convert existing resource Take an existing linguistic resource and adapt it to be used in Apertium. For example, take a morphological analyser for Punjabi in Functional Morphology and convert it to lttoolbox. Francis Tyers
quality 3. Easy Quality evaluation Perform a human post-edition evaluation of one of our non-evaluated pairs. This will involve taking some free text (e.g. from Wikipedia or Wikinews), running it through the translator and then altering the output to be correct. Then using apertium-eval-translator to calculate the Word Error Rate. Francis Tyers
research 3. Easy Catalogue resources Pick an under-resourced language (e.g. Chechen, Guaraní, Aromanian, Chuvash, Swazi, ...) and catalogue all the available resources (grammatical descriptions, wordlists, dictionaries, spellcheckers, papers, corpora, etc.) for it along with the licences they are under. Francis Tyers
quality 1. Hard Improve a language pair Find some faults in an existing language pair and fix them. In particular minor→major pairs, e.g. Welsh-English, Basque-Spanish, Breton-French. Francis Tyers
translation 2. Medium Translate the HOWTO Translate the new language pair HOWTO into another language, and go through it for a new pair of languages. When finished, upload to the Incubator. Francis Tyers
documentation 2. Medium Document undocumented features Find a feature that can't be found in the existing documentation (e.g. cascaded interchunk transfer), and write about it.
outreach 3. Easy Translate Wikipedia Some smaller Wikipedia's could really benefit from having someone translate articles from bigger Wikipedias. And where the languages are close, using Apertium could make this more efficient (e.g. Bulgarian WP has 107,355 articles, Macedonian WP has 42,112, less than half as many. Candidates should check with the local Wikipedia community before proceeding.
training 3. Easy Simple step-by-step "become a developer" guide Write a simple step-by-step guide (on the wiki) for pre-university students (of varying levels of computer literacy) to install a development version of Apertium and start doing development or polishing tasks like the ones above.
code 2. Medium NSIS script Write an NSIS script to install the Cygwin version of Apertium on Windows.
user interface 1. Hard Design a user-friendly interface for Apertium Apertium does not currently have a friendly user interface for translators. Look at other translation software on the market, and sketch out some ideas for how to design a user interface. This will not require programming, but could, for example involve using Glade to demonstrate the ideas.


  1. Outreach Writing a quick guide on 'What Apertium can and cannot do to help you with your homework'.