Difference between revisions of "Task ideas for Google Code-in"
Jump to navigation
Jump to search
Hectoralos (talk | contribs) |
|||
Line 341: | Line 341: | ||
|align=center| {{sc|quality}} || 1. Hard || Fix some tagger errors in Swedish->Danish || [[apertium-sv-da]] could be improved with a [[Constraint Grammar]]. Find 10 sentences that get wrong translations due to tagging, and write CG rules to fix them. The student should have good knowledge of Swedish, or at least some Scandinavian language. || 8–12 ||[[User:Unhammer|Unhammer]] |
|align=center| {{sc|quality}} || 1. Hard || Fix some tagger errors in Swedish->Danish || [[apertium-sv-da]] could be improved with a [[Constraint Grammar]]. Find 10 sentences that get wrong translations due to tagging, and write CG rules to fix them. The student should have good knowledge of Swedish, or at least some Scandinavian language. || 8–12 ||[[User:Unhammer|Unhammer]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Swedish-Danish || Add 50 nouns you feel are missing in translations from [[apertium-sv-da|Swedish to Danish]]. || 3–6 || [[User:Jacob Nordfalk|Jacob Nordfalk]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve English-Esperanto || Add 50 words you feel are missing in translations from [[apertium-eo-en|English to Esperanto]]. || 3–6 || [[User:Jacob Nordfalk|Jacob Nordfalk]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Spanish-Esperanto || Add 50 words you feel are missing in translations from [[apertium-eo-es|Spanish to Esperanto]]. || 3–6 || [[User:Hectoralos|Hectoralos]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Catalan-Esperanto || Add 50 words you feel are missing in translations from [[apertium-eo-ca|Catalan to Esperanto]]. || 3–6 || [[User:Hectoralos|Hectoralos]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve French-Esperanto || Add 50 words you feel are missing in translations from [[apertium-eo-fr|French to Esperanto]]. || 3–6 || [[User:Hectoralos|Hectoralos]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Spanish-Aragonese || Add 50 words you feel are missing in translations from [[apertium-eo-ca|Aragonese to Spanish]]. || 3–6 || [[User:Juanpabl|Juan Pablo Martínez]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Afrikaans-Dutch || Add 50 words you feel are missing in translations from [[apertium-af-nl|Afrikaans-Dutch]]. A list of unknown words can be provided.|| 3–6 || [[User:AureiAnimus|Pim Otte]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Afrikaans-Dutch tests || Finish the Afrikaans-Dutch pending tests and move the passing tests to a seperate page for regression testing. || 3–6 || [[User:AureiAnimus|Pim Otte]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Improve Irish-Manx Gaelic coverage || I can provide a list of the most common Irish words not covered by the bilingual dictionary, and their English translations. Manx translations needed for these. || 3–6 || [[User:Kevin Scannell|Kevin Scannell]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Add gender information to Manx dictionary || Most of the nouns in the Manx dictionary have gender information in place - look up and add any that are missing. || 3–6 || [[User:Kevin Scannell|Kevin Scannell]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 3. Easy || Proofread Albanian analyser || We have a morphological analyser for Albanian, but it has been written by a non-native speaker and needs to be checked. || 6–10 || [[User:Francis Tyers|Francis Tyers]] |
||
|- |
|- |
||
|align=center| {{sc|translation}} || |
|align=center| {{sc|translation}} || 3. Easy || Proofread Catalan--Sardian dictionary || Go through the Catalan--Sardinian dictionary and check the entries, there are only around a thousand or so. || 1–2 || [[User:Francis Tyers|Francis Tyers]] |
||
|- |
|- |
||
|align=center| {{sc|quality}} || |
|align=center| {{sc|quality}} || 2. Medium || Improve Spanish-Aragonese coverage || Create a corpus from Aragonese Wikipedia (with all the articles which do not have the template <nowiki>{{Grafía_87}}</nowiki> (or equivalently, those which are not in category [http://an.wikipedia.org/wiki/Categor%C3%ADa:Articlos_con_graf%C3%ADa_87]). Then, add the top 50-100 frequently used words which are not covered in the Apertium es-an dictionaries.|| 6–10 || [[User:Juanpabl|Juan Pablo Martínez]] |
||
|- |
|- |
||
|align=center| {{sc|code}} || |
|align=center| {{sc|code}} || 2. Medium || Add toponyms to the Spanish-Aragonese dictionaries || Extract from wikipedia the names in Aragonese for countries in the world, their capital cities, main Spanish cities and municipalities in Aragon, and add them to the es-an dictionaries.|| 6–10 || [[User:Juanpabl|Juan Pablo Martínez]] |
||
|- |
|- |
||
|align=center| {{sc|code}} || |
|align=center| {{sc|code}} || 2. Medium || Update the Apertium TinyLex J2ME apps for one language pair || Update Apertium TinyLex J2ME packages (http://www.tinylex.com/) to contain the most recent versions of dictionaries for one language pair || 4–6 per package || [[User:mlforcada|Mikel L. Forcada]] |
||
|- |
|- |
||
|align=center| {{sc|code}} || |
|align=center| {{sc|code}} || 2. Medium || Create an Apertium TinyLex J2ME app for a new language pair || Create an Apertium TinyLex J2ME package (http://www.tinylex.com/) from an existing Apertium language pair for a language pair not offered yet in Tinylex || 6–10 per package || [[User:mlforcada|Mikel L. Forcada]] |
||
|} |
|} |
||
Revision as of 19:22, 7 November 2011
This is the task ideas page for Google Code-in (http://code.google.com/gci), here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.
The people column lists people who you should get in contact with to request further information. The time column gives the minimum estimated amount of time that should be spent on the task. It does not include time taken to install / set up apertium.
Если ты не понимаешь английский язык, или ты предпочитать работать о русском языке или о языках России, смотри: Task ideas for Google Code-in/Russian
Task list
Area | Difficulty | Title | Description | Time (hours) |
People |
---|---|---|---|---|---|
code | 1. Hard | Convert existing resource: Urdu morphological analyser | Take Muhammad Humayoun's Urdu Morphology and convert to lttoolbox format. | 8–10 | Francis Tyers |
code | 1. Hard | Convert existing resource: Punjabi morphological analyser | Take Muhammad Humayoun's Punjabi Morphology and convert to lttoolbox format. | 8–10 | Francis Tyers |
code | 1. Hard | Convert existing resource: Kurdish morphological analyser | Take the Alexina Kurdish Morphology and convert to lttoolbox format. | 8–10 | Francis Tyers |
code | 3. Easy | Convert existing resource: Reta Vortaro Belarusian-Esperanto | Take the Belarusian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: Reta Vortaro Breton-Esperanto | Take the Breton-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jacob_Nordfalk |
code | 3. Easy | Convert existing resource: Reta Vortaro Bulgarian-Esperanto | Take the Bulgarian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Czech-Esperanto | Take the Czech-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: Reta Vortaro Dutch-Esperanto | Take the Dutch-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Finnish-Esperanto | Take the Finnish-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro German-Esperanto | Take the German-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Greek-Esperanto | Take the Greek-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jacob_Nordfalk |
code | 3. Easy | Convert existing resource: Reta Vortaro Hebrew-Esperanto | Take the Hebrew-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jacob_Nordfalk |
code | 3. Easy | Convert existing resource: Reta Vortaro Hungarian-Esperanto | Take the Hungarian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jacob_Nordfalk |
code | 3. Easy | Convert existing resource: Reta Vortaro Italian-Esperanto | Take the Italian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Persian-Esperanto | Take the Persian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Portuguese-Esperanto | Take the Portuguese-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Polish-Esperanto | Take the Polish-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Portuguese-Esperanto | Take the Portuguese-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Hectoralos |
code | 3. Easy | Convert existing resource: Reta Vortaro Russian-Esperanto | Take the Russian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: Reta Vortaro Slovakian-Esperanto | Take the Slovakian-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: Reta Vortaro Swedish-Esperanto | Take the Swedish-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jacob_Nordfalk |
code | 3. Easy | Convert existing resource: Reta Vortaro Turkish-Esperanto | Take the Turkish-Esperanto lexicon and convert to lttoolbox format. | 2–4 | Jimregan |
code | 2. Medium | Convert Apertium resources: nn-nb for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Nynorsk-Bokmal dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: es-ca for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Spanish-Catalan dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: is-en for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Icelandic-English dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: es-ast for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Asturian-Spanish dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: oc-ca for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Occitan-Catalan dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: mk-bg for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Macedonian-Bulgarian dictionary. | 2–4 | Piotr Bański |
code | 2. Medium | Convert Apertium resources: mk-en for Freedict | Apertium's lexicons would make an excellent start for bilingual dictionaries. FreeDict currently has no Macedonian-English dictionary. | 2–4 | Piotr Bański |
code | 3. Easy | Convert existing resource: English-Slovakian dictionary | Take MSAS/MASS and convert to lttoolbox format. | 1–4 | Zdenko Podobný |
code | 2. Medium | Convert existing resource: Slovakian morphological analyser | Take the morphological analyser distributed with LanguageTool and convert to lttoolbox format. | 1–4 | Zdenko Podobný |
code | 3. Easy | Convert existing resource: FreeDict afr-deu | Take the Freedict afr-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict ckb-kmr | Take the Freedict ckb-kmr dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict dan-eng | Take the Freedict dan-eng dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-ell | Take the Freedict eng-ell dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-hin | Take the Freedict eng-hin dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-hrv | Take the Freedict eng-hrv dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-pol | Take the Freedict eng-pol dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-rom | Take the Freedict eng-rom dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-rus | Take the Freedict eng-rus dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict gla-deu | Take the Freedict gla-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict hrv-eng | Take the Freedict hrv-eng dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict jpn-deu | Take the Freedict jpn-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict kha-deu | Take the Freedict kha-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict kha-eng | Take the Freedict kha-eng dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict kur-eng | Take the Freedict kur-eng dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict kur-tur | Take the Freedict kur-tur dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict lat-deu | Take the Freedict lat-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict san-deu | Take the Freedict san-deu dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict slo-eng | Take the Freedict slo-eng dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict swh-pol | Take the Freedict swh-pol dictionary and convert to lttoolbox format. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict ara-eng and eng-ara | Take the Freedict ara-eng and eng-ara dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict bre-fra and fra-bre | Take the Freedict bre-fra and fra-bre dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-fra and fra-deu | Take the Freedict deu-fra and fra-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-ita and ita-deu | Take the Freedict deu-ita and ita-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-kur and kur-deu | Take the Freedict deu-kur and kur-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-nld and nld-deu | Take the Freedict deu-nld and nld-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-por and por-deu | Take the Freedict deu-por and por-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict deu-tur and tur-deu | Take the Freedict deu-tur and tur-deu dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-afr and afr-eng | Take the Freedict eng-afr and afr-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-cym and cym-eng | Take the Freedict eng-cym and cym-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-cze and ces-eng | Take the Freedict eng-cze and ces-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-deu and deu-eng | Take the Freedict eng-deu and deu-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-fra and fra-eng | Take the Freedict eng-fra and fra-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-gle and gle-eng | Take the Freedict eng-gle and gle-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-hun and hun-eng | Take the Freedict eng-hun and hun-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-ita and ita-eng | Take the Freedict eng-ita and ita-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-lat and lat-eng | Take the Freedict eng-lat and lat-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-lit and lit-eng | Take the Freedict eng-lit and lit-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-nld and nld-eng | Take the Freedict eng-nld and nld-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-por and por-eng | Take the Freedict eng-por and por-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-scr and scr-eng | Take the Freedict eng-scr and scr-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-spa and spa-eng | Take the Freedict eng-spa and spa-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-swa and swa-eng | Take the Freedict eng-swa and swa-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-swe and swe-eng | Take the Freedict eng-swe and swe-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-tur and tur-eng | Take the Freedict eng-tur and tur-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict eng-wel and wel-eng | Take the Freedict eng-wel and wel-eng dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict fra-nld and nld-fra | Take the Freedict fra-nld and nld-fra dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 3. Easy | Convert existing resource: FreeDict gle-pol and pol-gle | Take the Freedict gle-pol and pol-gle dictionaries, convert to lttoolbox format, and merge them. | 2–4 | Jimregan |
code | 2. Medium | Convert existing resource: Polish-Slovakian transfer rules | Much of the existing rules in Apertium's pl-cs system originated in pl-sk. Take the new rules in pl-cs and apply them to pl-sk. No knowledge of Polish, Slovakian, or Czech is required, though it will help | 1–4 | Zdenko Podobný |
outreach | 3. Easy | Apertium on Macedonian Wikipedia | Bulgarian WP has 107,355 articles, Macedonian WP has 42,112, less than half as many. Translate some articles from Bulgarian Wikipedia to Macedonian Wikipedia using Apertium, and then postedit them. Explain to the local Wikipedia community what you are doing beforehand. | 1–4 | Francis Tyers |
outreach | 3. Easy | Apertium on Occitan Wikipedia | Catalan WP has 290,059 articles, Occitan WP has 22,579, less than a tenth as many. Translate some articles from Catalan Wikipedia to Occitan Wikipedia using Apertium, and then postedit them. Explain to the local Wikipedia community what you are doing beforehand. | 1–4 | Francis Tyers |
outreach | 3. Easy | Apertium on Asturian Wikipedia | Spanish WP has 663,567 articles, Asturian WP has 13,869, almost a fiftieth as few. Translate some articles from Spanish Wikipedia to Asturian Wikipedia using Apertium, and then postedit them. Explain to the local Wikipedia community what you are doing beforehand. | 1–4 | Francis Tyers |
outreach | 3. Medium | LUG Flyer | Design a flyer that briefly explains Apertium, suitable for handing out at Linux User Group meetings | 1–4 | Jimregan |
outreach | 3. Medium | School Flyer | Design a flyer that briefly explains Apertium, suitable for handing out at your school | 1–4 | Jimregan |
quality | 3. Easy | Thorough checkup of bn-en morphological analyser | While the current bn-en morphological analyser has a pretty good coverage, it should have been higher. Part of the reason is that a lot of verbs have one/two slight different surface forms that differ from the regular ones and the analyser misses them. Using lt-expand it's possible to generate all forms of the verbs, then manually check these and using another script (already in the pair) rebuild the analyser file. This checking will require a native speaker/expert on Bengali language | 2–4 | Abu Zaher |
code | 2. Medium | Dixtools: TEI export | Take the code from Dix2CC.java or Dix2Tiny.java and adapt to export TEI P5 format dictionaries, suitable for FreeDict. This project is suitable for someone interested in learning Java. | 2–4 | Jimregan |
research | 2. Medium | Contrastive analysis: Macedonian and Albanian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Macedonian and Albanian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Francis Tyers |
research | 2. Medium | Contrastive analysis: Kurdish and Persian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Kurdish and Persian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Francis Tyers |
research | 2. Medium | Contrastive analysis: Hindi and Urdu | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Hindu and Urdu. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Francis Tyers |
research | 2. Medium | Contrastive analysis: Finnish and Estonian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Finnish and Estonian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Francis Tyers |
research | 2. Medium | Contrastive analysis: Spanish and Italian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Spanish and Italian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Gramirez |
research | 2. Medium | Contrastive analysis: Catalan and Sardinian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Catalan and Sardinian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Francis Tyers |
research | 2. Medium | Contrastive analysis: Italian and Sardinian | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Italian and Sardinian. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Deadbeef |
research | 2. Medium | Contrastive analysis: Belorussian and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Belorussian and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Breton and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Breton and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Bulgarian and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Bulgarian and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Czech and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Czech and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Dutch and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Dutch and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: German and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between German and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Greek and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Greek and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Italian and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Italian and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Persian and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Persian and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Polish and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Polish and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Portuguese and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Portuguese and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Russian and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Russian and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Slovak and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Slovak and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
research | 2. Medium | Contrastive analysis: Swedish and Esperanto | Create a set of test sentences (see various pages of 'Pending tests' and 'Regression tests' on the Wiki) for translation between Swedish and Esperanto. The tests should cover as many features of the languages as possible. Some of the examples might be able to be found in a grammar, others might need to be invented. This will not involve programming, only grammatical analysis. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Чувашский и русский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между чувашским и русским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Чувашский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между чувашским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Чувашский и башкирский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между чувашским и башкирским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Чувашский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между чувашским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Башкирский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между башкирским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Башкирский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между башкирским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Якутский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между якутским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Якутский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между якутским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Кумыкский и ногайский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между кумыкским и ногайским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Кумыкский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между кумыкским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Кумыкский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между кумыкским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Карачаево-балкарский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между карачаево-балкарским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Карачаево-балкарский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между карачаево-балкарским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Тувинский и хакасский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между тувинским и хакасским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Тувинский и татарский языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между тувинским и татарским языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
исследование | 2. Нормальное | Противопоставление: Тувинский и турецкий языки | Создать множество тестовых фраз (смотреть различные страницы 'Pending tests' и 'Regression tests' в Вики) для перевода между тувинским и турецким языками. Тесты должны содержать как можно больше черт языков. Некоторые из примеров можно найти в грамматике, другие могут быть придуманы. Это не будет включать в себя программирование, только грамматический анализ. | 4–6 | Hectoralos |
research | 3. Easy | Catalogue resources: Aromanian | Catalogue all the available resources (grammatical descriptions, wordlists, dictionaries, spellcheckers, papers, corpora, etc.) for Aromanian, along with the licences they are under. | Francis Tyers | |
translation | 2. Medium | Translate the HOWTO: Norwegian | Translate the new language pair HOWTO into Nynorsk. | 5–8 | Unhammer |
translation | 2. Medium | Translate the HOWTO: Dutch | Translate the new language pair HOWTO into Dutch. | 5–8 | Pim Otte |
translation | 2. Medium | Translate the HOWTO: Aragonese | Translate the new language pair HOWTO into Aragonese. | 5–8 | Juan Pablo Martínez |
translation | 2. Medium | Translate the HOWTO: Turkish | Translate the new language pair HOWTO into Turkish. | 5–8 | Zfe |
quality | 3. Easy | Quality evaluation: Spanish and French | Perform a human post-edition evaluation of the Spanish and French language pair. This will involve taking some free text (e.g. from Wikipedia or Wikinews), running it through the translator and then altering the output to be correct. Then using apertium-eval-translator to calculate the Word Error Rate. The minimum amount of text should be 2,000 words. | 4–8 | Francis Tyers |
quality | 3. Easy | Quality evaluation: Spanish and Occitan | Perform a human post-edition evaluation of the Spanish and Occitan language pair. This will involve taking some free text (e.g. from Wikipedia or Wikinews), running it through the translator and then altering the output to be correct. Then using apertium-eval-translator to calculate the Word Error Rate. The minimum amount of text should be 2,000 words. | 4–8 | Mireia Ginestí |
quality | 3. Easy | Quality evaluation: Spanish and Asturian | Perform a human post-edition evaluation of the Spanish and Asturian language pair. This will involve taking some free text (e.g. from Wikipedia or Wikinews), running it through the translator and then altering the output to be correct. Then using apertium-eval-translator to calculate the Word Error Rate. The minimum amount of text should be 2,000 words. | 4–8 | Francis Tyers |
user interface | 2. Medium | Design a user-friendly Glade interface for Apertium | Apertium does not currently have a friendly user interface for translators. Look at other translation software on the market, and sketch out some ideas for how to design a user interface. We don't require an implementation, just the XML-based interface mockup | 2–4 | Jimregan |
user interface | 2. Medium | Design a user-friendly web interface for Apertium | Apertium has a friendly user interface for translators, but more attention needs to be paid to its visual appearance. This will involve either a user interface mockup (preferably using GWT), or a "theme" using CSS for the existing interface. | 2–4 | Jimregan |
user interface | 2. Medium | Design a user-friendly interface for a web-based dictionary management tool. | Apertium does not currently have a friendly user interface for adding new words to the dictionaries. We need someone with a good sense of design to provide us with a mockup for a web interface for managing a dictionary. | 2–4 | Jimregan |
user interface | 2. Medium | Design a user-friendly interface for an Android version of TinyLex | TinyLex is a dictionary tool for J2ME. We would like to port it to Android, but as we are not UI designers, we would prefer it if someone with a sense for visual design took on this task. There are tools available for drawing an interface in XML - it would be better if they were used. | 2–4 | Jimregan |
user interface | 2. Medium | Design a user-friendly Android interface for Apertium | Design a mockup of a GUI for Apertium for Android. We don't run on Android yet, but work is ongoing. We would like some ideas for an interface that makes sense on phones, primarily, but taking the tablet form factor into account is also an option. | 2–4 | Jimregan |
training | 3. Easy | Step-by-step "become a developer" guide | Write a simple step-by-step guide (on the wiki) for pre-university students (of varying levels of computer literacy) to install a development version of Apertium and make a single change in a language pair. This should include everything, from checking out with SVN to requesting committer access on SourceForge. Document everything you do! | 2–3 | Mikel L. Forcada |
training | 3. Easy | Step-by-step "constraint grammar" guide | Write a simple step-by-step guide (on the wiki) for pre-university students (of varying levels of computer literacy) to install Constraint Grammar and fix 5 disambiguation problems in a single sentence, then committing to the incubator. This should include everything, from checking out with SVN to requesting committer access on SourceForge. Document everything you do! | 2–3 | Unhammer |
training | 2. Easy | Basics of grammar guide | Write a basic guide that teaches the basics of grammar, with reference to the part of speech tags used in Apertium | 2–3 | Jimregan |
training | 2. Medium | Moodle course | Design a Moodle-based course for beginning a new language pair. The New Language Pair HOWTO can be used as a guide | 4–6 | Jimregan |
training | 2. Easy | Apertium AWI Screencase | Create a screencast that gives a step by step guide to using Apertium via Apertium AWI. It's OK to assume that it has already been set up | 2–3 | Jimregan |
training | 1. Hard | Apertium Regular Expressions guide | lttoolbox allows a limited subset of POSIX regular expressions. Create a guide to the regexes allowed, and to using them for common tasks, such as matching dates. | 2–3 | Jimregan |
quality | 3. Easy | Release freshness | Go through all the 25 released pairs and note down their date of last release and how many dictionary entries and rules they have. Then go to SVN and look at the module for the released pair and find out how many dictionary entries and rules it has. Put this into a spreadsheet and email the mailing list. Why? Our release cycle is very slow, and often we get pairs in trunk which have substantial improvements but have not been released. | 2–4 | Francis Tyers |
outreach | 3. Easy | Translate the Wikipedia article on Apertium: Aragonese | Translate the article on Apertium into Aragonese for the Aragonese Wikipedia | 30m-1h | Juan Pablo Martínez |
documentation | 3. Easy | Create a dictionary crossing guide | Create a full guide to crossing dictionaries, using notes that will be provided. | 2–3 | Jimregan |
documentation | 3. Easy | Create an installation guide for Windows users | We have some installation notes, but they were not written with an average user in mind. Write a new installation guide, specifically for Windows users, that don't presume a high level of technical knowledge. | 2–3 | Jimregan |
documentation | 3. Easy | Create an installation guide for Mac users | We have some installation notes, but they were not written with an average user in mind. Write a new installation guide, specifically for Mac users, that don't presume a high level of technical knowledge. | 2–3 | Jimregan |
documentation | 3. Easy | Create an installation guide for Ubuntu users | We have some installation notes, but they were not written with an average user in mind. Write a new installation guide, specifically for Ubuntu users, that don't presume a high level of technical knowledge. Specifically, steer people away from installing the dated Debian/Ubuntu packages. | 2–3 | Jimregan |
outreach | 3. Easy | Writing a quick guide on 'What Apertium can and cannot do to help you with your homework'. | Students around the world use Apertium (and other MT systems) to do their second-language homework. The documents would summarize the do's and don'ts, and could even elaborate on how students using Apertium for their homework could discover ways in which Apertium could be improved. | 2–3 | Mikel L. Forcada |
documentation | 3. Easy | Document undocumented features: manpages | Work through each of the manpages in apertium and lttoolbox, checking that each of the options listed by --help is documented. | 2–4 | Jimregan |
research | 3. Easy | Create manually tagged corpora: Occitan | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. | 2–4 | Mireia Ginestí |
research | 3. Easy | Create manually tagged corpora: Italian | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. | 2–4 | Mireia Ginestí |
research | 3. Easy | Create manually tagged corpora: Catalan | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking the corpus in the es-ca package, and adapting it in terms of the multiwords present in en-ca, but absent in es-ca. | 2–4 | Mireia Ginestí |
research | 3. Easy | Create manually tagged corpora: Polish | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. It may be preferable to use LanguageTool's tagger. | 2–4 | Jimregan |
research | 3. Easy | Create manually tagged corpora: Czech | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. It may be preferable to use LanguageTool's tagger. | 2–4 | Jimregan |
research | 3. Easy | Create manually tagged corpora: Slovakian | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. It may be preferable to use LanguageTool's tagger. | 2–4 | Jimregan |
research | 3. Easy | Create manually tagged corpora: Russian | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. It may be preferable to use LanguageTool's tagger. | 2–4 | Jimregan |
research | 3. Easy | Create manually tagged corpora: Ukrainian | Fix tagging errors in a piece of analysed text, for use in tagger training. This will involve taking some free text (such as from Wikipedia), running it through the analyser and tagger, and replacing incorrect analyses with the correct one. It may be preferable to use LanguageTool's tagger. | 2–4 | Jimregan |
quality | 1. Hard | Improve a language pair: Welsh-English | Find some faults in Welsh-English and fix them. | 8–12 | Francis Tyers |
quality | 1. Hard | Improve a language pair: Breton-French | Find some faults in Breton-French and fix them. | 8–12 | Francis Tyers |
quality | 1. Hard | Improve a language pair: Basque-Spanish | Find some faults in Basque-Spanish and fix them. | 8–12 | Mireia Ginestí |
documentation | 2. Medium | Document undocumented features: cascaded interchunk | Update the Apertium manual to document cascaded interchunk. | 4–8 | Mikel L. Forcada |
documentation | 2. Medium | Document undocumented features: transliteration | Update the Apertium manual to document the transliteration features in lttoolbox. | 4–8 | Francis Tyers |
quality | 1. Hard | Fix some tagger errors in Swedish->Danish | apertium-sv-da could be improved with a Constraint Grammar. Find 10 sentences that get wrong translations due to tagging, and write CG rules to fix them. The student should have good knowledge of Swedish, or at least some Scandinavian language. | 8–12 | Unhammer |
quality | 3. Easy | Improve Swedish-Danish | Add 50 nouns you feel are missing in translations from Swedish to Danish. | 3–6 | Jacob Nordfalk |
quality | 3. Easy | Improve English-Esperanto | Add 50 words you feel are missing in translations from English to Esperanto. | 3–6 | Jacob Nordfalk |
quality | 3. Easy | Improve Spanish-Esperanto | Add 50 words you feel are missing in translations from Spanish to Esperanto. | 3–6 | Hectoralos |
quality | 3. Easy | Improve Catalan-Esperanto | Add 50 words you feel are missing in translations from Catalan to Esperanto. | 3–6 | Hectoralos |
quality | 3. Easy | Improve French-Esperanto | Add 50 words you feel are missing in translations from French to Esperanto. | 3–6 | Hectoralos |
quality | 3. Easy | Improve Spanish-Aragonese | Add 50 words you feel are missing in translations from Aragonese to Spanish. | 3–6 | Juan Pablo Martínez |
quality | 3. Easy | Improve Afrikaans-Dutch | Add 50 words you feel are missing in translations from Afrikaans-Dutch. A list of unknown words can be provided. | 3–6 | Pim Otte |
quality | 3. Easy | Afrikaans-Dutch tests | Finish the Afrikaans-Dutch pending tests and move the passing tests to a seperate page for regression testing. | 3–6 | Pim Otte |
quality | 3. Easy | Improve Irish-Manx Gaelic coverage | I can provide a list of the most common Irish words not covered by the bilingual dictionary, and their English translations. Manx translations needed for these. | 3–6 | Kevin Scannell |
quality | 3. Easy | Add gender information to Manx dictionary | Most of the nouns in the Manx dictionary have gender information in place - look up and add any that are missing. | 3–6 | Kevin Scannell |
quality | 3. Easy | Proofread Albanian analyser | We have a morphological analyser for Albanian, but it has been written by a non-native speaker and needs to be checked. | 6–10 | Francis Tyers |
translation | 3. Easy | Proofread Catalan--Sardian dictionary | Go through the Catalan--Sardinian dictionary and check the entries, there are only around a thousand or so. | 1–2 | Francis Tyers |
quality | 2. Medium | Improve Spanish-Aragonese coverage | Create a corpus from Aragonese Wikipedia (with all the articles which do not have the template {{Grafía_87}} (or equivalently, those which are not in category [1]). Then, add the top 50-100 frequently used words which are not covered in the Apertium es-an dictionaries. | 6–10 | Juan Pablo Martínez |
code | 2. Medium | Add toponyms to the Spanish-Aragonese dictionaries | Extract from wikipedia the names in Aragonese for countries in the world, their capital cities, main Spanish cities and municipalities in Aragon, and add them to the es-an dictionaries. | 6–10 | Juan Pablo Martínez |
code | 2. Medium | Update the Apertium TinyLex J2ME apps for one language pair | Update Apertium TinyLex J2ME packages (http://www.tinylex.com/) to contain the most recent versions of dictionaries for one language pair | 4–6 per package | Mikel L. Forcada |
code | 2. Medium | Create an Apertium TinyLex J2ME app for a new language pair | Create an Apertium TinyLex J2ME package (http://www.tinylex.com/) from an existing Apertium language pair for a language pair not offered yet in Tinylex | 6–10 per package | Mikel L. Forcada |