Difference between revisions of "Task ideas for Google Code-in"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) |
|||
Line 19: | Line 19: | ||
==Task ideas== |
==Task ideas== |
||
− | <table class="sortable wikitable"> |
+ | <table class="sortable wikitable"> |
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th></tr> |
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th></tr> |
||
{{Taskidea|type=code|mentors=Fran|tags=c++ |
{{Taskidea|type=code|mentors=Fran|tags=c++ |
||
Line 45: | Line 45: | ||
|description=convert [http://youssefsan.eu/wiki/index.php?title=Wolof] to apertium-style documentation |
|description=convert [http://youssefsan.eu/wiki/index.php?title=Wolof] to apertium-style documentation |
||
}} |
}} |
||
− | {{Taskidea|type=code|tags=c++ |
+ | {{Taskidea|Implement `lt-print --strings` lt-print -s|type=code|tags=c++|mentors=Fran}} |
− | Implement |
+ | {{Taskidea|Implement an algorithm that prints out a transducer but only follows ''n'' cycles.|type=code|tags=c++|mentors=Fran}} |
− | {{Taskidea|type=code|tags=c++ |
||
− | Implement an algorithm that prints out a transducer but only follows ''n'' cycles.|mentors=Fran}} |
||
{{Taskidea|title=in-browser globe with apertium languages as points|description=Use d3 globe to make an apertium language/pair viewer (like [[pairviewer]]), maybe based on [https://www.jasondavies.com/maps/rotate/ this] or [http://bl.ocks.org/KoGor/5994804 this] or [http://bl.ocks.org/dwtkns/4973620 this].|mentors=Firespeaker|type=coding|tags=js,html,maps}} |
{{Taskidea|title=in-browser globe with apertium languages as points|description=Use d3 globe to make an apertium language/pair viewer (like [[pairviewer]]), maybe based on [https://www.jasondavies.com/maps/rotate/ this] or [http://bl.ocks.org/KoGor/5994804 this] or [http://bl.ocks.org/dwtkns/4973620 this].|mentors=Firespeaker|type=coding|tags=js,html,maps}} |
||
{{Taskidea|make a thing to detect contexts where a path in a compiled transducer begins with a whitespace}} |
{{Taskidea|make a thing to detect contexts where a path in a compiled transducer begins with a whitespace}} |
Revision as of 18:56, 29 October 2016
Contents |
This is the task ideas page for Google Code-in, here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:
- this does not include time taken to install / set up apertium.
- this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve.
Categories:
- code: Tasks related to writing or refactoring code
- documentation: Tasks related to creating/editing documents and helping others learn more
- research: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions
- quality: Tasks related to testing and ensuring code is of high quality.
- interface: Tasks related to user experience research or user interface design and interaction
You can find descriptions of some of the mentors here: List_of_Apertium_mentors.
Task ideas
type | title | description | tags | mentors | |||
---|---|---|---|---|---|---|---|
code | Fix a memory leak in matxin-transfer | hargle bargle | c++ | Fran | |||
Tag text in Apertium format | Fran | ||||||
code | Convert Chukchi Nouns to HFST/lexc | Fran | |||||
code | Convert Chukchi Numerals to HFST/lexc | Fran | |||||
code | Convert Chukchi Adjectives to HFST/lexc | Fran | |||||
Make a (web) viewer for parallel treebanks (also for viewing diff annotation for same sentence) | |||||||
Write a script to convert a UD treebank for a given language to a format suitable for training the perceptron tagger | |||||||
Train the perceptron tagger for a language | Fran | ||||||
Design an annotation tool for disambiguation | |||||||
Design an annotation tool for adding dependencies | |||||||
Train lexical selection rules from a large parallel corpus for a language pair | Fran | ||||||
Document how to set up the experiments for weighted transfer rules | Fran | ||||||
convert UD treebank to apertium tags, use unigram tagger (see #apertium logs 2016-06-22) | |||||||
Write a script to extract sentences from CoNLL-U where they have the same tokenisation as Apertium. | Fran | ||||||
convert [1] to apertium-style documentation | |||||||
code | Implement `lt-print --strings` lt-print -s | c++ | Fran | ||||
code | Implement an algorithm that prints out a transducer but only follows n cycles. | c++ | Fran | ||||
coding | in-browser globe with apertium languages as points | Use d3 globe to make an apertium language/pair viewer (like pairviewer), maybe based on this or this or this. | js,html,maps | Firespeaker | |||
make a thing to detect contexts where a path in a compiled transducer begins with a whitespace | |||||||
make the lt-comp compiler print a warning when a path begins with a whitespace. | |||||||
apertium-mar-hin: make the SL morph for any part of speech less daft | |||||||
add other indic scripts/formal latin transliterations to the currently-just-WX transliterator | |||||||
code | refactor x file in apy | Putti | |||||
documentation | add docstrings to x file in apy | Putti | |||||
code | write 10 unit tests for apy | Putti, (sushain, unhammer ?) |
- apertium-hun:
- hunmorph.db conversions
- additional morphology
- stuff
- suggest a word feature to web translator:
- user managements
- saving data
- deploy it to apertium.org? If it's already exisitng somewhere in githubs, maybe also to merge
- merge Kira's contributions more generally
- apertium-mar:
- more consistency with apertium-hin for verbs
- replace cases with postpositions
- fix modals and quasi-modals
- APYrtium CLARIN switchboard (oder Weblicht) compatabilitet:
- make apy i/o into clarin compatible formats
- make cmdi metadata of apertium (translators, analysers, etc.)
- make parts of pipelines callable
- apertium-fin-eng
- sieve through the rubbish words and translations from finnwordnet (i.e. most of fin-eng bidix)
- add words from eng side of the dix to apertium-eng
- do stuff
- add 1 transfer rule (fran, vin)
- add 50 entries to a bidix (fran, vin)
- write 10 lexsel rules for a pair already set up with lexical selection (fran, vin)
- write 10 constraint grammar rules (fran, vin)