Ideas for Google Summer of Code/Adopt a language pair
< Ideas for Google Summer of Code
Jump to navigation
Jump to search
Revision as of 13:45, 5 February 2011 by Francis Tyers (talk | contribs)
This project will involve writing linguistic data, including morphological rules and transfer rules — which are specified in a declarative language. A good intro would be to look through Apertium New Language Pair HOWTO. If the pair has OK dictionaries but a bad tagger (disambiguator), a GsoC project might involve writing a good Constraint Grammar for the pair (although this would likely require more knowledge of and interest in linguistics).
Here are a few links to pages of pairs which are up for adoption:
Closely related:
- Hindi and Urdu
- Scottish Gaelic and Irish
- Iranian Persian and Tajik
- Serbo-Croatian and Macedonian
- Slovenian and Macedonian
- Czech and Slovenian
- Faroese and Icelandic
- Breton and Welsh
- Indonesian and Malaysian
- Russian and Ukrainian
- Czech and Slovak
Not closely related:
- Afrikaans to English
- Bengali and English
- Breton and English
- Dhivehi and English
- French and Esperanto
- Haitian Creole and English
If you find a pair in the Incubator in SVN, feel free to write up its status on a page on the Wiki and add it here.
And pairs which were adopted in past years:
- 2010
- 2009
- Swedish and Danish - new adopter needed
- Norwegian Nynorsk and Norwegian Bokmål