Difference between revisions of "Ideas for Google Summer of Code/Adopt a language pair"
Jump to navigation
Jump to search
Line 13: | Line 13: | ||
* [[Faroese and Icelandic]] |
* [[Faroese and Icelandic]] |
||
* [[Breton and Welsh]] |
* [[Breton and Welsh]] |
||
+ | * [[Indonesian and Malaysian]] |
||
+ | * [[Russian and Ukrainian]] |
||
+ | * [[Czech and Slovak]] |
||
Not closely related: |
Not closely related: |
Revision as of 13:45, 5 February 2011
This project will involve writing linguistic data, including morphological rules and transfer rules — which are specified in a declarative language. A good intro would be to look through Apertium New Language Pair HOWTO. If the pair has OK dictionaries but a bad tagger (disambiguator), a GsoC project might involve writing a good Constraint Grammar for the pair (although this would likely require more knowledge of and interest in linguistics).
Here are a few links to pages of pairs which are up for adoption:
Closely related:
- Hindi and Urdu
- Scottish Gaelic and Irish
- Iranian Persian and Tajik
- Serbo-Croatian and Macedonian
- Slovenian and Macedonian
- Czech and Slovenian
- Faroese and Icelandic
- Breton and Welsh
- Indonesian and Malaysian
- Russian and Ukrainian
- Czech and Slovak
Not closely related:
- Afrikaans to English
- Bengali and English
- Breton and English
- Dhivehi and English
- French and Esperanto
- Haitian Creole and English
If you find a pair in the Incubator in SVN, feel free to write up its status on a page on the Wiki and add it here.
And pairs which were adopted in past years:
- 2010
- 2009
- Swedish and Danish - new adopter needed
- Norwegian Nynorsk and Norwegian Bokmål