Difference between revisions of "Ideas for Google Summer of Code/Adopt a language pair"

From Apertium
Line 1: Line 1:
 
This project involves writing linguistic data, including morphological rules and transfer rules, which are specified in a declarative language. A good introduction is [[Apertium New Language Pair HOWTO]]; see also [[Contributing to an existing pair]]. If the pair has OK dictionaries but a bad tagger (disambiguator), a GSoC project might involve writing a good [[Constraint Grammar]] for the pair, although this would likely require more knowledge of and interest in linguistics.
   
  +
==Coding challenge==
Here are a few links to pages of pairs which are up for adoption:
 
   
  +
The coding challenge for this task is to ...
Closely related:
 
   
  +
==Previous GSoC projects==
* [[Hindi and Urdu]]
 
* [[Scottish Gaelic and Irish]]
 
* [[Iranian Persian and Tajik]]
 
* [[Serbo-Croatian and Macedonian]]
 
* [[Slovenian and Macedonian]]
 
* [[Czech and Slovenian]]
 
* [[Faroese and Icelandic]]
 
* [[Breton and Welsh]]
 
* [[Indonesian and Malaysian]]
 
* [[Russian and Ukrainian]]
 
* [[Czech and Slovak]]
 
 
Not closely related:
 
 
* [[Afrikaans to English]]
 
* [[Bengali and English]]
 
* [[Breton and English]]
 
* [[Dhivehi and English]]
 
* [[French_and_Esperanto/Notoj|French and Esperanto]]
 
* [[Haitian Creole and English]]
 
 
If you find a pair in the [[Incubator]] in [[SVN]], feel free to write up its status on a page on the Wiki and add it here.
 
   
 
And pairs which were adopted in past years:
 
   
  +
* 2011
 
** [[Serbo-Croatian and Macedonian]]
 
* 2010
 
 
** [[Macedonian and Bulgarian]]
 
Line 38: Line 19:
 
** [[Swedish and Danish]] - new adopter needed
 
 
** [[Norwegian Nynorsk and Norwegian Bokmål]]
 
  +
  +
==See also==
  +
  +
* [[List of language pairs]]
   
 
[[Category:Ideas for Google Summer of Code]]
 

Revision as of 10:45, 16 February 2012

This project involves writing linguistic data, including morphological rules and transfer rules, which are specified in a declarative language. A good introduction is Apertium New Language Pair HOWTO; see also Contributing to an existing pair. If the pair has OK dictionaries but a bad tagger (disambiguator), a GSoC project might involve writing a good Constraint Grammar for the pair, although this would likely require more knowledge of and interest in linguistics.

Coding challenge

The coding challenge for this task is to ...

Previous GSoC projects

And pairs which were adopted in past years:

See also