Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Ideas for Google Summer of Code/Adopt a language pair

From Apertium
< Ideas for Google Summer of Code(Difference between revisions)
Jump to: navigation, search
(Coding challenge)
(Previous GSOC projects)
Line 26: Line 26:
 
* 2011
 
* 2011
 
** [[Serbo-Croatian and Macedonian]]
 
** [[Serbo-Croatian and Macedonian]]
  +
** [[Turkish and Azerbaijani]]
  +
** [[Turkish and Kyrgyz]]
  +
** [[Maltese and Hebrew]]
 
* 2010
 
* 2010
 
** [[Macedonian and Bulgarian]]
 
** [[Macedonian and Bulgarian]]
Line 32: Line 35:
 
** [[Afrikaans and Dutch]] (GCI)
 
** [[Afrikaans and Dutch]] (GCI)
 
* 2009
 
* 2009
** [[Swedish and Danish]] - new adopter needed
+
** [[Swedish and Danish]]
 
** [[Norwegian Nynorsk and Norwegian Bokmål]]
 
** [[Norwegian Nynorsk and Norwegian Bokmål]]
   

Revision as of 12:04, 16 February 2012

Contents

This project will involve writing linguistic data, including morphological rules and transfer rules — which are specified in a declarative language. A good intro would be to look through Apertium New Language Pair HOWTO, see also Contributing to an existing pair. If the pair has OK dictionaries but a bad tagger (disambiguator), a GsoC project might involve writing a good Constraint Grammar for the pair.

Coding challenge

The coding challenge for this task is to:

  1. Install Apertium (see Minimal installation from SVN)
  2. Go through the HOWTO
  3. Go through the MT course here (или здесь)
  4. Write a translator that translates as much of this story as possible. (Другие переводы рассказа здесь.)
    • If there is no translation, translate it into the languages of your language pair first.
  5. Upload your work to Apertium SVN.

The URL to your coding challenge work should be included in your application.

Frequently asked questions

  • Can I do a pair with language x and language y ?
— Yes. But you should take the following into consideration: (a) Are there existing MT systems for this pair? (b) If there are existing systems, how good are they? -- Could you do better in three months? (c) How closely related is the pair? (d) How many resources already exist for the pair? (e) Are there any mentors who can evaluate your work?

Previous GSOC projects

And pairs which were adopted in past years:

See also

Personal tools