Ideas for Google Summer of Code/Apertium African

From Apertium
Jump to navigation Jump to search

Apertium English--Hausa/Igbo/Swahili/Tigrinya/Yoruba

African languages are not particularly well served by Apertium. The four languages listed are quite important, and are only currently served by commercial machine translation companies such as Google, which makes these language communities dependent on a specific commercial provider. The objective is to start these language pairs (which haven't been started or have currentlu very little data in Apertium) and write an usable version which provides intelligible output.

Coding challenge

  • If necessary, install Apertium.
  • If there is some data for the language pair in the Apertium Github server, check it out and install it.
  • Add some minimal vocabulary and rules and check that they work. Ideally, select a few sentences that are translated with this vocabulary and show that they are translated correctly.
  • If the language pair is already in the Apertium GitHub server, submit a pull request.
  • If the language pair is not there, contact your mentor(s) so that they can start a repository for you to submit a pull request.