Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Ideas for Google Summer of Code/Apertium African

From Apertium
< Ideas for Google Summer of Code(Difference between revisions)
Jump to: navigation, search
(Created page with "== Apertium English--Hausa/Igbo/Swahili/Tigrinya/Yoruba == African languages are not particularly well served by Apertium. The four languages listed are quite important, and ...")
 
(Coding challenge)
 
Line 7: Line 7:
 
* Install a GNU/Linux system. There is an [http://wiki.apertium.org/wiki/Apertium_VirtualBox Apertium virtual machine] you can install using VirtualBox.
 
* Install a GNU/Linux system. There is an [http://wiki.apertium.org/wiki/Apertium_VirtualBox Apertium virtual machine] you can install using VirtualBox.
   
* If necessary, install Apertium, [https://github.com/apertium/apertium-oci the Occitan language data], [https://github.com/apertium/apertium-fra the French language data], and [https://github.com/apertium/apertium-oci-fra the Apertium Occitan-French package]
+
* If necessary, install Apertium.
  +
  +
* If there is some data for the language pair in the Apertium Github server, check it out and install it.
   
 
* Check out a language pair that may be similar, and build similar files for your English--(African language) system. See [[Apertium_New_Language_Pair_HOWTO]].
 
* Check out a language pair that may be similar, and build similar files for your English--(African language) system. See [[Apertium_New_Language_Pair_HOWTO]].

Latest revision as of 20:37, 4 February 2019

[edit] Apertium English--Hausa/Igbo/Swahili/Tigrinya/Yoruba

African languages are not particularly well served by Apertium. The four languages listed are quite important, and are only currently served by commercial machine translation companies such as Google, which makes these language communities dependent on a specific commercial provider. The objective is to start these language pairs (which haven't been started or have currentlu very little data in Apertium) and write an usable version which provides intelligible output.

[edit] Coding challenge

  • If necessary, install Apertium.
  • If there is some data for the language pair in the Apertium Github server, check it out and install it.
  • Add some minimal vocabulary and rules and check that they work. Ideally, select a few sentences that are translated with this vocabulary and show that they are translated correctly.
  • If the language pair is already in the Apertium GitHub server, submit a pull request.
  • If the language pair is not there, contact your mentor(s) so that they can start a repository for you to submit a pull request.
Personal tools