Difference between revisions of "Ideas for Google Summer of Code/Apertium African"

Latest revision as of 19:48, 24 March 2020

Apertium English--Hausa/Igbo/Swahili/Tigrinya/Yoruba

African languages are not particularly well served by Apertium. The languages listed are quite important, yet they are currently served only by commercial machine translation providers such as Google, which makes these language communities dependent on a specific commercial provider. The objective is to start these language pairs (which either have not been started or currently have very little data in Apertium) and write a usable version that provides intelligible output.

Coding challenge

  • Install a GNU/Linux system. There is an Apertium virtual machine (http://wiki.apertium.org/wiki/Apertium_VirtualBox) you can install using VirtualBox.
  • If necessary, install Apertium.
  • If there is some data for the language pair on the Apertium GitHub server, check it out and install it.
  • Check out a language pair that may be similar, and build similar files for your English--(African language) system. See Apertium_New_Language_Pair_HOWTO.
  • Add some minimal vocabulary and rules and check that they work. Ideally, select a few sentences that this vocabulary covers and show that they are translated correctly.
  • If the language pair is already on the Apertium GitHub server, submit a pull request.
  • If the language pair is not there, contact your mentor(s) so that they can start a repository to which you can submit a pull request.
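
One possible way to show that the selected sentences are translated correctly is a small regression script. The sketch below is only an illustration, not part of the Apertium tooling: it assumes the `apertium` command is on the PATH, that the pair has been compiled in the directory passed as `pair_dir`, and that the mode is named `eng-hau` (a hypothetical name; use whatever mode your pair actually defines). The helper names `parse_pairs`, `apertium_translate` and `check_sentences` are invented for this example.

```python
import subprocess
from typing import Callable, Iterable, List, Tuple


def parse_pairs(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Read 'source<TAB>expected' lines, skipping blanks and '#' comments."""
    pairs = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip() or line.lstrip().startswith("#"):
            continue
        source, _, expected = line.partition("\t")
        pairs.append((source.strip(), expected.strip()))
    return pairs


def apertium_translate(text: str, mode: str = "eng-hau", pair_dir: str = ".") -> str:
    """Pipe text through the `apertium` command-line tool.

    Assumption: `mode` and `pair_dir` match a locally compiled pair;
    "eng-hau" is only a placeholder.
    """
    result = subprocess.run(
        ["apertium", "-d", pair_dir, mode],
        input=text,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def check_sentences(
    pairs: Iterable[Tuple[str, str]],
    translate: Callable[[str], str],
) -> List[Tuple[str, str, str]]:
    """Return (source, expected, actual) for every sentence that does not match."""
    failures = []
    for source, expected in pairs:
        actual = translate(source).strip()
        if actual != expected.strip():
            failures.append((source, expected, actual))
    return failures
```

With a tab-separated file of your own test sentences (one "English sentence, tab, expected translation" per line), you could call `check_sentences(parse_pairs(open(path, encoding="utf-8")), apertium_translate)` and include the resulting list of failures, ideally empty, in your application. Passing the translator in as a function keeps the check itself independent of whether Apertium is installed.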