Difference between revisions of "Ideas for Google Summer of Code/Apertium Occitan French"
Last edited by Popcorndude (minor edit: categorize)
Latest revision as of 19:48, 24 March 2020
Improving Apertium Occitan-French
The Occitan-French language pair (https://github.com/apertium/apertium-oci-fra) was recently published. This language pair is of strategic importance for the Occitan language, as Apertium offers the only machine translation system for it. The goal is to make Occitan output easier to post-edit and French output easier to understand. This entails expanding the monolingual and bilingual dictionaries, improving disambiguation, and writing new structural transfer rules.
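Coding challenge
- Install a GNU/Linux system. There is an Apertium virtual machine (http://wiki.apertium.org/wiki/Apertium_VirtualBox) you can install using VirtualBox.
- If necessary, install Apertium, the Occitan language data (https://github.com/apertium/apertium-oci), the French language data (https://github.com/apertium/apertium-fra), and the Apertium Occitan-French package (https://github.com/apertium/apertium-oci-fra).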
- Look for representative standard Occitan and French texts.
- Search for frequent words that are not translated in either direction.
- Modify the data packages so that the system now translates these words correctly.
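The search for frequent untranslated words can be partly automated. The following is a minimal sketch, not an official Apertium tool: it relies on the fact that Apertium prefixes words it could not analyse with an asterisk in its output, and it assumes the installed translation mode is named `oci-fra` (check the modes actually available on your system). The function names are hypothetical.

```python
import re
import subprocess
from collections import Counter

# Apertium marks words it could not analyse with a leading asterisk, e.g. "*mot".
# Note: an asterisk already present in the source text would also match.
UNKNOWN_WORD = re.compile(r"\*(\w+)")


def translate(text, mode="oci-fra"):
    """Translate text with the apertium command-line tool.

    The mode name "oci-fra" is an assumption; use the mode name
    provided by your installed language pair.
    """
    result = subprocess.run(
        ["apertium", mode],
        input=text,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def count_unknown(translated):
    """Count asterisk-marked (unknown) words in translated text, case-insensitively."""
    return Counter(word.lower() for word in UNKNOWN_WORD.findall(translated))


def most_frequent_unknown(translated, n=20):
    """Return the n most frequent unknown words as (word, count) pairs."""
    return count_unknown(translated).most_common(n)
```

Running `most_frequent_unknown(translate(corpus_text))` on a representative corpus gives a ranked list of candidate words to add to the dictionaries. Repeat with the opposite mode to cover the other translation direction.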
To convince us even more:
- Search for a structure that is frequently mistranslated and that can be easily repaired with a structural transfer rule.
- Modify the structural transfer rule packages so that the structure is now correctly translated.
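One way to check that a new transfer rule fixes the targeted structure without breaking others is to keep a small list of source sentences with their expected translations and rerun it after each change. The sketch below is only an illustration under that assumption: `failing_cases` is a hypothetical helper, and `translate` is any callable that maps a source string to its translation (for example, a wrapper around the `apertium` command).

```python
def normalize(text):
    """Collapse runs of whitespace so formatting differences are ignored."""
    return " ".join(text.split())


def failing_cases(cases, translate):
    """Return the test cases whose translation differs from the expected one.

    cases: iterable of (source, expected) string pairs.
    translate: callable taking a source string and returning its translation
               (hypothetical; supply your own wrapper around Apertium).
    """
    failures = []
    for source, expected in cases:
        actual = normalize(translate(source))
        if actual != normalize(expected):
            failures.append((source, normalize(expected), actual))
    return failures
```

An empty result means every listed structure is translated as expected. Each entry in a non-empty result shows the source, the expected output, and what the system actually produced.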
Finally:
- Submit a pull request with your modifications.