
Corpora

 
* South African Government Services — http://xixona.dlsi.ua.es/~fran/services-gov-za-en_ZA-af_ZA.txt — English—Afrikaans — 2,500 approx. sentence aligned, 49,375 words.
 
 
* IJS-ELAN — http://nl.ijs.si/elan/ — English-Slovenian
 
* OPUS — http://urd.let.rug.nl/tiedeman/OPUS/index.php ‐ Open Source multilingual corpora
   
 
== Corpus tools ==
 

Revision as of 00:18, 30 August 2008

Lists of corpora under free licences (public domain, CC-BY-SA, GPL, etc.)

Corpora

Use this if you want to do English--<something> (funny alignments for non-English pairs)
Use this if you want to do <anything>--<anything>

Corpus tools
