Difference between revisions of "Corpora"

From Apertium
Jump to navigation Jump to search
m (→‎Corpora: OPUS)
Line 14: Line 14:
== Corpus tools ==
== Corpus tools ==


* BootCaT — http://sslmit.unibo.it/~baroni/bootcat.html Simple Utilities to Bootstrap Corpora and Terms from the Web
* Corpus Catcher — http://translate.sourceforge.net/wiki/corpuscatcher/index - Bootstrap corpora from the web
* BootCaT — http://sslmit.unibo.it/~baroni/bootcat.html - Simple Utilities to Bootstrap Corpora and Terms from the Web
* Bitextor — http://sourceforge.net/projects/bitextor/ - Bootstrap bilingual corpora from the web
* Bitextor — http://sourceforge.net/projects/bitextor/ - Bootstrap bilingual corpora from the web



Revision as of 08:44, 30 August 2008

Lists of corpora under free licences (public domain, CC-BY-SA, GPL, etc.)

Corpora

Use this if you want to do English--<something> (funny alignments for non-English pairs)
Use this if you want to do <anything>--<anything>

Corpus tools