Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Wikipedia dumps

From Apertium
(Difference between revisions)
Jump to: navigation, search
Line 15: Line 15:
 
[[Category:Development]]
 
[[Category:Development]]
 
[[Category:Corpora]]
 
[[Category:Corpora]]
  +
[[Category:Documentation in English]]

Revision as of 18:41, 26 September 2016

Wikipedia dumps are useful for quickly getting a corpus. They are also the best corpora for making your language pair are useful for Wikipedia's Content Translation tool :-)

You download them from


There are several tools for turning dumps into useful plaintext, e.g.

Personal tools