Translating wikimedia

From Apertium
Revision as of 17:57, 3 September 2011 by Bech (talk | contribs) (Documentation in English)

There are several options for translating MediaWiki-formatted text, such as Wikipedia articles:

  • apertium-desmediawiki and apertium-remediawiki ship with the apertium package and are installed by default, but handle only the basics
    • Usage: cat es-article.wiki | apertium-desmediawiki | apertium -f none es-ca | apertium-remediawiki > ca-article.wiki


  • apertium-mediawiki -- a newer work-in-progress format handler that handles links better (see below)


Open question: can it actually translate MediaWiki-formatted articles, or only po-files and the like?


  • use the regular format handler, then run a clean-up script such as wikifix.pl (written for nn-nb) afterwards
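
For the first option, the default handler pair, the usage line extends naturally to a whole directory of articles. The following is a minimal sketch, not part of Apertium: the function name translate_wiki_dir and the override variables DEFORMAT, TRANSLATE and REFORMAT are hypothetical names introduced here, and the defaults are simply the commands from the usage line above.

```shell
# Sketch: translate every *.wiki file in a directory with the default handler.
# translate_wiki_dir, DEFORMAT, TRANSLATE and REFORMAT are hypothetical names
# introduced for illustration; the defaults are the commands from the usage line.
translate_wiki_dir() {
    pair=$1      # language pair, e.g. es-ca
    src_dir=$2   # directory holding the source *.wiki files
    out_dir=$3   # directory that receives the translated files
    mkdir -p "$out_dir" || return 1
    for src in "$src_dir"/*.wiki; do
        [ -e "$src" ] || continue   # the glob matched nothing
        out="$out_dir/$(basename "$src")"
        # Same three-stage pipeline as the single-file usage line.
        ${DEFORMAT:-apertium-desmediawiki} < "$src" \
            | ${TRANSLATE:-apertium} -f none "$pair" \
            | ${REFORMAT:-apertium-remediawiki} > "$out"
    done
}
```

Assuming the function has been defined in the current shell, it could be called like this (the directory names are placeholders):

   translate_wiki_dir es-ca es-articles ca-articles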


apertium-mediawiki

This is a work-in-progress full format handler for MediaWiki.

Download and compile like this:

   svn co https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mediawiki
   cd apertium-mediawiki
   sh autogen.sh
   make
   sudo make install

You should now be able to run:

   cat es-article.wiki | demediawiki | apertium -f none es-ca | remediawiki > ca-article.wiki
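
If the pipeline stops with a "command not found" error, one of the three commands is probably not on your PATH. The following is a minimal sketch for checking that; missing_tools is a hypothetical helper name, not something shipped with apertium-mediawiki.

```shell
# Sketch: print the name of every given command that is not on PATH.
# missing_tools is a hypothetical helper name, not part of Apertium.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}
```

Calling it with the commands used above prints nothing when all of them are installed, and otherwise lists the ones that are missing:

   missing_tools demediawiki apertium remediawiki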


This extension was originally hosted at https://code.google.com/p/apertium-mediawiki/ , where some documentation can still be found.

To run the regression tests:

   make test

See also