Difference between revisions of "Java port of Apertium runtime"

From Apertium
Jump to navigation Jump to search
Line 22: Line 22:
 
Af I see it:
 
Af I see it:
 
* Interchunk/postchunk I could do in very short time, so I could mentor this
 
* Interchunk/postchunk I could do in very short time, so I could mentor this
* Tagger: The C++ code is there and some has already been ported to Java (lttoolbox parts), probably someone could port it relatively easy, if someone understanding tagger would co-mentor that part. I'd say its not necessary to port tagger training, just the core (bigram) tagging during translation.
+
* Tagger: The C++ code is there and some has already been ported to Java ([[lttoolbox-java]] parts), probably someone could port it relatively easy, if someone understanding tagger would co-mentor that part. I'd say its not necessary to port tagger training, just the core (bigram) tagging during translation.
 
* Piping/modes isnt too hard.
 
* Piping/modes isnt too hard.
 
* Format handling: I'd say that it would be OK just to be able to handle normal text.
 
* Format handling: I'd say that it would be OK just to be able to handle normal text.
   
 
--[[User:Jacob Nordfalk|Jacob Nordfalk]] 08:33, 14 March 2010 (UTC)
 
--[[User:Jacob Nordfalk|Jacob Nordfalk]] 08:33, 14 March 2010 (UTC)
  +
  +
  +
See also this thread: [http://sourceforge.net/mailarchive/message.php?msg_name=20cf28cd1003080815x56dd1969h229c3f1c7c2e81e2%40mail.gmail.com]

Revision as of 09:00, 14 March 2010

A "Java port" of Apertium would enable use on

  • Windows,
  • J2ME/Android phones,
  • web pages (applets),
  • desktop application,
  • Java server applications.

The last 2 is relevant as, for example an OpenOffice.org plugin should be platform independent to be maintainable.

AFAIK we havent seen anyone embedding Apertium in a desktop application. Currently Apertium is usable in a local subdir but installation isnt trivial to an end user, but note that 'embedding' something isnt the same as 'using a locally installed version'.

Having a packaged easy-to-use version of Apertium ready for embedding MT in a larger program would be very cool. Ideally should a self-contained Apertium JAR file, only dependent on JRE and an additional JAR file per language pair.

An "embedding" approach is to use a client stub to an Apertium web service, but there can be reasons why people prefers to have things installed locally (we don't need to repeat them here).


Missing for a complete port of apertium in java is tagger, piping/modes, interchunk/postchunk and format handling.


Af I see it:

  • Interchunk/postchunk I could do in very short time, so I could mentor this
  • Tagger: The C++ code is there and some has already been ported to Java (lttoolbox-java parts), probably someone could port it relatively easy, if someone understanding tagger would co-mentor that part. I'd say its not necessary to port tagger training, just the core (bigram) tagging during translation.
  • Piping/modes isnt too hard.
  • Format handling: I'd say that it would be OK just to be able to handle normal text.

--Jacob Nordfalk 08:33, 14 March 2010 (UTC)


See also this thread: [1]