Apertium-neural

Revision as of 18:13, 4 June 2020 by Francis Tyers

Apertium was originally developed to offer a free/open-source framework for creating rule-based machine translation (RBMT) systems. It was modelled on existing systems but targeted at related languages, with the aim of doing one thing well.

What might an Apertium NMT system for lesser-resourced and marginalised languages look like?

Thoughts:

  • Trains without GPU or large compute
  • Optimised for small corpora (under 100k parallel sentences)
  • Includes linguistic tricks
  • Written in C++, built with autotools
  • Works with existing tools (formatters, APY etc.)

Pipeline(?):

apertium-destxt | apertium-preprocess | apertium-encode | apertium-decode | apertium-retxt
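
The pipeline above can be prototyped before any of the new tools exist. The following is a minimal sketch, not an implementation: apertium-destxt and apertium-retxt are existing Apertium programs, while apertium-preprocess, apertium-encode and apertium-decode are only the names proposed on this page. The helper run_pipeline and the stand-in stages in the demo are hypothetical and exist only for illustration. The sketch assumes every stage reads standard input and writes standard output, as existing Apertium tools do.

```python
import subprocess
import sys

# Stage names as listed on this page. Only apertium-destxt and
# apertium-retxt exist today; the three middle stages are proposals.
PROPOSED_PIPELINE = [
    ["apertium-destxt"],
    ["apertium-preprocess"],  # hypothetical
    ["apertium-encode"],      # hypothetical
    ["apertium-decode"],      # hypothetical
    ["apertium-retxt"],
]


def run_pipeline(stages, text):
    """Pass `text` through each command in order, stdout to stdin.

    Unlike a shell pipe, the stages run one after another rather than
    concurrently, which is enough for experimenting with short inputs.
    """
    data = text.encode("utf-8")
    for argv in stages:
        result = subprocess.run(
            argv, input=data, stdout=subprocess.PIPE, check=True
        )
        data = result.stdout
    return data.decode("utf-8")


if __name__ == "__main__":
    # Stand-in stage (an assumption for the demo): a tiny Python filter
    # that lowercases its input, in place of the not-yet-written tools.
    lowercase = [
        sys.executable,
        "-c",
        "import sys; sys.stdout.write(sys.stdin.read().lower())",
    ]
    print(run_pipeline([lowercase], "Hello World\n"), end="")
```

Once real binaries for the proposed stages are available, passing PROPOSED_PIPELINE to run_pipeline would exercise the full chain; until then, each stage can be replaced by a stand-in filter and developed independently.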