Difference between revisions of "User:Francis Tyers/Apertium 4"

Revision as of 20:05, 21 June 2020

Use embeddings for morphological disambiguation and lexical selection
Pass the surface form until transfer (to allow modules to look up surface form embeddings)
Retire the HMM tagger
Be able to train weights for morph analysis + morph. disambiguation + lexical selection + transfer end to end.
Fully functional recursive transfer

There should be a basic NMT implementation that functions in the Apertium ecosystem (C++,autotools,bash,apy,html-tools) for communities that want to build their own NMT systems and still take advantage of our ecosystem. We should be a one-stop shop for MT for marginalised langs.

@@ Line 9: / Line 9: @@
 * Extract multiwords from lexicons into "separable" FSTs
-* Train taggers for all languages
+* Train taggers for all languages using available corpora and a TLM
 * At least one state-of-the-art language pair (wrt. Google).