User:Firespeaker/Cleaning up a tail

From Apertium
Revision as of 16:44, 11 March 2014 by Firespeaker

The problem

For Turkic languages, coverage tests leave a huge tail of unknown words. Presumably this is because of morphological complexity: a small handful of unknown stems can result in hundreds of unknown forms.
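To make the tail concrete, here is a minimal sketch that counts unknown forms in analyser output. It assumes the text is in the Apertium stream format, where each lexical unit looks like ^surface/analysis$ and an unknown word's analysis begins with an asterisk. The function names, the sample words, and the tags in the sample are hypothetical and only for illustration; escaped characters in the stream are not handled.

```python
import re
from collections import Counter

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
# (simplified: escaped characters such as \/ or \$ are not handled)
LEXICAL_UNIT = re.compile(r"\^([^/$]+)/([^$]*)\$")


def unknown_forms(analysed_text):
    """Count surface forms whose analysis starts with '*' (unknown word)."""
    counts = Counter()
    for surface, analyses in LEXICAL_UNIT.findall(analysed_text):
        if analyses.startswith("*"):
            counts[surface.lower()] += 1
    return counts


def coverage(analysed_text):
    """Fraction of lexical units that received at least one analysis."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    known = sum(1 for _, analyses in units if not analyses.startswith("*"))
    return known / len(units)


# Hypothetical sample: one known form and three unknown forms that
# plausibly share a single missing stem.
sample = (
    "^evler/ev<n><pl><nom>$ ^kitaplar/*kitaplar$ "
    "^kitabı/*kitabı$ ^kitaplarda/*kitaplarda$"
)

if __name__ == "__main__":
    for form, freq in unknown_forms(sample).most_common():
        print(freq, form)
    print("coverage:", coverage(sample))
```

In this sample, three distinct unknown forms appear, although adding one stem would likely cover all of them, which is the effect described above.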

A proposed solution