User:Firespeaker/Cleaning up a tail
< User:Firespeaker
Jump to navigation
Jump to search
Revision as of 16:44, 11 March 2014 by Firespeaker (talk | contribs) (Created page with "== The problem == For Turkic languages, there is a huge tail of unknown words when running coverage. Presumably this is because of morphological complexity—i.e., a small ha...")
The problem
For Turkic languages, there is a huge tail of unknown words when running coverage. Presumably this is because of morphological complexity—i.e., a small handful of unknown stems can result in hundreds of unknown forms.