Difference between revisions of "User:Firespeaker/Cleaning up a tail"
Revision as of 16:44, 11 March 2014
The problem
For Turkic languages, a coverage run over a corpus leaves a huge tail of unknown words. Presumably this is because of morphological complexity: each stem can take a large number of inflected forms, so a small handful of unknown stems can result in hundreds of unknown forms.
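To make the effect concrete, here is a minimal sketch of a naive coverage count. It is only an illustration: the function name `coverage_and_tail`, the toy token list, and the `known` set standing in for an analyser's lexicon are all assumptions, not part of any existing tool.

```python
from collections import Counter

def coverage_and_tail(tokens, is_known):
    """Return (coverage, Counter of unknown surface forms).

    `is_known` is a hypothetical stand-in for a morphological analyser:
    any callable that says whether a surface form gets an analysis.
    """
    unknown = Counter(t for t in tokens if not is_known(t))
    total = len(tokens)
    covered = total - sum(unknown.values())
    coverage = covered / total if total else 0.0
    return coverage, unknown

# Toy corpus (assumed data): one missing stem, "ev", shows up in four
# different inflected forms, each of which counts as a separate unknown.
tokens = ["ve", "bir", "ev", "evler", "ve", "evlerde", "bir", "evlerimizde"]
known = {"ve", "bir"}

coverage, tail = coverage_and_tail(tokens, known.__contains__)

print(f"coverage: {coverage:.0%}")
for form, count in tail.most_common():
    print(count, form)
```

In this toy case a single missing stem accounts for all four distinct unknown forms, which is the pattern described above: the tail is long in forms but may be short in stems.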