Talk:Ideas for Google Summer of Code/Spell checking

From Apertium
Revision as of 10:31, 13 February 2015 by Unhammer

The speller should not give suggestions that aren't in the bidix: if the user clicks "oh yes, that's what I meant" and the word still isn't translated, that will just be frustrating. On the other hand, the speller should use as large a lexicon as possible, so that we don't suggest bidix words for words that are actually correct (but just happen not to be in the bidix). One space-efficient way of doing this would be to have a single FSA with three possible states of finality:

  1. not final,
  2. correct but not suggested (words only in spelldix), and
  3. correct and suggested (words that are in bidix).
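
A rough sketch of the idea in Python, to make the three states concrete. It uses a plain trie rather than a minimised FSA or a real lttoolbox/HFST transducer, and every name in it (`Finality`, `build`, `lookup`, `suggest`) is made up for illustration. Suggestions are found with a simple Levenshtein walk over the trie, which is only an assumption about how candidates might be generated.

```python
from enum import IntEnum


class Finality(IntEnum):
    NOT_FINAL = 0        # 1. not a word
    CORRECT_ONLY = 1     # 2. correct, but never suggested (spelldix only)
    CORRECT_SUGGEST = 2  # 3. correct and may be suggested (in bidix)


class Node:
    __slots__ = ("children", "finality")

    def __init__(self):
        self.children = {}
        self.finality = Finality.NOT_FINAL


def build(spelldix_words, bidix_words):
    """Build one trie holding both lexicons, marked by finality."""
    root = Node()
    for words, fin in ((spelldix_words, Finality.CORRECT_ONLY),
                       (bidix_words, Finality.CORRECT_SUGGEST)):
        for word in words:
            node = root
            for ch in word:
                node = node.children.setdefault(ch, Node())
            # A word in both lexicons keeps the stronger state.
            node.finality = max(node.finality, fin)
    return root


def lookup(root, word):
    """Return the finality of the state reached by `word`."""
    node = root
    for ch in word:
        node = node.children.get(ch)
        if node is None:
            return Finality.NOT_FINAL
    return node.finality


def suggest(root, word, max_dist=1):
    """Suggest bidix words within `max_dist` edits of a misspelt word."""
    if lookup(root, word) != Finality.NOT_FINAL:
        return []  # the word is correct, so suggest nothing
    found = {}

    def walk(node, prefix, prev_row):
        # Only states of finality 3 are ever offered to the user.
        if node.finality == Finality.CORRECT_SUGGEST and prev_row[-1] <= max_dist:
            found[prefix] = prev_row[-1]
        for ch, child in node.children.items():
            row = [prev_row[0] + 1]
            for i in range(1, len(word) + 1):
                cost = 0 if word[i - 1] == ch else 1
                row.append(min(row[i - 1] + 1,           # insertion
                               prev_row[i] + 1,          # deletion
                               prev_row[i - 1] + cost))  # substitution
            if min(row) <= max_dist:  # prune hopeless branches
                walk(child, prefix + ch, row)

    walk(root, "", list(range(len(word) + 1)))
    return sorted(found, key=lambda w: (found[w], w))


# Toy lexicons: "cart" and "care" are correct words missing from the bidix.
fsa = build(spelldix_words=["cat", "cart", "care"], bidix_words=["cat", "car"])
```

With these toy lexicons, `suggest(fsa, "cart")` returns nothing because "cart" is accepted as correct, while `suggest(fsa, "caat")` offers only "cat": "cart" is just as close, but it is in state 2 and so is never suggested.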