Ideas for Google Summer of Code/Improving support for non-standard text input

From Apertium
< Ideas for Google Summer of Code
Revision as of 16:01, 12 February 2014 by Francis Tyers (talk | contribs) (Created page with "Create a module that will standardise non-standard input. For example, slang, abbreviations. ==Some examples from English== * Extra space: "he he" (hehe) * Spacing and hyph...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Create a module that will standardise non-standard input. For example, slang, abbreviations.

Some examples from English

  • Extra space: "he he" (hehe)
  • Spacing and hyphen variation: no-one, noone, no one
  • Optional hyphen: re-integrate, reintegrate
  • Missing apostrophe: shes thinking about it
  • Non-standard capitalisation: im thinking about it
  • Abbreviated words: fav,

Coding challenge

Tasks

Frequently asked questions

See also