Ideas for Google Summer of Code/Sliding-window part-of-speech tagger

From Apertium
< Ideas for Google Summer of Code
Revision as of 09:52, 13 March 2013 by Francis Tyers (talk | contribs) (Created page with 'The idea is to implement the unsupervised part-of-speech tagger (http://en.wikipedia.org/wiki/Sliding_window_based_part-of-speech_tagging) as a drop-in replacement for the curren…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

The idea is to implement the unsupervised part-of-speech tagger (http://en.wikipedia.org/wiki/Sliding_window_based_part-of-speech_tagging) as a drop-in replacement for the current hidden-Markov-model tagger. It should have support for unknown words, and also for "forbid" descriptions (not described in the paper). The tagger has a very intuitive interpretation (believe me, even if you find the maths a bit daunting). I am available for questions (I invented the tagger, I should be able to remember!).


Task

Coding challenge

Frequently asked questions

See also