Prefixes and infixes
Revision as of 15:07, 26 May 2007 by Francis Tyers
One solution would be to treat lexical forms (LFs) as sets rather than as sequences. For example, in Swahili, pl.n.kitabu and pl.kitabu.n would then count as the same lexical form.
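A minimal sketch of the set-based comparison, assuming lexical forms are dot-separated strings as in the example above. The function name `lf_as_set` is hypothetical and is not part of any existing tool.

```python
def lf_as_set(lexical_form: str, sep: str = ".") -> frozenset:
    """Split a lexical form on `sep` and discard the order of its parts.

    Assumption: the separator is "." as in the example pl.n.kitabu.
    """
    return frozenset(lexical_form.split(sep))


# The two orderings from the example compare equal once order is ignored.
print(lf_as_set("pl.n.kitabu") == lf_as_set("pl.kitabu.n"))  # True
```

Note that a plain set also drops repeated parts, so if the same tag can legitimately occur twice in one lexical form, a multiset (for example `collections.Counter`) would be the safer choice.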
Possible approaches:
- normalization of some kind, perhaps specified with some rules
- treating LFs as sets
- detecting lexical forms in the order in which the tagger generates them
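The normalization option could look roughly like the sketch below. It assumes a single hypothetical rule: a list of known tags in the order they should appear, with every other part treated as the lemma and placed first. The names `TAG_ORDER` and `normalize` and the chosen canonical order are illustrative only.

```python
# Hypothetical rule: known tags, in the order they should follow the lemma.
TAG_ORDER = ["n", "pl"]


def normalize(lexical_form, tag_order=TAG_ORDER, sep="."):
    """Rewrite a lexical form as lemma parts first, then tags in rule order."""
    parts = lexical_form.split(sep)
    # Assumption: anything that is not a known tag belongs to the lemma.
    lemma = [p for p in parts if p not in tag_order]
    # Sort the known tags by their position in the rule.
    tags = sorted((p for p in parts if p in tag_order), key=tag_order.index)
    return sep.join(lemma + tags)


print(normalize("pl.n.kitabu"))  # kitabu.n.pl
print(normalize("pl.kitabu.n"))  # kitabu.n.pl
```

Unlike the set-based option, this keeps lexical forms as sequences, so code that expects an ordered string would not need to change.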
The tagger should probably also be changed to deal with prefixes.