Prefixes and infixes
Jump to navigation Jump to search
One solution would be to see lexical forms (LFs) as sets and not as sequences. e.g. pl.n.kitabu or pl.kitabu.n would be the same (swahili)
Two possible ways:
- normalization of some kind, perhaps specified with some rules
- treating LFs as sets
- detecting lexical forms in the sequence they are generated by the tagger
The tagger should probably also be changed to deal with prefixes.