Difference between revisions of "Prefixes and infixes"
Jump to navigation
Jump to search
(New page: <pre> <mlforcada> I haven't thought much about it but I think the solution would be <mlforcada> to see lexical forms as sets and not as sequences <spectie> sergio said something about 'car...) |
|||
Line 1: | Line 1: | ||
One solution would be to see lexical forms (LFs) as sets and not as sequences. e.g. pl.n.kitabu or pl.kitabu.n would be the same (swahili) |
|||
<pre> |
|||
<mlforcada> I haven't thought much about it but I think the solution would be |
|||
Two possible ways: |
|||
<mlforcada> to see lexical forms as sets and not as sequences |
|||
<spectie> sergio said something about 'cartesian product' |
|||
⚫ | |||
<spectie> would this require changing the 'translation as a stream' thing ? |
|||
⚫ | |||
<mlforcada> so that pl.n.kitabu or pl.kitabu.n would be the same (swahili) |
|||
⚫ | |||
<mlforcada> in fact vitabu would naturally be pl.kitabu.n |
|||
<mlforcada> I think we should only change transfer |
|||
<mlforcada> something that either normalizes it or deals it in whatever order |
|||
<mlforcada> so that one could deal with pl.n.kitabu |
|||
<mlforcada> in fact |
|||
<mlforcada> in basque |
|||
<mlforcada> it would be very nice to have this for verbs: |
|||
<mlforcada> dakarzu: you bring it ; nakarzu : you bring me |
|||
<spectie> yep |
|||
<mlforcada> the morpheme for object/absolutive comes first |
|||
<mlforcada> so there are two possible ways |
|||
⚫ | |||
⚫ | |||
<mlforcada> and a middle way |
|||
⚫ | |||
<mlforcada> however |
|||
<spectie> i think sets is probably cleaner ... although is there any information lost in the loss of ordering ? |
|||
<mlforcada> I think tagger rules should also be changed |
|||
<mlforcada> and we could have moyogo discuss lingala morphology which is also prefix, word classes, etc. as swahiki is |
|||
<mlforcada> I like it that verbs come marked with the classs of their object |
|||
<mlforcada> sw: nilikisoma: I read it where -ki- is it for a ki- class noun as kitabu |
|||
<spectie> is this inflectional or derivational ? |
|||
<mlforcada> ni-li-ki-soma : I-past-[ki object] soma |
|||
<mlforcada> I think it is inflection |
|||
</pre> |
Revision as of 15:03, 26 May 2007
One solution would be to see lexical forms (LFs) as sets and not as sequences. e.g. pl.n.kitabu or pl.kitabu.n would be the same (swahili)
Two possible ways:
- normalization of some kind, perhaps specified with some rules
- treating LFs as sets
- detecting lexical forms in the sequence they are generated by the tagger