Talk:Northern Sámi and Norwegian
Transfer strategy
So far I've been thinking this:
- t1x: chunking
- Turn adjectives and nouns into SN chunks, give them the right gender and number
- Derivations into phrases?
- t2x: movement
- Put adpositions in front of SN chunks
- In general move SN chunks around verbs, adverbs etc. to get right word order
- Guess definiteness from word order, case, syntactic function
- t3x: cleanup
- Eg. if definiteness changed, make sure adj tags are consistent
- We could also do:
- t1x: light chunking (SN, ...)
- t2x: more chunking (Relatives, subordinate clauses)
- t3x: moving around and stuff
- t4x: cleanup.
- Francis Tyers 18:32, 18 January 2010 (UTC)
The 1-4 are different files, is that it? There are both easy and hard issues when it comes to phrases, this speaks in favour of 4. But the clear-cut criterion for light vs. heavy?Trondtr 12:26, 19 January 2010 (UTC).
- We'll need rules to cover both compounding and derivation, this speaks for 4-stage (eg. each noun could be a compound, multiplying each noun rule by two--or more if we have longer compounds?). We need to figure out what phenomena go in what stage though.unhammer 13:09, 19 January 2010 (UTC)