Difference between revisions of "Talk:Northern Sámi and Norwegian"

From Apertium
Jump to navigation Jump to search
Line 36: Line 36:
 
::* t3x
 
::* t3x
 
::** move postpositions (SN ADPOS -> ADPOS SN) "[1 big house which is on the hill] [2 in]"
 
::** move postpositions (SN ADPOS -> ADPOS SN) "[1 big house which is on the hill] [2 in]"
::** V2? --[[User:Unhammer|unhammer]] 13:04, 20 January 2010 (UTC)
+
::** V2? --[[User:Unhammer|unhammer]] 13:04, 20 January 2010 (UTC) +1 [[User:Francis Tyers|Francis Tyers]]
::** Insert dropped pronouns? (Or tags for them?)--[[User:Unhammer|unhammer]] 14:25, 20 January 2010 (UTC)
+
::** Insert dropped pronouns? (Or tags for them?)--[[User:Unhammer|unhammer]] 14:25, 20 January 2010 (UTC) +1 [[User:Francis Tyers|Francis Tyers]]
 
::* t4x
 
::* t4x
 
::** Insert prepositions.
 
::** Insert prepositions.

Revision as of 14:37, 20 January 2010

Transfer strategy

So far I've been thinking this:

  • t1x: chunking
    • Turn adjectives and nouns into SN chunks, give them the right gender and number
    • Derivations into phrases?
  • t2x: movement
    • Put adpositions in front of SN chunks
    • In general move SN chunks around verbs, adverbs etc. to get right word order
    • Guess definiteness from word order, case, syntactic function
  • t3x: cleanup
    • Eg. if definiteness changed, make sure adj tags are consistent


We could also do:
  • t1x: light chunking (SN, ...)
  • t2x: more chunking (Relatives, subordinate clauses)
  • t3x: moving around and stuff
  • t4x: cleanup.

- Francis Tyers 18:32, 18 January 2010 (UTC)

The 1-4 are different files, is that it? There are both easy and hard issues when it comes to phrases, this speaks in favour of 4. But the clear-cut criterion for light vs. heavy?Trondtr 12:26, 19 January 2010 (UTC).

We'll need rules to cover both compounding and derivation, this speaks for 4-stage (eg. each noun could be a compound, multiplying each noun rule by two--or more if we have longer compounds?). We need to figure out what phenomena go in what stage though.unhammer 13:09, 19 January 2010 (UTC)
  • t1x
    • (de-)compounding,
    • derivation,
    • simple noun phrases (heads and their simple modifiers/specifiers: adj nom, adj adj nom, det adj adj nom, num adj nom),
    • simple periphrastic verb combinations (verb, vaux pp, vaux inf)
  • t2x
    • relatives (SN "who" SV -> SN)
    • co-ordination (SN "and" SN -> SN)
    • genitive modifiers (SN SN-Gen " [University of Reykjavik] [big old library]-GEN"
  • t3x
    • move postpositions (SN ADPOS -> ADPOS SN) "[1 big house which is on the hill] [2 in]"
    • V2? --unhammer 13:04, 20 January 2010 (UTC) +1 Francis Tyers
    • Insert dropped pronouns? (Or tags for them?)--unhammer 14:25, 20 January 2010 (UTC) +1 Francis Tyers
  • t4x
    • Insert prepositions.
    • Insert articles? --unhammer 13:32, 20 January 2010 (UTC)
    • Cleanup
- Francis Tyers 14:37, 19 January 2010 (UTC)