Northern Sámi and Norwegian/Derivations

From Apertium
Jump to navigation Jump to search

Derivations: general rules and exceptions

Sámi has a lot of derivation rules; sometimes the derived words have lexicalised translations in Bokmål, like ráhkisvuohta→kjærlighet, these we treat as exceptions which have to be specified in bidix. Other times we can use a general rule, like lohkagohten→begynte.1SG å lese.

We have two strategies for handling the rule/exception situation.

  1. For the situation where we have many exceptions, we let the analysis be eg. geavaheaddjiid/geavahit<V><TV><Der2><Actor><N><Pl> and from here there are two paths
    1. either this specific analysis is in bidix, here translating into bruker<n><m><pl>, or
    2. we have to use a transfer rule, in this case translating into de som bruker
  2. For the situation where we have few exceptions, we use dev/xfst2apertium.relabel to split the analysis into two lexical units. Two lexical units can't be specified in bidix, so here
    1. exceptions have to be added to the .lexc file as if they were lexicalised, so they remain one lexical unit
    2. while general transfer rules now match a pattern of two lexical units

More detailed: Deverbal nouns

Sámi verbs can turn into nouns. We want to be able to put this explicitly into the bidix (eg. sometimes the nob noun is not even based on the nob verb), but if it's not in bidix we want to be able to fall back on a construction using the verb, so

  • from geavaheaddjiid/geavahit<V><TV><Der2><Actor><N>
  • with fallback => de som bruker<vblex> (or something)
  • bidix specified => bruker<n><m>

With the following bidix entries we specify that we want bruker<n><m> in the above example:

    <e><p><l>geavahit<s n="V"/><s n="TV"/></l><r>bruke<s n="vblex"/><s n="pers"/></r></p><par n="__verb"/></e>
    <e><p><l>geavahit<s n="V"/><s n="TV"/><s n="Der2"/><s n="Actor"/><s n="N"/></l><r>bruker<s n="n"/><s n="m"/></r></p><par n="__n"/></e>

while if the second bidix line isn't there, we get the fallback. Transfer rules can now check

 <equal><clip side="tl" part="pos" ...><lit-tag v="N"/></equal>
 <equal><clip side="sl" part="pos" ...><lit-tag v="V"/></equal>

The same specification/fallback might be applied with other Derivations.

Other fallbacks

  • Der/at, adj->adv, gets adj.posi.nt.sg.ind ("vid" => "vidt")
  • Der/vuohta, adj->n, gets adj...def, eg. "grunn" -> "det grunne" (bidix overrides "fattig" to "fattigdom")
  • Der/st, v->v_diminutive, passes through as if nothing happened (although we could add the adverb "litt"?)

Derivation tags and their meanings

Note: Við eigum að breyta mörk neðan af því að það er ekki hægt að nota /. í mörkum í apertium. En þá eigum við að breyta CG líka...

There are also derivations of derivations:

"<geavaheaddjis>"
...
          "geavvat" V* IV* Der1 Der/h V* TV Der2 Actor N Sg Acc PxSg3

For transfer purposes it might be simplest to treat these "flatly" as if they were single derivations (ie. Der1_Der_h_V_TV_Der2).

Tag Type Example in Bokmål
Der/Dimin Diminutive mánáš "mánná" N Der1 Der/Dimin N Sg Nom barn→lite barn
Der/1 Der/st Diminutive verb attestit "addit" V TV Der1 Der/st V Inf gi→gi litt
Der/adda V→N/PrfPrc/Actio bassaladdan "bassalit" V* TV Der2 Der/adda →vaske tøy (bassat=vaske)
Der/ahtti V→V vajálduhttit "vajálduvvat" V* IV* Der2 Der/ahtti V TV →overse/glemme
Der/alla suffix bázáhallan "bázihit" V* TV Der2 Der/alla V Actio
Der/amoš suffix muitalamoš "muitalit" V TV Der3 Der/amoš N Sg Nom fortelle→
Der/asti suffix muitalastit "muitalit" V TV Der2 Der/asti V Inf fortelle→
Der/at Adj→Adv viidát "viiddis" A* Der2 Der/at Adv vid→vidt
Der/d V→V[refl] basadit "bassat" V TV Der1 Der/d V vaske→vaske seg
Der/eaddji V→N.Actor muitaleaddji "muitalit" V TV Der2 Actor N Sg Nom fortelle→forteller
Der/eamoš suffix muitaleamoš "muitalit" V* TV Der3 Der/eamoš fortelle→
Der/eapmi V→N deaivvadeapmi "deaivvadit" V IV Der2 Der/eapmi N Sg Nom møte(V) → møte(N)
Der/easti suffix muitaleastit "muitalit" V TV Der2 Der/easti V Inf fortelle →
Der/geahtes suffix eaiggátkeahtes "eaiggát" N* Der3 Der/geahtes eier →
Der/goahti V→V Inchoative boradišgohten "boradit" V TV Der3 Der/goahti V Ind Prt Sg1 spise → jeg begynte å spise
Der/h suffix geavaheaddji "geavvat" V* IV* Der1 Der/h V* TV Der2 Actor; orrohit "orrot" V* IV Der1 Der/h V heve seg→ ; bli/synes→
Der/halla V→V[recip] gulahallat "gullat" V* TV Der1 Der2 Der/halla høre→forstå hverandre («høre hverandre»?)
Der/heapmi suffix čađaheapmi "čađđa" N* Der1 Der2 Der/heapmi A
Der/huhtti suffix muosehuhttit "muoseheapme" A* Der1 Der/huhtti V* TV urolig→
Der/huvva suffix čađahuvvo "čađđa" N* Der1 Der2 Der/huvva V IV Imprt Prs ConNegII
Der/j suffix sáddejuvvot "sáddet" V* TV Der1 Der/j V* Der2 Der/PassL V sende→
Der1 Der/l V→V[subitive] borralit "borralit" V TV Der1 Der/l V spise→spise (i hast)
Der/l ???? ohcalit "ohcat" V* TV Der1 Der/l V lete→savne/lengte etter
Der/las suffix lotnolas "lotnut" V* TV Der1 Der2 Der/las A betale→
Der/laš N→Adj dábálaš "dáhpi" N Der1 Der/laš A Sg Nom skikk→vanlig
Der/lágan suffix earálágan "eará" Pron Indef Sg Gen Der1 Der/lágan A annen/andre→
Der/meahttun V→Adj[Neg] jáhkkemeahttun "jáhkkit" V TV Der1 Der/meahttun A Sg Nom tro/anta→utrolig
Der/muš suffix ??? "juhkat" V TV Der3 Der/muš N Sg Nom drikke→
Der/n suffix oažžun "oažžut" V* TV Der3 Der/n N få→?
Der/st Diminutive V→V oainnestit, várástit "várát" V TV Der1 Der/st V se→skimte (add "litt"?)
Der/stuvva suffix fuolastuvvat "fuollat" V* TV Der1 Der2 Der/stuvva V bry seg om→
Der/supmi suffix čállosupmi "čállit" V* TV Der2 Der/PassL V* Der3 Der/supmi N skrive/...→
Der/upmi suffix mearkkašupmi "mearkkašit" V* TV Der2 Der/PassL V* Der3 Der/upmi merge seg→
Der/viđá suffix málestanviđá "málet" V TV Der1 Der/st V Der2 Der/eapmi N SgCmp Der/viđá Adv male→
Der/vuohta Adj→N ráhkisvuohta "ráhkis" A Der3 Der/vuohta N Sg Nom kjær→kjærlighet