Difference between revisions of "Northern Sámi and Norwegian/anaphora"

 

Latest revision as of 13:03, 14 July 2010

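apertium-sme-nob does some simple anaphora resolution. Most of it happens in t2x (http://apertium.svn.sourceforge.net/viewvc/apertium/incubator/apertium-sme-nob/apertium-sme-nob.sme-nob.t2x), the first interchunk stage.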

  • A subject (or habitive) will set the variable "ana_gen" to its gender.
    • Additionally, if it is masc or fem, it sets the variable "ana_m_f" to its gender.
  • Any other adj, noun or verb chunk tagged GD (or having no gender tag) will use the previously set "ana_gen" gender.
    • If it is tagged "GD_pers", "ana_m_f" is used -- this is useful for words which are definitely animate, like son.
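
The rules above can be sketched in Python. This is only an illustration of the described behaviour, not the actual t2x code: the `Chunk` class, the `resolve_genders` function, the role labels "subj"/"hab" and the gender labels "m"/"f"/"nt" are all assumptions made for the example. It also assumes that a chunk is left unresolved when no antecedent has been seen yet, and that only a subject with a concrete gender updates the variables; the page does not say what happens in either case.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Chunk:
    # All field values here are hypothetical labels, not real Apertium tags.
    kind: str               # "noun", "adj", "verb", ...
    role: str               # "subj", "hab", or anything else
    gender: Optional[str]   # "m", "f", "nt", "GD", "GD_pers", or None


def resolve_genders(chunks: List[Chunk]) -> List[Optional[str]]:
    """Return one gender per chunk, filling GD / GD_pers / missing tags."""
    ana_gen: Optional[str] = None   # gender of the last subject or habitive
    ana_m_f: Optional[str] = None   # last masc/fem subject or habitive
    resolved: List[Optional[str]] = []
    for c in chunks:
        if c.role in ("subj", "hab") and c.gender in ("m", "f", "nt"):
            # A subject (or habitive) with a known gender sets ana_gen,
            # and additionally ana_m_f if it is masc or fem.
            ana_gen = c.gender
            if c.gender in ("m", "f"):
                ana_m_f = c.gender
            resolved.append(c.gender)
        elif c.kind in ("adj", "noun", "verb") and c.gender == "GD_pers":
            # Definitely animate words (like "son") use ana_m_f.
            resolved.append(ana_m_f or c.gender)
        elif c.kind in ("adj", "noun", "verb") and c.gender in ("GD", None):
            # Everything else that lacks a gender uses ana_gen.
            resolved.append(ana_gen or c.gender)
        else:
            resolved.append(c.gender)
    return resolved


# "Anne muitalii, ahte son lea ..." reduced to three toy chunks.
sentence = [
    Chunk("noun", "subj", "f"),        # Anne
    Chunk("verb", "pred", None),       # muitalii
    Chunk("noun", "subj", "GD_pers"),  # son
]
print(resolve_genders(sentence))  # ['f', 'f', 'f']
```

Keeping "ana_m_f" separate from "ana_gen" means that an intervening neuter subject changes the gender used for GD chunks but not the one used for GD_pers chunks.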

TODO

Nickel gives these examples:

Anne muitalii, ahte son lea leamaš Norggas.

Here son == Anne, so it should be feminine.

Anne jearai, ahte mii dan namma lei?

Here dan != Anne, so it should not take Anne's gender.

We _could_ also keep track of the second-to-last subject, but this might quickly get too complex. TODO: get more examples.