Basque and Spanish

From Apertium
Revision as of 15:58, 18 June 2007 by Mlforcada (talk | contribs) (started a document to discuss euskara problems)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

The idea

Mireia Ginestí is recycling Matxin data to build an Apertium-based system that would allow Spanish speakers to read Basque newspapers.

Some of the morphological choices in Matxin will be revised.

This document is to keep track of decisions and to raise questions

Deklinabidea?

For instance, "declination" will be treated as postpositions:

gizonentzat : gizon.n + a.det.pl + tzat.prep

In principle, the absolutive will not be marked:

gizonak : gizon.n + a.det.pl

Determiners and postpositions will be given mnemonic lemmas, one per case.

gizonei : gizon.n + a.det.pl + i.prep
Mirenekin : Miren.NP + kin.prep
katuarentzat : katu.n + a.det.sg + tzat.prep

Postpositions which can modify a noun phrase will be marked explicitly as ko

etxeetako: etxe.n + a.det.pl + ko.prep.ko
Mikelekin : Mikel.NP + kin.prep
Mikelekiko : Mikel.NP + kin.prep.ko

Possessives?

A problem appears with "possessives" like 'nire', 'gure', 'zuen', 'haien', 'bere'. Should they be treated as preadjectives ('izenlagun') or as genitive constructs:

nire: ni.pron.sg + ren.gen.ko
haien : hura.pron.pl + ren.gen.ko