Difference between revisions of "Basque and Spanish"

From Apertium
Jump to navigation Jump to search
Line 12: Line 12:


<pre>
<pre>
gizonentzat : gizon.n + a.det.pl + tzat.prep
gizonentzat : gizon.n + a.det.pl + tzat.post
</pre>
</pre>


Line 24: Line 24:


<pre>
<pre>
gizonei : gizon.n + a.det.pl + i.prep
gizonei : gizon.n + a.det.pl + i.post
Mirenekin : Miren.NP + kin.prep
Mirenekin : Miren.NP + kin.post
katuarentzat : katu.n + a.det.sg + tzat.prep
katuarentzat : katu.n + a.det.sg + tzat.post
</pre>
</pre>


Line 32: Line 32:


<pre>
<pre>
etxeetako: etxe.n + a.det.pl + ko.prep.ko
etxeetako: etxe.n + a.det.pl + ko.post.ko
Mikelekin : Mikel.NP + kin.prep
Mikelekin : Mikel.NP + kin.post
Mikelekiko : Mikel.NP + kin.prep.ko
Mikelekiko : Mikel.NP + kin.post.ko
</pre>
</pre>


Line 41: Line 41:
A problem appears with "possessives" like 'nire', 'gure', 'zuen', 'haien', 'bere'. Should they be treated as preadjectives ('izenlagun') or as genitive constructs:
A problem appears with "possessives" like 'nire', 'gure', 'zuen', 'haien', 'bere'. Should they be treated as preadjectives ('izenlagun') or as genitive constructs:
<pre>
<pre>
nire: ni.pron.sg + ren.gen.ko
nire: ni.pron.sg + ren.post.ko
haien : hura.pron.pl + ren.gen.ko
haien : hura.pron.pl + ren.post.ko
</pre>
</pre>



Revision as of 09:17, 19 June 2007

The idea

Mireia Ginestí is recycling Matxin data to build an Apertium-based system that would allow Spanish speakers to read Basque newspapers.

Some of the morphological choices in Matxin will be revised.

This document is to keep track of decisions and to raise questions

Deklinabidea?

For instance, "declination" will be treated as postpositions:

gizonentzat : gizon.n + a.det.pl + tzat.post

In principle, the absolutive will not be marked:

gizonak : gizon.n + a.det.pl

Determiners and postpositions will be given mnemonic lemmas, one per case.

gizonei : gizon.n + a.det.pl + i.post
Mirenekin : Miren.NP + kin.post
katuarentzat : katu.n + a.det.sg + tzat.post

Postpositions which can modify a noun phrase will be marked explicitly as ko

etxeetako: etxe.n + a.det.pl + ko.post.ko
Mikelekin : Mikel.NP + kin.post
Mikelekiko : Mikel.NP + kin.post.ko

Possessives?

A problem appears with "possessives" like 'nire', 'gure', 'zuen', 'haien', 'bere'. Should they be treated as preadjectives ('izenlagun') or as genitive constructs:

nire: ni.pron.sg + ren.post.ko
haien : hura.pron.pl + ren.post.ko