North Saami and Lule Saami
Revision as of 14:22, 17 June 2010

This page gives some details about the North Sámi to Lule Sámi translator.

Standardisation

  • Leahppi go gávdnan gusade?
    • Læhppe gu gávnnam gusáda?
    • Lihppe gu gávnnam gusáda?

TODO

Tagset mismatches

eará -- ietjá
$ echo "eará" | osme
191480 0
eará	eará+Pron+Indef+Sg+Nom
eará	eará+Pron+Indef+Sg+Gen
eará	eará+Pron+Indef+Sg+Acc
eará	eará+Pron+Indef+Attr

$ echo "ietjá+Pron+Indef+Attr" | dsmj
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
ietjá+Pron+Indef+Attr	ietjá+Pron+Indef+Attr	+?

buot -- divnna
$ echo "buot" | osme
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
buot	buot+Adv
buot	buot+Pron+Indef

$ echo "divnna" | osmj
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
divnna	divnna+Pron+Indef+Sg+Nom
divnna	divnna+Pron+Indef+Attr

ieš#guhtet -- iesj#guhtik
$ echo "iešguđetge" | osme
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
iešguđetge	ieš#guhtege+Pron+Indef+Pl+Nom
iešguđetge	ieš#guhtet+Pron+Indef+Acc+Foc/ge
iešguđetge	ieš#guhtet+Pron+Indef+Gen+Foc/ge
iešguđetge	ieš#guđet+Pron+Indef+Foc/ge

$ echo "iesj#guhtik+Pron+Indef+Gen+Foc/ge" | dsmj
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
iesj#guhtik+Pron+Indef+Gen+Foc/ge	iesj#guhtik+Pron+Indef+Gen+Foc/ge	+?

$ echo "iesj#guhtik+Pron+Indef+Foc/ge" | dsmj
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
iesj#guhtik+Pron+Indef+Foc/ge	iesj#guhtik#ge
iesj#guhtik+Pron+Indef+Foc/ge	iesj#guhtikge
iesj#guhtik+Pron+Indef+Foc/ge	iesjguhtik#ge
iesj#guhtik+Pron+Indef+Foc/ge	iesjguhtikge

maid -- aj
$ echo aj | osmj
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
aj	aj+Pcle

$ echo maid | osme
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%
maid	mii+Pron+Interr+Pl+Acc
maid	mii+Pron+Interr+Pl+Gen
maid	mii+Pron+Interr+Sg+Acc
maid	mii+Pron+Rel+Pl+Acc
maid	mii+Pron+Rel+Pl+Gen
maid	mii+Pron+Rel+Sg+Acc
maid	maid+Adv
maid	maid+Interj
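
One way to spot mismatches like the ones above more systematically is to reduce each analyser's output to its bare tag strings and list those that occur on the source side only. The following is a minimal sketch, not part of the language pair: the function names tag_strings and missing_in_target and the temporary sample files are hypothetical, and it assumes analyser output in the tab-separated "surface, then lemma+tags" shape shown above.

```shell
#!/bin/sh
# Hypothetical helpers; they assume tab-separated "surface<TAB>lemma+Tag+Tag" lines.

# Keep only the tag string of each analysis (e.g. "+Pron+Indef").
# cut -s drops lines without a tab (prompts, progress bars, blank lines).
tag_strings() {
  cut -s -f2 | sed 's/^[^+]*//' | sort -u
}

# Print tag strings found in file $1 (source side) but not in file $2 (target side).
missing_in_target() {
  src_tags=$(mktemp) && trg_tags=$(mktemp) || return 1
  tag_strings < "$1" > "$src_tags"
  tag_strings < "$2" > "$trg_tags"
  comm -23 "$src_tags" "$trg_tags"
  rm -f "$src_tags" "$trg_tags"
}

# Demo with the buot -- divnna analyses quoted above.
demo_dir=$(mktemp -d)
printf 'buot\tbuot+Adv\nbuot\tbuot+Pron+Indef\n' > "$demo_dir/sme.txt"
printf 'divnna\tdivnna+Pron+Indef+Sg+Nom\ndivnna\tdivnna+Pron+Indef+Attr\n' > "$demo_dir/smj.txt"
missing_in_target "$demo_dir/sme.txt" "$demo_dir/smj.txt"
rm -rf "$demo_dir"
```

For the buot -- divnna pair the demo lists +Adv and +Pron+Indef, that is, the North Sámi tag strings with no identical counterpart among the Lule Sámi analyses. Whether each difference needs a transfer rule or a change to one of the analysers is still a manual decision.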

Reminders

  • In the transfer rule files, remember to escape the '+' character in tags with a backslash. For example:
    no:  <attr-item tags="@+FMAINV"/>
    yes: <attr-item tags="@\+FMAINV"/>
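
The reminder above can be turned into a quick check. This is a minimal sketch under stated assumptions: find_unescaped_plus and escape_plus_in_tags are hypothetical helper names, not Apertium tools, and they only look at double-quoted tags="..." attribute values that sit on a single line.

```shell
#!/bin/sh
# Hypothetical helpers, not part of Apertium.

# List lines (with line numbers) where a tags="..." value contains a '+'
# that is not preceded by a backslash. Reads the given files, or stdin.
find_unescaped_plus() {
  grep -nE 'tags="([^"\\]|\\.)*\+' "$@"
}

# Copy stdin to stdout, escaping every unescaped '+' inside tags="...".
# The loop repeats the substitution until no unescaped '+' is left on the line.
escape_plus_in_tags() {
  sed -E -e ':again' -e 's/(tags="([^"\\]|\\.)*)\+/\1\\+/' -e 't again'
}

# Demo on the example from the reminder.
printf '%s\n' '<attr-item tags="@+FMAINV"/>' | escape_plus_in_tags
```

Running the check over the transfer rule files before compiling (for example find_unescaped_plus on the .t1x file) is the cautious option. The rewriting function is blunt, so its output is worth reviewing before replacing a rule file with it.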

Testing

$ echo "Wikipedia lea máŋggagielat prošeakta man ulbmilin lea ráhkadit almmolaš diehtosátnegirjji \
  gosa gii beare sáhttá čállit artihkkaliid." | apertium -d . sme-smj
Wikipedia la moattegielak prosjækta/prosjäkta #mij ulmmen la dahkat almulasj #diehtobáhkogirjje  
  #masi guhti beru máhttá tjállet artihkkalijt.
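
By Apertium convention, a word prefixed with '#' in the output is one the target-language generator could not produce, so counting those markers gives a rough progress measure for test sentences like the one above. The sketch below is only an illustration: count_hash_marked is a hypothetical helper name, and the input is the translator output copied from the run above rather than a live call to apertium.

```shell
#!/bin/sh
# Hypothetical helper: count whitespace-separated tokens that start with '#'.
count_hash_marked() {
  awk '{ for (i = 1; i <= NF; i++) if ($i ~ /^#/) n++ } END { print n + 0 }'
}

# Output copied from the test run above.
count_hash_marked <<'EOF'
Wikipedia la moattegielak prosjækta/prosjäkta #mij ulmmen la dahkat almulasj #diehtobáhkogirjje
  #masi guhti beru máhttá tjállet artihkkalijt.
EOF
```

For this sentence the count is 3 (#mij, #diehtobáhkogirjje and #masi). In real use the function would sit at the end of the pipeline, after apertium -d . sme-smj.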

See also

  • /Regression tests
  • /Pending tests

External links