Difference between revisions of "North Saami and Lule Saami"

From Apertium

Revision as of 13:24, 6 October 2008

Files

  • apertium-sme-smj.sme.dix — Northern Sami transducer
  • apertium-sme-smj.sme-smj.dix — Transfer lexicon
  • apertium-sme-smj.smj.dix — Lule Sami transducer
  • apertium-sme-smj.sme-smj.rlx — Constraint grammar
  • apertium-sme-smj.sme-smj.t1x — Transfer rule file (level 1 -- Local re-ordering, chunking)
  • apertium-sme-smj.sme-smj.t2x — Transfer rule file (level 2 -- Phrase and chunk re-ordering)
  • apertium-sme-smj.sme-smj.t3x — Transfer rule file (level 3 -- Final touches)

TODO

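  • Mapping tags produced by the CG contain characters that are special in Apertium, for example '>' (used for delimiting tags) and '-'. These characters need to be replaced or escaped.
Example:
^Wikipedia<N><Prop><Sg><Nom><@SUBJ>>$
The CG tag here is @SUBJ>. Its trailing '>' collides with the tag delimiter, leaving a stray '>' after the tag closes.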
  • Re-train the HMM-based POS tagger on a Sami corpus.
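
One possible way to handle the first TODO item is to rewrite the offending characters before the tags are wrapped in Apertium's angle brackets. The following is only a minimal sketch: the function names and the replacement table are hypothetical, and the arrow characters are an assumption, chosen because they do not clash with the tag delimiters. The '-' character is left untouched because the page does not say how it should be handled.

```python
# Hypothetical replacement table (an assumption, not an Apertium convention
# confirmed by this page): map the delimiter characters to arrows.
REPLACEMENTS = {">": "→", "<": "←"}


def sanitize_cg_tag(tag: str) -> str:
    """Replace characters in a CG tag that clash with the <...> tag delimiters."""
    return "".join(REPLACEMENTS.get(ch, ch) for ch in tag)


def to_lexical_unit(lemma: str, tags: list[str]) -> str:
    """Build a ^lemma<tag>...$ lexical unit, sanitizing each tag first."""
    return "^" + lemma + "".join(f"<{sanitize_cg_tag(t)}>" for t in tags) + "$"


if __name__ == "__main__":
    print(to_lexical_unit("Wikipedia", ["N", "Prop", "Sg", "Nom", "@SUBJ>"]))
    # prints: ^Wikipedia<N><Prop><Sg><Nom><@SUBJ→>$
```

With this scheme the example above no longer ends in a doubled '>', at the cost of the transfer rules having to match the renamed tag.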