North Saami and Lule Saami

From Apertium
Revision as of 13:24, 6 October 2008 by Francis Tyers (talk | contribs)
Jump to navigation Jump to search

Contents

Files

  • apertium-sme-smj.sme.dix — Northern Sami transducer
  • apertium-sme-smj.sme-smj.dix — Transfer lexicon
  • apertium-sme-smj.smj.dix — Lule Sami transducer
  • apertium-sme-smj.sme-smj.rlx — Constraint grammar
  • apertium-sme-smj.sme-smj.t1x — Transfer rule file (level 1 -- Local re-ordering, chunking)
  • apertium-sme-smj.sme-smj.t2x — Transfer rule file (level 2 -- Phrase and chunk re-ordering)
  • apertium-sme-smj.sme-smj.t3x — Transfer rule file (level 3 -- Final touches)

TODO

  • Mapped tags in the CG use special characters in Apertium, for example '>' (used for delimiting tags) and '-'. These should be replaced somehow.
Example:
^Wikipedia<N><Prop><Sg><Nom><@SUBJ>>$
This comes from the CG tag @SUBJ>
  • Re-train the HMM-based POS tagger on a Sami corpus.