Difference between revisions of "Icelandic and English"

From Apertium
Jump to navigation Jump to search
Line 4: Line 4:
* Tag a corpus with IceTagger and train the <code>apertium-tagger</code>
* Tag a corpus with IceTagger and train the <code>apertium-tagger</code>
* Post-edit automatically-generated bilingual dictionaries
* Post-edit automatically-generated bilingual dictionaries
* Use IceParser to parse a corpus and extract the most frequent phrase patterns.
* Merge analysed corpus (IceMorphy full-form list) with Apertium dictionary &mdash; will require matching partial information to paradigms... perhaps use [[extract]] ?


==Resources==
==Resources==
Line 18: Line 20:
** And [http://www.northvegr.org/vigfusson/index.php here]
** And [http://www.northvegr.org/vigfusson/index.php here]
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown)
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown)




[[Category:Language pairs]]
[[Category:Language pairs]]

Revision as of 23:44, 7 February 2009

Pending tasks

  • Tag a corpus with IceTagger and train the apertium-tagger
  • Post-edit automatically-generated bilingual dictionaries
  • Use IceParser to parse a corpus and extract the most frequent phrase patterns.
  • Merge analysed corpus (IceMorphy full-form list) with Apertium dictionary — will require matching partial information to paradigms... perhaps use extract ?

Resources

Corpora

  • Mediawiki l10n, KDE4, OpenSubtitles, etc. — from OPUS (~60k sentences)

Bilingual dictionaries