Icelandic and English
Revision as of 23:45, 7 February 2009 by 89.129.232.136
Pending tasks
- Tag a corpus with IceTagger and train the apertium-tagger
- Post-edit automatically-generated bilingual dictionaries
- Use IceParser to parse a corpus and extract the most frequent phrase patterns.
- Merge the analysed corpus (IceMorphy full-form list) with the Apertium dictionary. This will require matching partial information to paradigms; perhaps use extract?
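The phrase-pattern task above could be approached roughly as follows. This is a minimal sketch, assuming the IceParser output has already been reduced to one list of phrase labels per sentence; the label sequences below are hand-made placeholders, not real IceParser output, and `phrase_pattern_counts` is a hypothetical helper name.

```python
from collections import Counter

def phrase_pattern_counts(sentences, n=2):
    """Count n-grams of phrase labels over all sentences."""
    counts = Counter()
    for labels in sentences:
        # Slide a window of length n over the label sequence.
        for i in range(len(labels) - n + 1):
            counts[tuple(labels[i:i + n])] += 1
    return counts

# Hypothetical label sequences (assumption: one list per parsed sentence).
sample = [
    ["NP", "VP", "NP"],
    ["NP", "VP", "PP"],
    ["AdvP", "NP", "VP", "NP"],
]

# The two most frequent label bigrams in the sample.
top = phrase_pattern_counts(sample, n=2).most_common(2)
print(top)
```

Raising `n` gives longer patterns, at the cost of sparser counts on a corpus of this size.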
Resources
Corpora
- MediaWiki l10n, KDE4, OpenSubtitles, etc., from OPUS (~60k sentences)
Bilingual dictionaries
- Wikipedia interwiki (~1100 entries)
- Freelang (~1000 entries)
- An Icelandic-English Dictionary (Old Icelandic, 1876 — Public Domain)
- And here
- Wordbank at ismal.hi.is (licence unknown)
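Word pairs taken from lists like these could be turned into draft Apertium bilingual dictionary entries for later post-editing. The following is a minimal sketch, assuming single-word lemmas and a known part-of-speech tag for each pair; the sample pairs and the `bidix_entry` helper are illustrative assumptions, and multiword lemmas are not handled.

```python
import xml.etree.ElementTree as ET

def bidix_entry(left, right, pos):
    """Build one <e><p><l>..</l><r>..</r></p></e> entry element."""
    e = ET.Element("e")
    p = ET.SubElement(e, "p")
    for tag, lemma in (("l", left), ("r", right)):
        side = ET.SubElement(p, tag)
        side.text = lemma
        # Part-of-speech symbol, e.g. n for noun.
        ET.SubElement(side, "s", n=pos)
    return e

# Hypothetical sample pairs (Icelandic lemma, English lemma, tag).
pairs = [("hestur", "horse", "n"), ("hundur", "dog", "n")]
entries = [bidix_entry(*pair) for pair in pairs]

for entry in entries:
    print(ET.tostring(entry, encoding="unicode"))
```

Entries produced this way are only candidates: the tags and translations still need manual checking before they go into the dictionary.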