Difference between revisions of "Icelandic and English"
Jump to navigation
Jump to search
Line 21: | Line 21: | ||
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown) |
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown) |
||
==See also== |
|||
* [[/Regression tests|Regression tests]] |
|||
Revision as of 23:45, 7 February 2009
Pending tasks
- Tag a corpus with IceTagger and train the
apertium-tagger
- Post-edit automatically-generated bilingual dictionaries
- Use IceParser to parse a corpus and extract the most frequent phrase patterns.
- Merge analysed corpus (IceMorphy full-form list) with Apertium dictionary — will require matching partial information to paradigms... perhaps use extract ?
Resources
Corpora
- Mediawiki l10n, KDE4, OpenSubtitles, etc. — from OPUS (~60k sentences)
Bilingual dictionaries
- Wikipedia interwiki (~1100 entries)
- Freelang (~1000 entries)
- An Icelandic-English Dictionary (Old Icelandic, 1876 — Public Domain)
- And here
- Wordbank at ismal.hi.is (licence unknown)