Difference between revisions of "Icelandic and English"
Jump to navigation
Jump to search
Line 21: | Line 21: | ||
** And [http://www.northvegr.org/vigfusson/index.php here] |
** And [http://www.northvegr.org/vigfusson/index.php here] |
||
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown) |
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown) |
||
==Example phrase== |
|||
* Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum. |
|||
==See also== |
==See also== |
Revision as of 01:07, 8 February 2009
Pending tasks
- Tag a corpus with IceTagger and train the
apertium-tagger
- Post-edit automatically-generated bilingual dictionaries
- Use IceParser to parse a corpus and extract the most frequent patterns in terms of chunks/phrases (lists of coarse POS tags) and phrase patterns (in terms of chunks/phrases).
- Merge analysed corpus (IceMorphy full-form list) with Apertium dictionary — will require matching partial information to paradigms... perhaps use extract ?
Resources
Corpora
- Mediawiki l10n, KDE4, OpenSubtitles, etc. — from OPUS (~60k sentences)
Bilingual dictionaries
- Wikipedia interwiki (~1,100 entries)
- Freelang (~1,000 entries)
- Wiktionary (en) (~3,200 entries)
- An Icelandic-English Dictionary (Old Icelandic, 1876 — Public Domain)
- And here
- Wordbank at ismal.hi.is (licence unknown)
Example phrase
- Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum.