North Germanic languages

From Apertium
Revision as of 08:22, 27 February 2008 by Francis Tyers (talk | contribs)
Jump to navigation Jump to search

The North Germanic languages include Danish (da), Faroese (fo), Icelandic (is), Norwegian (Nynorsk, nn and Bokmål, nb) and Swedish (sv). The languages are related with varying levels of mutual intelligibility. This group would make a nice group for Apertium systems.

Existing pairs

  • apertium-sv-da
  • apertium-nn-nb

Resources

Resources listed below will be useful in building machine translation systems for these languages.

Monolingual
Language Resource Description See also
Norwegian Norsk ordbank Large >100,000 lemma morphological dictionary of both Nynorsk and Bokmål, GPL. Norsk ordbank
Swedish Talbanken A 300,000-word tree-bank: it is in XML, all words are nicely tagged with PAROLE-style tags.
Danish Danish Dependency Treebank Danish tree bank, 100,000-word, XML, PAROLE tagged, under the GPL.
Icelandic
Faroese
Bilingual

Funding possibilities