Difference between revisions of "North Germanic languages"
Jump to navigation
Jump to search
Line 17: | Line 17: | ||
! Language !! File !! Paradigms !! Lemmata |
! Language !! File !! Paradigms !! Lemmata |
||
|- |
|- |
||
| Norwegian Nynorsk || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nno/apertium-nno.nno.dix apertium-nno.nno.dix] || 770 || |
| Norwegian Nynorsk || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nno/apertium-nno.nno.dix apertium-nno.nno.dix] || 770 || {{#lst:apertium-nno/stats|stems}} |
||
|- |
|- |
||
| Norwegian Bokmål || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nob/apertium-nob.nob.dix apertium-nob.nob.dix] || 705 || |
| Norwegian Bokmål || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nob/apertium-nob.nob.dix apertium-nob.nob.dix] || 705 || {{#lst:apertium-nob/stats|stems}} |
||
|- |
|- |
||
| Swedish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-swe/apertium-swe.swe.dix apertium-swe.swe.dix] || 277 || |
| Swedish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-swe/apertium-swe.swe.dix apertium-swe.swe.dix] || 277 || {{#lst:apertium-swe/stats|stems}} |
||
|- |
|- |
||
| Danish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-dan/apertium-dan.dan.dix apertium-dan.dan.dix] || 341 || |
| Danish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-dan/apertium-dan.dan.dix apertium-dan.dan.dix] || 341 || {{#lst:apertium-dan/stats|stems}} |
||
|- |
|- |
||
| Faroese || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-fao/apertium-fao.fao.dix apertium-fao.fao.dix] || 113 || |
| Faroese || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-fao/apertium-fao.fao.dix apertium-fao.fao.dix] || 113 || {{#lst:apertium-fao/stats|stems}} |
||
|- |
|- |
||
| Icelandic || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-isl/apertium-isl.isl.dix apertium-isl.isl.dix] || 1,881 || |
| Icelandic || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-isl/apertium-isl.isl.dix apertium-isl.isl.dix] || 1,881 || {{#lst:apertium-isl/stats|stems}} |
||
|- |
|- |
||
|} |
|} |
Revision as of 18:54, 7 June 2016
The North Germanic languages include Danish (dan
), Faroese (fao
), Icelandic (isl
), Norwegian (Nynorsk, nno
and Bokmål, nob
) and Swedish (swe
). The languages are related with varying levels of mutual intelligibility. This group would make a nice group for Apertium systems.
Status
Text in italic denotes an unreleased pair.
Template:North Germanic language translations
Existing
- Dictionaries
- See also: List of dictionaries
Language | File | Paradigms | Lemmata |
---|---|---|---|
Norwegian Nynorsk | apertium-nno.nno.dix | 770 | 182,497 |
Norwegian Bokmål | apertium-nob.nob.dix | 705 | 246,281 |
Swedish | apertium-swe.swe.dix | 277 | 138,490 |
Danish | apertium-dan.dan.dix | 341 | 52,133 |
Faroese | apertium-fao.fao.dix | 113 | 2,318 |
Icelandic | apertium-isl.isl.dix | 1,881 | 8,770 |
Resources
Resources listed below will be useful in building machine translation systems for these languages.
- Monolingual
Language | Resource | Description | See also |
---|---|---|---|
Norwegian | Norsk ordbank | Large >100,000 lemma morphological dictionary of both Nynorsk and Bokmål, GPL. | Norsk ordbank, Norwegian |
Norwegian | Oslo-Bergen tagger | Constraint grammar tagger for Norwegian, GPL. (converted for CG-3) | Norwegian |
Swedish | Talbanken | A 300,000-word tree-bank: it is in XML, all words are nicely tagged with PAROLE-style tags. | |
Swedish | SALDO | Swedish inflectional lexicon, LGPL | |
Danish | Danish Dependency Treebank | Danish tree bank, 100,000-word, XML, PAROLE tagged, under the GPL. | |
Danish | DanNet | Danish WordNet (~32,000 words), MIT licensed. | |
Icelandic | Icelandic and English | ||
Faroese | apertium-fo-is.fo.rlx | A constraint grammar for morphological disambiguation with ~120 rules |
- Bilingual
Language pair | Resource | Description | See also |
---|---|---|---|
Icelandic—Danish | Apertium bidix with ~960 entries | ||
Icelandic—Faroese | Apertium bidix with ~30 entries | ||
Norwegian (Nynorsk)—Norwegian (Bokmål) | Apertium bidix with ~36,000 entries | Norwegian | |
Swedish—Danish | Apertium bidix with ~2,000 entries | Swedish and Danish |
Funding possibilities
Samples
Language | Text |
---|---|
Danish | Alle mennesker er født frie og lige i værdighed og rettigheder. De er udstyret med fornuft og samvittighed, og de bør handle mod hverandre i en broderskabets ånd. |
Norwegian (Bokmål) | Alle mennesker er født frie og med samme menneskeverd og menneskerettigheter. De er utstyrt med fornuft og samvittighet og bør handle mot hverandre i brorskapets ånd. |
Norwegian (Nynorsk) | Alle menneske er fødde til fridom og med same menneskeverd og menneskerettar. Dei har fått fornuft og samvit og skal leve med kvarandre som brør. |
Swedish | Alla människor är födda fria och lika i värde och rättigheter. De har utrustats med förnuft och samvete och bör handla gentemot varandra i en anda av gemenskap. |
Faroese | Øll menniskju eru fødd fræls og jøvn til virðingar og mannarættindi. Tey hava skil og samvitsku og eiga at fara hvørt um annað í bróðuranda. |
Icelandic | Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum. Menn eru gæddir vitsmunum og samvizku, og ber þeim að breyta bróðurlega hverjum við annan. |