Difference between revisions of "North Germanic languages"

From Apertium
(3 intermediate revisions by the same user not shown)
Line 8: Line 8:
 
Text in ''italic'' denotes an unreleased pair.
 
   
  +
{| style="text-align: center;" class="wikitable"
{{North Germanic language translations}}
 
  +
|- style="background: #ececec"
 
! !! dan !! fao !! isl !! nor !! nob !! nno !! swe
 
|-
  +
| '''dan''' || — || ''[[fao-dan]]'' || || [[dan-nor]] || — || — || [[dan-swe]]
 
|-
  +
| '''fao''' || ''[[fao-dan]]'' || — || ''[[fao-isl]]'' || ''[[fao-nor]]'' || — || — ||
 
|-
  +
| '''isl''' || || ''[[fao-isl]]'' || — || || — || — || [[isl-swe]]
 
|-
  +
| '''nor''' || [[dan-nor]] || ''[[fao-nor]]'' || || — || — || — || [[swe-nor]]
 
|-
  +
| '''nob''' || — || — || — || — || — || [[nno-nob]] || —
  +
|-
  +
| '''nno''' || — || — || — || — || [[nno-nob]] || — || —
 
|-
  +
| '''swe''' || [[dan-swe]] || || [[isl-swe]] || [[swe-nor]] || — || — || —
 
|-
 
|}
  +
   
 
==Existing==
 
Line 17: Line 36:
 
! Language !! File !! Paradigms !! Lemmata
 
 
|-
 
|-
| Norwegian Nynorsk || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nno/apertium-nno.nno.dix apertium-nno.nno.dix] || 770 || 83,584
+
| Norwegian Nynorsk || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nno/apertium-nno.nno.dix apertium-nno.nno.dix] || 770 || {{#lst:apertium-nno/stats|stems}}
 
|-
 
|-
| Norwegian Bokmål || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nob/apertium-nob.nob.dix apertium-nob.nob.dix] || 705 || 119,567
+
| Norwegian Bokmål || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nob/apertium-nob.nob.dix apertium-nob.nob.dix] || 705 || {{#lst:apertium-nob/stats|stems}}
 
|-
 
|-
| Swedish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-swe/apertium-swe.swe.dix apertium-swe.swe.dix] || 277 || 5,177
+
| Swedish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-swe/apertium-swe.swe.dix apertium-swe.swe.dix] || 277 || {{#lst:apertium-swe/stats|stems}}
 
|-
 
|-
| Danish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-dan/apertium-dan.dan.dix apertium-dan.dan.dix] || 341 || 10,709
+
| Danish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-dan/apertium-dan.dan.dix apertium-dan.dan.dix] || 341 || {{#lst:apertium-dan/stats|stems}}
 
|-
 
|-
| Faroese || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-fao/apertium-fao.fao.dix apertium-fao.fao.dix] || 113 || 1,864
+
| Icelandic || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-isl/apertium-isl.isl.dix apertium-isl.isl.dix] || 1,881 || {{#lst:apertium-isl/stats|stems}}
 
|-
 
|-
| Icelandic || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-isl/apertium-isl.isl.dix apertium-isl.isl.dix] || 1,881 || 9,134
+
| Faroese || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-fao/apertium-fao.fao.dix apertium-fao.fao.dix] || 113 || {{#lst:apertium-fao/stats|stems}}
 
|-
 
|-
 
|}
 
|}
Line 34: Line 53:
   
 
Resources listed below will be useful in building machine translation systems for these languages.
 
 
;Monolingual
 
 
{|class=wikitable
 
! Language !! Resource !! Description !! See also
 
|-
 
| Norwegian || [http://www.edd.uio.no/prosjekt/ordbanken/ Norsk ordbank] || Large >100,000 lemma morphological dictionary of both Nynorsk and Bokmål, GPL. || [[Norsk ordbank]], [[Norwegian]]
 
|-
 
| Norwegian || [http://maximos.aksis.uib.no/Aksis-wiki/Oslo-Bergen_Tagger Oslo-Bergen tagger] || Constraint grammar tagger for Norwegian, GPL. (converted for CG-3) || [[Norwegian]]
 
|-
 
| Swedish || [http://w3.msi.vxu.se/~nivre/research/Talbanken05.html Talbanken] || A 300,000-word tree-bank: it is in XML, all words are nicely tagged with PAROLE-style tags. ||
 
|-
 
| Swedish || [http://spraakbanken.gu.se/sal/eng/ SALDO] || Swedish inflectional lexicon, LGPL ||
 
|-
 
| Danish || [http://www.isv.cbs.dk/~mbk/treebank/ Danish Dependency Treebank] || Danish tree bank, 100,000-word, XML, PAROLE tagged, under the GPL. ||
 
|-
 
| Danish || [http://wordnet.dk/dannet/menu?item=0&lang=1 DanNet] || Danish WordNet (~32,000 words), MIT licensed. ||
 
|-
 
| Icelandic || || || [[Icelandic and English]]
 
|-
 
| Faroese || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.fo.rlx apertium-fo-is.fo.rlx] || A [[constraint grammar]] for morphological disambiguation with ~120 rules ||
 
|-
 
|}
 
   
 
;Bilingual
 
Line 67: Line 63:
 
|Icelandic—Faroese || Apertium bidix with ~30 entries || ||
 
 
|-
 
|-
|Norwegian (Nynorsk)—Norwegian (Bokmål) || Apertium bidix with ~36,000 entries || || [[Norwegian]]
 
|-
 
|Swedish—Danish || Apertium bidix with ~2,000 entries || || [[Swedish and Danish]]
 
 
|}
 
|}
   

Revision as of 13:13, 16 April 2017


The North Germanic languages include Danish (dan), Faroese (fao), Icelandic (isl), Norwegian (Nynorsk, nno, and Bokmål, nob) and Swedish (swe). They are closely related, with varying degrees of mutual intelligibility, which makes them a natural group for Apertium translation systems.

Status

Text in italic denotes an unreleased pair.

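{| style="text-align: center;" class="wikitable"
|- style="background: #ececec"
! !! dan !! fao !! isl !! nor !! nob !! nno !! swe
|-
| '''dan''' || — || ''[[fao-dan]]'' || || [[dan-nor]] || — || — || [[dan-swe]]
|-
| '''fao''' || ''[[fao-dan]]'' || — || ''[[fao-isl]]'' || ''[[fao-nor]]'' || — || — ||
|-
| '''isl''' || || ''[[fao-isl]]'' || — || || — || — || [[isl-swe]]
|-
| '''nor''' || [[dan-nor]] || ''[[fao-nor]]'' || || — || — || — || [[swe-nor]]
|-
| '''nob''' || — || — || — || — || — || [[nno-nob]] || —
|-
| '''nno''' || — || — || — || — || [[nno-nob]] || — || —
|-
| '''swe''' || [[dan-swe]] || || [[isl-swe]] || [[swe-nor]] || — || — || —
|-
|}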


Existing

Dictionaries
See also: List of dictionaries
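{|class=wikitable
! Language !! File !! Paradigms !! Lemmata
|-
| Norwegian Nynorsk || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nno/apertium-nno.nno.dix apertium-nno.nno.dix] || 770 || 182,497
|-
| Norwegian Bokmål || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-nob/apertium-nob.nob.dix apertium-nob.nob.dix] || 705 || 246,281
|-
| Swedish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-swe/apertium-swe.swe.dix apertium-swe.swe.dix] || 277 || 138,490
|-
| Danish || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-dan/apertium-dan.dan.dix apertium-dan.dan.dix] || 341 || 52,133
|-
| Icelandic || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-isl/apertium-isl.isl.dix apertium-isl.isl.dix] || 1,881 || 8,770
|-
| Faroese || [https://svn.code.sf.net/p/apertium/svn/languages/apertium-fao/apertium-fao.fao.dix apertium-fao.fao.dix] || 113 || 2,318
|-
|}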
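
The Paradigms and Lemmata columns describe the contents of each monolingual .dix file. As a rough illustration only, the sketch below counts <code>pardef</code> elements and distinct <code>lm</code> attributes in a tiny, made-up dictionary. The sample XML, the function name <code>dix_stats</code>, and the counting rule are assumptions for this example; the figures in the table come from the per-language stats pages and may be computed differently.

```python
import xml.etree.ElementTree as ET

# Tiny stand-in for a monolingual .dix file (hypothetical content, not taken
# from any real Apertium dictionary).
SAMPLE_DIX = """<dictionary>
  <pardefs>
    <pardef n="hus__n">
      <e><p><l></l><r><s n="n"/></r></p></e>
    </pardef>
    <pardef n="bil__n">
      <e><p><l></l><r><s n="n"/></r></p></e>
    </pardef>
  </pardefs>
  <section id="main" type="standard">
    <e lm="hus"><i>hus</i><par n="hus__n"/></e>
    <e lm="bil"><i>bil</i><par n="bil__n"/></e>
    <e lm="bil"><i>bil</i><par n="bil__n"/></e>
  </section>
</dictionary>"""


def dix_stats(xml_text):
    """Return (paradigm count, distinct lemma count) for a .dix document."""
    root = ET.fromstring(xml_text)
    paradigms = root.findall("./pardefs/pardef")
    # Only <e> elements directly inside <section> are treated as entries;
    # <e> elements inside <pardef> are paradigm internals and are skipped.
    lemmas = {e.get("lm") for e in root.findall("./section/e") if e.get("lm")}
    return len(paradigms), len(lemmas)


paradigm_count, lemma_count = dix_stats(SAMPLE_DIX)
print(paradigm_count, lemma_count)
```

To try this on a real dictionary, the file contents could be read into a string and passed to <code>dix_stats</code>. Counting distinct <code>lm</code> values is one possible definition of "lemmata"; counting every entry, or counting stems, would give different numbers.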

Resources

Resources listed below will be useful in building machine translation systems for these languages.

Bilingual
{|class=wikitable
! Language pair !! Resource !! Description !! See also
|-
|Icelandic—Danish || Apertium bidix with ~960 entries || ||
|-
|Icelandic—Faroese || Apertium bidix with ~30 entries || ||
|-
|}

Funding possibilities

Samples

{|class=wikitable
! Language !! Text
|-
| Danish || Alle mennesker er født frie og lige i værdighed og rettigheder. De er udstyret med fornuft og samvittighed, og de bør handle mod hverandre i en broderskabets ånd.
|-
| Norwegian (Bokmål) || Alle mennesker er født frie og med samme menneskeverd og menneskerettigheter. De er utstyrt med fornuft og samvittighet og bør handle mot hverandre i brorskapets ånd.
|-
| Norwegian (Nynorsk) || Alle menneske er fødde til fridom og med same menneskeverd og menneskerettar. Dei har fått fornuft og samvit og skal leve med kvarandre som brør.
|-
| Swedish || Alla människor är födda fria och lika i värde och rättigheter. De har utrustats med förnuft och samvete och bör handla gentemot varandra i en anda av gemenskap.
|-
| Faroese || Øll menniskju eru fødd fræls og jøvn til virðingar og mannarættindi. Tey hava skil og samvitsku og eiga at fara hvørt um annað í bróðuranda.
|-
| Icelandic || Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum. Menn eru gæddir vitsmunum og samvizku, og ber þeim að breyta bróðurlega hverjum við annan.
|-
|}