Difference between revisions of "North Germanic languages"

From Apertium
(Undo revision 22334 by Francis Tyers (Talk))
{{TOCD}}
The '''North Germanic languages''' include Danish (<code>da</code>), Faroese (<code>fo</code>), Icelandic (<code>is</code>), Norwegian (Nynorsk, <code>nn</code>, and Bokmål, <code>nb</code>) and Swedish (<code>sv</code>). The languages are closely related, with varying levels of mutual intelligibility, which makes them good candidates for a family of Apertium translation systems.

==Status==

Pairs shown in ''italics'' are unreleased.

{| style="text-align: center;" class="wikitable"
|- style="background: #ececec"
! !! da !! fo !! is !! nb !! nn !! sv
|-
| '''da''' || &mdash; || || || || || [[da-sv]]
|-
| '''fo''' || || &mdash; || ''[[fo-is]]'' || || ||
|-
| '''is''' || || ''[[fo-is]]'' || &mdash; || || ||
|-
| '''nb''' || || || || &mdash; || [[nn-nb]] ||
|-
| '''nn''' || || || || [[nn-nb]] || &mdash; ||
|-
| '''sv''' || [[da-sv]] || || || || || &mdash;
|-
|}

==Existing==

;Dictionaries
{{see-also|List of dictionaries}}
{|class="wikitable sortable"
! Language !! File !! Paradigms !! Lemmata
|-
| Norwegian Nynorsk || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-nn-nb/apertium-nn-nb.nn.dix apertium-nn-nb.nn.dix] || 770 || 83,584
|-
| Norwegian Bokmål || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-nn-nb/apertium-nn-nb.nb.dix apertium-nn-nb.nb.dix] || 705 || 119,567
|-
| Swedish || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-sv-da/apertium-sv-da.sv.dix apertium-sv-da.sv.dix] || 277 || 5,177
|-
| Danish || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-sv-da/apertium-sv-da.da.dix apertium-sv-da.da.dix] || 341 || 10,709
|-
| Faroese || [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-fo-is.fo.dix apertium-fo-is.fo.dix] || 113 || 1,864
|-
| Icelandic || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-is-en/apertium-is-en.is.dix apertium-is-en.is.dix] || 1,881 || 9,134
|-
|}
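
Figures like those in the table above can be approximated by counting elements in a monolingual dictionary file. The following is a minimal sketch, not the method used to produce the table: it assumes that "paradigms" means <code>pardef</code> elements and that "lemmata" means distinct <code>lm</code> attribute values on entries inside <code>section</code> elements. The function name <code>count_dix</code> and the sample entries are hypothetical.

```python
import xml.etree.ElementTree as ET


def count_dix(xml_text):
    """Return (paradigms, lemmata) for a monolingual .dix document.

    Assumption: paradigms are <pardef> elements, and lemmata are the
    distinct values of the lm attribute on <e> entries in <section>s.
    """
    root = ET.fromstring(xml_text)
    # Every paradigm definition is a <pardef> element under <pardefs>.
    paradigms = len(root.findall(".//pardef"))
    # Entries inside paradigms carry no lm attribute, so restrict the
    # search to entries that are direct children of a <section>.
    lemmata = {e.get("lm") for e in root.findall(".//section/e") if e.get("lm")}
    return paradigms, len(lemmata)


# Hypothetical miniature dictionary, for illustration only.
SAMPLE = """<dictionary>
  <pardefs>
    <pardef n="hus__n"><e><p><l/><r><s n="n"/></r></p></e></pardef>
    <pardef n="bil__n"><e><p><l/><r><s n="n"/></r></p></e></pardef>
  </pardefs>
  <section id="main" type="standard">
    <e lm="hus"><i>hus</i><par n="hus__n"/></e>
    <e lm="bil"><i>bil</i><par n="bil__n"/></e>
    <e lm="bil"><i>bil</i><par n="hus__n"/></e>
  </section>
</dictionary>"""

paradigm_count, lemma_count = count_dix(SAMPLE)
```

For a real dictionary, read the file contents and pass them to <code>count_dix</code>. Counting every entry rather than distinct lemmata would give a higher number wherever a lemma has more than one entry, as with the repeated entry in the sample.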

==Resources==

The resources listed below may be useful for building machine translation systems for these languages.

;Monolingual

{|class=wikitable
! Language !! Resource !! Description !! See also
|-
| Norwegian || [http://www.edd.uio.no/prosjekt/ordbanken/ Norsk ordbank] || Large morphological dictionary (>100,000 lemmata) covering both Nynorsk and Bokmål, GPL. || [[Norsk ordbank]], [[Norwegian]]
|-
| Norwegian || [http://maximos.aksis.uib.no/Aksis-wiki/Oslo-Bergen_Tagger Oslo-Bergen tagger] || Constraint grammar tagger for Norwegian, GPL (converted for CG-3). || [[Norwegian]]
|-
| Swedish || [http://w3.msi.vxu.se/~nivre/research/Talbanken05.html Talbanken] || A 300,000-word treebank in XML; all words are tagged with PAROLE-style tags. ||
|-
| Swedish || [http://spraakbanken.gu.se/sal/eng/ SALDO] || Swedish inflectional lexicon, LGPL ||
|-
| Danish || [http://www.isv.cbs.dk/~mbk/treebank/ Danish Dependency Treebank] || Danish treebank of 100,000 words in XML, PAROLE-tagged, GPL. ||
|-
| Danish || [http://wordnet.dk/dannet/menu?item=0&lang=1 DanNet] || Danish WordNet (~32,000 words), MIT licensed. ||
|-
| Icelandic || || || [[Icelandic and English]]
|-
| Faroese || [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.fo.rlx apertium-fo-is.fo.rlx] || A [[constraint grammar]] for morphological disambiguation with ~120 rules ||
|-
|}

;Bilingual

{|class=wikitable
! Language pair !! Resource !! Description !! See also
|-
|Icelandic&mdash;Danish || Apertium bidix with ~960 entries || ||
|-
|Icelandic&mdash;Faroese || Apertium bidix with ~30 entries || ||
|-
|Norwegian (Nynorsk)&mdash;Norwegian (Bokmål) || Apertium bidix with ~36,000 entries || || [[Norwegian]]
|-
|Swedish&mdash;Danish || Apertium bidix with ~2,000 entries || || [[Swedish and Danish]]
|}

==Funding possibilities==

* [http://www.norden.org/start/start.asp Nordic Council]

==Samples==

{|class=wikitable
! Language !! Text
|-
| Danish || Alle mennesker er født frie og lige i værdighed og rettigheder. De er udstyret med fornuft og samvittighed, og de bør handle mod hverandre i en broderskabets ånd.
|-
| Norwegian&nbsp;(Bokmål) || Alle mennesker er født frie og med samme menneskeverd og menneskerettigheter. De er utstyrt med fornuft og samvittighet og bør handle mot hverandre i brorskapets ånd.
|-
| Norwegian&nbsp;(Nynorsk) || Alle menneske er fødde til fridom og med same menneskeverd og menneskerettar. Dei har fått fornuft og samvit og skal leve med kvarandre som brør.
|-
| Swedish || Alla människor är födda fria och lika i värde och rättigheter. De har utrustats med förnuft och samvete och bör handla gentemot varandra i en anda av gemenskap.
|-
| Faroese || Øll menniskju eru fødd fræls og jøvn til virðingar og mannarættindi. Tey hava skil og samvitsku og eiga at fara hvørt um annað í bróðuranda.
|-
| Icelandic || Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum. Menn eru gæddir vitsmunum og samvizku, og ber þeim að breyta bróðurlega hverjum við annan.
|-
|}

[[Category:Languages]]
[[Category:North Germanic languages]]

Revision as of 11:39, 17 November 2010