Difference between revisions of "Icelandic and English"

From Apertium
 
(30 intermediate revisions by 4 users not shown)
Line 2: Line 2:
 
==Pending tasks==
* Try to convert some IceTagger constraint rules to work in [[constraint grammar]].
* Tag a corpus with IceTagger and use it to train the <code>apertium-tagger</code>.
* Post-edit automatically generated bilingual dictionaries.
* Use IceParser to parse a corpus and extract the most frequent phrase patterns.
* Merge the analysed corpus (IceMorphy full-form list) with the Apertium dictionary. This will require matching partial information to paradigms, perhaps using [[extract]].
 
   
==Notes==
* ind(is) → def(en), i.e. nouns that are indefinite in Icelandic but take the definite article in English: almenningur, alþjóð, alþýða, heimur, stjórnarandstaða, bæjarstjórn, Ermarsund, nefnifall

==Resources==
===Corpora===
* Mediawiki l10n, KDE4, OpenSubtitles, etc., from OPUS (~60k sentences)

===Bilingual dictionaries===
Line 21: Line 19:
 
** And [http://www.northvegr.org/vigfusson/index.php here]
* Wordbank [http://www.ismal.hi.is/ob/birta/index.cgi at ismal.hi.is] (licence unknown)
** was moved to [http://herdubreid.rhi.hi.is:1026/wordbank/search here]

==Example phrase==
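* Hver maður er borinn frjáls og jafn öðrum að virðingu og réttindum.
** Literally: "Every person is born free and equal to others in dignity and rights." (the opening of Article 1 of the Universal Declaration of Human Rights)

===IceFormat===
<pre>
Hver foken maður nken er sfg3en
borinn sþgken frjáls lkensf
og c jafn aa öðrum fokfþ að c
virðingu nveþ og c réttindum nhfþ . .
</pre>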
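
The tagger output above alternates word and tag, separated by whitespace. A minimal Python sketch for reading it into (word, tag) pairs follows. It assumes no token contains a space, and the names <code>read_tagged</code> and <code>SAMPLE</code> are made up for illustration; they are not part of IceTagger.

```python
SAMPLE = """\
Hver foken maður nken er sfg3en
borinn sþgken frjáls lkensf
og c jafn aa öðrum fokfþ að c
virðingu nveþ og c réttindum nhfþ . .
"""


def read_tagged(text):
    """Split 'word tag word tag ...' output into (word, tag) pairs.

    Assumption: words and tags never contain whitespace, so the
    tokens strictly alternate between a word and its tag.
    """
    tokens = text.split()
    if len(tokens) % 2 != 0:
        raise ValueError("expected an even number of tokens (word/tag pairs)")
    return list(zip(tokens[0::2], tokens[1::2]))


if __name__ == "__main__":
    for word, tag in read_tagged(SAMPLE):
        print(f"{word}\t{tag}")
```

Multi-word tokens, if the tagger emits any, would break the alternation and need separate handling.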

<pre>
{*SUBJ> [NP Hver foken maður nken NP] *SUBJ>}
[VPb er sfg3en VPb]
{*COMP< [VPp borinn sþgken VPp] *COMP<}
{*COMP< [APs [AP frjáls lkensf AP] [CP og c CP] [AP jafn lkensf AP] APs] *COMP<}
[NP öðrum fokfþ NP]
[SCP að c SCP]
[NPs [NP virðingu nveþ NP] [CP og c CP] [NP réttindum nhfþ NP] NPs]
</pre>
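
One of the pending tasks is to extract the most frequent phrase patterns from a parsed corpus. The sketch below shows one possible way to do that in Python, working only from the bracket notation visible in the parser output above. It assumes that an opening bracket is a token of the form <code>[LABEL</code>, a closing bracket is a token of the form <code>LABEL]</code>, and that no ordinary word starts with <code>[</code> or ends with <code>]</code>. The function names are hypothetical.

```python
from collections import Counter

SAMPLE = """\
{*SUBJ> [NP Hver foken maður nken NP] *SUBJ>}
[VPb er sfg3en VPb]
{*COMP< [VPp borinn sþgken VPp] *COMP<}
{*COMP< [APs [AP frjáls lkensf AP] [CP og c CP] [AP jafn lkensf AP] APs] *COMP<}
[NP öðrum fokfþ NP]
[SCP að c SCP]
[NPs [NP virðingu nveþ NP] [CP og c CP] [NP réttindum nhfþ NP] NPs]
"""


def top_level_phrases(parsed):
    """Return the labels of the outermost bracketed phrases, in order."""
    labels = []
    depth = 0
    for token in parsed.split():
        if token.startswith("[") and len(token) > 1:
            # Opening bracket: record the label only if not nested.
            if depth == 0:
                labels.append(token[1:])
            depth += 1
        elif token.endswith("]") and len(token) > 1:
            # Closing bracket: leave one nesting level.
            depth -= 1
    return labels


def count_patterns(sentences):
    """Count how often each top-level phrase sequence occurs."""
    return Counter(" ".join(top_level_phrases(s)) for s in sentences)


if __name__ == "__main__":
    print(top_level_phrases(SAMPLE))
    print(count_patterns([SAMPLE]).most_common(5))
```

Function markers such as <code>{*SUBJ></code> are ignored here; a fuller version might keep them as part of the pattern.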

===Apertium===
<pre>
^Hver<prn><ind><m><sg><nom>$ ^maður<n><m><sg><nom><ind>$ ^vera<vbser><pri><p3><sg>$
^bera<vblex><pp><m><sg><nom>$ ^frjáls<adj><sta><pst><m><sg><nom>$
^og<cnjcoo>$ ^jafn<adj><sta><pst><m><sg><nom>$
^annar<prn><ind><m><pl><dat>$ ^að<pr>$
^virðing<n><f><sg><dat><def>$ ^og<cnjcoo>$ ^réttindi<n><nt><pl><dat><ind>$ ^.<sent>$
</pre>
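
Comparing the noun tags in the IceFormat output with the Apertium analysis above suggests a fairly direct positional mapping (for example <code>nken</code> against <code><n><m><sg><nom><ind></code>). The following Python sketch covers nouns only. The letters seen in the example are <code>k</code>, <code>v</code>, <code>h</code>, <code>e</code>, <code>f</code>, <code>n</code> and <code>þ</code>; the values for accusative (<code>o</code>), genitive (<code>e</code>) and the suffixed article (<code>g</code>) are assumptions taken from the usual Icelandic tagset conventions, not from this page. The function name is hypothetical.

```python
# Positional letters of an IceTagger noun tag: n + gender + number + case (+ g)
GENDER = {"k": "m", "v": "f", "h": "nt"}
NUMBER = {"e": "sg", "f": "pl"}
# "o" (accusative) and "e" (genitive) are assumed; only n and þ occur in the example.
CASE = {"n": "nom", "o": "acc", "þ": "dat", "e": "gen"}


def noun_tag_to_apertium(ice_tag):
    """Convert an IceTagger noun tag such as 'nken' to Apertium-style tags."""
    if len(ice_tag) < 4 or ice_tag[0] != "n":
        raise ValueError(f"not a noun tag: {ice_tag!r}")
    gender, number, case = ice_tag[1], ice_tag[2], ice_tag[3]
    # Assumption: a fifth letter 'g' marks the suffixed definite article.
    definiteness = "def" if ice_tag[4:5] == "g" else "ind"
    tags = ["n", GENDER[gender], NUMBER[number], CASE[case], definiteness]
    return "".join(f"<{t}>" for t in tags)


if __name__ == "__main__":
    for tag in ("nken", "nhfþ", "nveþ"):
        print(tag, noun_tag_to_apertium(tag))
```

For <code>virðingu nveþ</code> this mapping gives <code><ind></code>, whereas the analysis above shows <code><def></code>; the page does not say which of the two is intended.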

<pre>
^prn_nom<SN><@SUBJ→>{^Hver<prn><ind><m><sg><nom>$ ^maður<n><m><sg><nom><ind>$}$
^verb<SV>{^vera<vbser><pri><p3><sg>$ ^bera<vblex><pp><m><sg><nom>$}$
^adj_cc_adj<SA>{^frjáls<adj><sta><pst><m><sg><nom>$ ^og<cnjcoo>$ ^jafn<adj><sta><pst><m><sg><nom>$}$
^nom<SN>{^annar<prn><ind><m><pl><dat>$}$
^að<Prep>{^að<pr>$}$
^nom_cc_nom{^virðing<n><f><sg><dat><def>$ ^og<cnjcoo>$ ^réttindi<n><nt><pl><dat><ind>$}$
</pre>
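
For comparing the two analyses programmatically, the chunked output above can be read with a couple of regular expressions. This Python sketch is a simplification: it assumes unambiguous lexical units (no <code>/</code> alternatives), no escaped characters, and no nested chunks, which holds for the example but not for Apertium streams in general. The names <code>parse_units</code> and <code>parse_chunks</code> are made up for illustration.

```python
import re

# ^name<tag>...{ ...lexical units... }$
CHUNK = re.compile(r"\^([^<{]+)((?:<[^>]+>)*)\{(.*?)\}\$")
# ^lemma<tag>...$
UNIT = re.compile(r"\^([^<$]+)((?:<[^>]+>)*)\$")
TAG = re.compile(r"<([^>]+)>")


def parse_units(text):
    """Return (lemma, [tags]) for every lexical unit found in text."""
    return [(lemma, TAG.findall(tags)) for lemma, tags in UNIT.findall(text)]


def parse_chunks(text):
    """Return (chunk name, [chunk tags], [lexical units]) for every chunk."""
    return [
        (name, TAG.findall(tags), parse_units(body))
        for name, tags, body in CHUNK.findall(text)
    ]


if __name__ == "__main__":
    line = "^nom<SN>{^annar<prn><ind><m><pl><dat>$}$"
    for name, tags, units in parse_chunks(line):
        print(name, tags, units)
```

<code>parse_units</code> also works on the unchunked analysis, since it simply skips anything that is not a complete <code>^...$</code> unit.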
   
 
==See also==
 
* [[/Pending tests|Pending tests]]: examples for testing new rules
* [[/Regression tests|Regression tests]]: examples of working phrase translations

==External links==
* [http://iceblark.wordpress.com/translation/ is-en MT entry on the IceBLARK blog]

[[Category:Icelandic and English]]

Latest revision as of 12:31, 17 June 2010
