Difference between revisions of "Specific resources per language"
(Incubator had moved in SVN) |
|||
Line 7: | Line 7: | ||
===Albanian=== |
===Albanian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-mk-sq.sq.dix apertium-mk-sq.sq.dix]'' |
||
;Resources |
;Resources |
||
Line 14: | Line 14: | ||
===Armenian=== |
===Armenian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hy-en.hy.dix apertium-hy-en.hy.dix]'' |
||
;Resources |
;Resources |
||
Line 31: | Line 31: | ||
===Bulgarian=== |
===Bulgarian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-mk-bg.bg.dix apertium-mk-bg.bg.dix]'' |
||
;Resources |
;Resources |
||
Line 39: | Line 39: | ||
===Cornish=== |
===Cornish=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-cy-kw.kw.dix apertium-cy-kw.kw.dix]'' |
||
;Resources |
;Resources |
||
Line 48: | Line 48: | ||
===Czech=== |
===Czech=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-pl-cs.cs.dix.xml apertium-pl-cs.cs.dix.xml]'' |
||
;Resources |
;Resources |
||
Line 57: | Line 57: | ||
===Faroese=== |
===Faroese=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-fo-is.fo.dix apertium-fo-is.fo.dix]'' |
||
;Resources |
;Resources |
||
* [http://giellatekno.uit.no/cgi/d-fao.eng.html U. Tromsø -- Faroese analyser ] |
* [http://giellatekno.uit.no/cgi/d-fao.eng.html U. Tromsø -- Faroese analyser ] |
||
* [http://apertium.svn.sourceforge.net/svnroot/apertium |
* [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-fo-is.fo.rle Faroese Constraint Grammar] |
||
===Finnish=== |
===Finnish=== |
||
Line 85: | Line 85: | ||
===Greek=== |
===Greek=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-en-el.el.dix apertium-en-el.el.dix] |
||
;Resources |
;Resources |
||
Line 112: | Line 112: | ||
===Iranian Persian=== |
===Iranian Persian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa.fa.dix apertium-tg-fa.fa.dix]'' |
||
;Resources |
;Resources |
||
Line 119: | Line 119: | ||
===Lithuanian=== |
===Lithuanian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-en-lt.lt.dix apertium-en-lt.lt.dix]'' |
||
;Resources |
;Resources |
||
===Ossetian=== |
===Ossetian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-os-fa.os.dix apertium-os-fa.os.dix]'' |
||
;Resources |
;Resources |
||
Line 133: | Line 133: | ||
===Piemontese=== |
===Piemontese=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-it-pms.pms.dix apertium-it-pms.pms.dix]'' |
||
;Resources |
;Resources |
||
Line 162: | Line 162: | ||
===Russian=== |
===Russian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-pl-ru.ru.dix.xml monodix]'' |
||
:''Bidix: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Bidix: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-pl-ru.pl-ru.dix.xml Polish-Russian]'' |
||
:''Bidix: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Bidix: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-en-ru.en-ru.dix.xml English-Russian] |
||
;Resources |
;Resources |
||
Line 186: | Line 186: | ||
===Slovakian=== |
===Slovakian=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-pl-sk.sk.dix apertium-pl-sk.sk.dix]'' |
||
;Resources |
;Resources |
||
Line 198: | Line 198: | ||
===Urdu=== |
===Urdu=== |
||
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium |
:''Dictionary: [http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi-ur.ur.dix apertium-hi-ur.ur.dix]'' |
||
;Resources |
;Resources |
Revision as of 04:18, 16 March 2010
The incubator can be found here. It provides a place for people to put dictionaries and other stuff that is useful in constructing language pairs.
Specific resources per language
Here are some links to resources that might be useful for expanding on work in the Incubator. Below you can put resources which will be useful in the construction. Try and mark them for licence, or at least free/non-free.
Albanian
- Dictionary: apertium-mk-sq.sq.dix
- Resources
- http://www.albanianoverview.com/grammar.htm
- http://www.idividi.com.mk/recnik/index.htm -- albanian--macedonian dictionary (non-free)
Armenian
- Dictionary: apertium-hy-en.hy.dix
- Resources
Belarusian
Bengali
- http://bengalinux.sourceforge.net/cgi-bin/anubadok/index.pl -- Free software translation for English→Bengali
- http://anubadok.sf.net/ -- See above
Bulgarian
- Dictionary: apertium-mk-bg.bg.dix
- Resources
Cornish
- Dictionary: apertium-cy-kw.kw.dix
- Resources
Czech
- Dictionary: apertium-pl-cs.cs.dix.xml
- Resources
- Most frequent words Also includes a list of the most frequent bi- and tri-grams, but these are of little use as multiwords
- James Naughton's links
- Some complications with diacritics
- Czech morphological guesser - 'free', but not open source
Faroese
- Dictionary: apertium-fo-is.fo.dix
- Resources
Finnish
- See also: Omorfi
- Resources
- http://kaino.kotus.fi/sanat/nykysuomi/ — full form list for Finnish -- LGPL
- Omorfi–Open Morphology for Finnish language
- Helsinki Finite-State Transducer Technology (HFST)
s = lemma hn = homonymy ref t = inflection info tn = inflection number (referring to table) av = ref to consonant gradation
German - English
German-English bilingual dictionary (>216,000 entries), generated from linguistic data (GPL Version 2 or later) available for "Ding: A Dictionary LookUp program" (version 1.5 2007-04-09) from Frank Richter, Technische Universität Chemnitz
- Dictionary: apertium-de-en.dix
Greek
- Dictionary: apertium-en-el.el.dix
- Resources
- Greek <-> Ukranian, Russian, Polish Grammar & Dictionary: http://ellinika.gnu.org.ua/
Hebrew
- Resources
- http://www.mila.cs.technion.ac.il/english/resources/lexicons/ lexicons for Hebrew, in weird XLS format -- GPL
Hindi
- See also: Hindi
- Resources
- POS tagged English-Hindi wordlist: http://indlinux.sourceforge.net/downloads/files/hindidict.txt.bz2
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-en-hi
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi-en-unicode
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi.hi.dix
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi.hi_WX.dix
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi-ur.hi.dix
- https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-hi-ur.hi.dix.old
Iranian Persian
- Dictionary: apertium-tg-fa.fa.dix
- Resources
Lithuanian
- Dictionary: apertium-en-lt.lt.dix
- Resources
Ossetian
- Dictionary: apertium-os-fa.os.dix
- Resources
- Ossetian: Grammatical Sketch — quite nice and comprehensive.
Piemontese
- Dictionary: apertium-it-pms.pms.dix
- Resources
- http://members.fortunecity.it/dotorcarlo/vocen1.html Piemontese--English -- public domain
- http://digilander.libero.it/dotor43/indexit.html -- Piemontese grammar incl. 17k word Piemontese--Italian dictionary (POS tagged and partly annotated for inflection). site suggests "© These pages can be freely used for all purposes, but not for political reasons, and not against the laws (no matter what is the country)."
Portuguese
Even if Apertium has a stable es-pt pair, the coverage of the Brazilian Portuguese Dictionary built at NILC (Universidade de Sao Paulo) for Unitex is much better, and could be used perhaps to improve it.
- Resources
We believe it has a LGPL license.
Quechua
- Resources
- http://www.runasimipi.org/
- AVENUE Quechua-Spanish system. (ask Francis Tyers)
Russian
- Dictionary: monodix
- Bidix: Polish-Russian
- Bidix: English-Russian
- Resources
- http://www.alphadictionary.com/rusgrammar/
- http://www.seelrc.org:8080/grammar/pdf/stand_alone_russian.pdf
- Russian analyser - non-free, Windows only
- Using Czech resources for the morphological analysis of Russian
- Pere - free translator, including Russian<->Ukranian<->English dictionaries. Built from alignments, low quality.
- Russian--Tajik phrase dictionary, 41k entries.
- Another Tajik--Russian dictionary
Sanskrit संस्कृतम्
- Dictionary: apertium-sa-XX
- Resources
Slovakian
- Dictionary: apertium-pl-sk.sk.dix
- Resources
- http://old.bohemica.com/slovak/slovakgrammar.pdf (Slovakian, with some English)
- http://pl.wiktionary.org/wiki/Aneks:J%C4%99zyk_s%C5%82owacki_-_tabele_koniugacji (In Polish)
- http://www.angelfire.com/sk3/quality/Slovak_declension.html
- http://www.juls.savba.sk/msj/
Urdu
- Dictionary: apertium-hi-ur.ur.dix
- Resources
- http://www.lama.univ-savoie.fr/~humayoun/UrduMorph/ — GPL analyser of Urdu
- http://www.crulp.org/software/langproc/E2UMachineTranslationSystem.htm -- Urdu--English MT system