Difference between revisions of "Specific resources per language"

From Apertium
Jump to navigation Jump to search
Line 64: Line 64:
 
;Resources
 
;Resources
   
* [http://nlp.fi.muni.cz/nlp/aisa/NlpCz/Frekvence_slov_lemmat.html Most frequent words] Also includes a list of the most frequent bi- and tri-grams, but these are of little use as multiwords
 
 
* [http://users.ox.ac.uk/~tayl0010/links.html James Naughton's links]
 
* [http://users.ox.ac.uk/~tayl0010/links.html James Naughton's links]
 
* [http://www.czech-language.cz/alphabet/alph-krtiny.html Some complications with diacritics]
 
* [http://www.czech-language.cz/alphabet/alph-krtiny.html Some complications with diacritics]

Revision as of 20:26, 11 December 2019

The incubator can be found in the 'incubator' column in https://apertium.github.io/apertium-on-github/source-browser.html. It houses language pairs which haven't completely matured and are under work.


Specific resources per language

Here are some links to resources that might be useful for expanding on work in the Incubator. Below you can put resources which will be useful in the construction. Try and mark them for licence, or at least free/non-free.

See also the individual language pages.

Albanian

Dictionary: Albanian Monodix
Resources

Armenian

Dictionary: Armenian Monodix
Resources

Assamese and Hindi

Dictionary: Assemese-Hindi Bidix


--- Anusuya

Belarusian

Bengali

Bulgarian

Cornish

Dictionary: Cornish Monodix from SourceForge

This resource has not been migrated to GitHub from SVN

Resources

Czech

Dictionary: apertium-pl-cs.cs.dix.xml

This resource has not been migrated to GitHub from SVN

Dictionary: Czech-Esperanto Bidix
Dictionary: Czech-Slovenian Bidix
Resources

Faroese

Dictionary: Faroese Monodix
Resources

Finnish

See also: Omorfi
Resources
s = lemma
hn = homonymy ref
t = inflection info
tn = inflection number (referring to table)
av = ref to consonant gradation

German and English

German-English bilingual dictionary (>216,000 entries), generated from linguistic data (GPL Version 2 or later) available for "Ding: A Dictionary LookUp program" (version 1.5 2007-04-09) from Frank Richter, Technische Universität Chemnitz

German-English Dictionary

Greek

Dictionary: Greek Monodix
Greek-English Dictionary: Greek-English Dictionary
Resources

Hebrew

Resources

Hindi

See also: Hindi
Resources


Iranian Persian

Dictionary: Persian Monodix
Resources

Ingush

Resources

Latvian

Resources
See also

Lithuanian

Dictionary: Lithuanian Monodix
Resources

Nogai

Resources

Ossetian

Dictionary: Ossetian Monodix
Resources

Piemontese

Dictionary: Piemontese Monodix from SourceForge

This resource has not been migrated to GitHub from SVN

Resources

Portuguese

Even if Apertium has a stable es-pt pair, the coverage of the Brazilian Portuguese Dictionary built at NILC (Universidade de Sao Paulo) for Unitex is much better, and could be used perhaps to improve it.

Resources

We believe it has a LGPL license.

Punjabi

Resources

Quechua

Resources

Russian

Dictionary: monodix
Bidix: Polish-Russian
Bidix: English-Russian
Resources

Sanskrit संस्कृतम्

Dictionary: Sanskrit Monodix
Resources

Slovakian

Dictionary: Slovak Monodix
Resources

Thai

Urdu

Dictionary: Urdu Monodix
Bidix: Hindi-Urdu Monodix
Resources


Github Migration

For languages whose resources are not yet on Github, you can use apertium-init to make their corresponding repository and add the files from SVN to that repositiry.