Difference between revisions of "Arabic"
Jump to navigation
Jump to search
m (→Resources: wops) |
|||
Line 1: | Line 1: | ||
==Resources== |
==Resources== |
||
⚫ | |||
* http://www.qamus.org/morphology.htm |
|||
⚫ | |||
⚫ | |||
⚫ | |||
** Direct download: http://heanet.dl.sourceforge.net/sourceforge/aramorph/aramorph-1.2.1.tar.gz |
** Direct download: http://heanet.dl.sourceforge.net/sourceforge/aramorph/aramorph-1.2.1.tar.gz |
||
* [http://www.nongnu.org/aramorph/ AraMorph - Java] - An Arabic morphological analyzer and part-of-speech tagger rewritten in Java for [http://lucene.apache.org/ Lucene] |
* [http://www.nongnu.org/aramorph/ AraMorph - Java] - An Arabic morphological analyzer and part-of-speech tagger rewritten in Java for [http://lucene.apache.org/ Lucene] |
Revision as of 20:03, 26 January 2010
Resources
- Sarf - Arabic Morphology System (all in Java...)
- AraMorph - Perl - An Arabic morphological analyzer and part-of-speech tagger written in Perl (originally by Tim Buckwalter, see http://www.qamus.org/morphology.htm)
- AraMorph - Java - An Arabic morphological analyzer and part-of-speech tagger rewritten in Java for Lucene
- Arabic dictionaries, by Jon Dehdari, for the Link-Grammar parser. These require the Aramorph stemming package, above.
Corpora
- Meedan-Memory, Arabic-English TMX (sentence-aligned), ~467,000 words on the English side, Open Database Licence
- Quranic Arabic Corpus, 77,430 words of Quranic Arabic, with manually verified contextual POS, inflection, derivation; dependency grammar annotation is planned.