Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Arabic

From Apertium
(Difference between revisions)
Jump to: navigation, search
(Resources)
 
Line 19: Line 19:
 
* There is a good documentation of how to make a morphological analyser for Arabic (and Semitic languages in general) in the Beesley/Karttunen [http://fsmbook.com finite state transducer book], documenting the Xerox compiler (Ken Beesley also made an Arabic fst). Also, there now is an open source compiler reading the Xerox format, the [[HFST]] compiler.
 
* There is a good documentation of how to make a morphological analyser for Arabic (and Semitic languages in general) in the Beesley/Karttunen [http://fsmbook.com finite state transducer book], documenting the Xerox compiler (Ken Beesley also made an Arabic fst). Also, there now is an open source compiler reading the Xerox format, the [[HFST]] compiler.
 
* And there is also an open source finite state morphological analyser for Arabic, [http://sourceforge.net/projects/aracomlex/ AraComLex] ([http://www.cngl.ie/aracomlex/morph.php online interface here]). Among other resources related to AraComLex there is [http://sourceforge.net/projects/arabicpatterns/ a list of Arabic morphological patterns] and [http://sourceforge.net/projects/arabicwordcount/ a frequency word list] from a 1 billion word corpus.
 
* And there is also an open source finite state morphological analyser for Arabic, [http://sourceforge.net/projects/aracomlex/ AraComLex] ([http://www.cngl.ie/aracomlex/morph.php online interface here]). Among other resources related to AraComLex there is [http://sourceforge.net/projects/arabicpatterns/ a list of Arabic morphological patterns] and [http://sourceforge.net/projects/arabicwordcount/ a frequency word list] from a 1 billion word corpus.
  +
  +
* [http://arabicreference.com/ Arabic Reference] by Hans Wehr with form I vowelling, masadir (infinitives), broken plurals
   
 
===Wordnet and dbpedia===
 
===Wordnet and dbpedia===

Latest revision as of 10:14, 27 June 2019

Arabic is a semitic language (http://en.wikipedia.org/wiki/Hamito-Semitic).

Language pairs:

Developing other semitic language pairs with Arabic would be a good idea (e.g. Tamazight).

[edit] Resources

  • Sarf - Arabic Morphology System (all in Java...)
  • ElixirFM (online interface here) is a Functional Arabic Morphology written in Haskell and Perl; the lexicon is a "re-processed" version of the Buckwalter analyser.
  • There is a good documentation of how to make a morphological analyser for Arabic (and Semitic languages in general) in the Beesley/Karttunen finite state transducer book, documenting the Xerox compiler (Ken Beesley also made an Arabic fst). Also, there now is an open source compiler reading the Xerox format, the HFST compiler.
  • And there is also an open source finite state morphological analyser for Arabic, AraComLex (online interface here). Among other resources related to AraComLex there is a list of Arabic morphological patterns and a frequency word list from a 1 billion word corpus.
  • Arabic Reference by Hans Wehr with form I vowelling, masadir (infinitives), broken plurals

[edit] Wordnet and dbpedia

[edit] Corpora

Personal tools