Difference between revisions of "Apertium-ara-heb"

From Apertium
Jump to navigation Jump to search
Line 8: Line 8:
 
* http://starling.rinet.ru/cgi-bin/response.cgi?root=config&morpho=0&basename=\data\semham\semet&first=1 big and scary database of semitic etymology
 
* http://starling.rinet.ru/cgi-bin/response.cgi?root=config&morpho=0&basename=\data\semham\semet&first=1 big and scary database of semitic etymology
 
** http://www.semiticroots.net/index.php?r=root/view&id=1 same thing maybe?
 
** http://www.semiticroots.net/index.php?r=root/view&id=1 same thing maybe?
* http://babelnet.org/search?word=قال&lang=AR&langTrans=HE is NC, but should be possible to get entries the same way from http://compling.hss.ntu.edu.sg/omw/ – the Hebrew and Arabic wordnets are GPL and CC-BY-SA, resp.
+
* http://babelnet.org/search?word=قال&lang=AR&langTrans=HE is NC
  +
* http://compling.hss.ntu.edu.sg/omw/ – the Hebrew and Arabic wordnets are GPL and CC-BY-SA, resp.
  +
** <pre>$ get () { awk -vpos=$1 -F'\t' '$1~"-"pos"$" && $2=="lemma"' | sort; }
  +
for pos in n v a r; do echo -n "$pos "; join -j1 -t$'\t' <(get $pos < arb/wn-data-arb.tab) <(get $pos <heb/wn-data-heb.tab) |wc -l;done
  +
n 4687
  +
v 1316
  +
a 115
  +
r 82
  +
</pre>

Revision as of 09:21, 20 April 2015

Resources

for pos in n v a r; do echo -n "$pos "; join -j1 -t$'\t' <(get $pos < arb/wn-data-arb.tab) <(get $pos <heb/wn-data-heb.tab) |wc -l;done n 4687 v 1316 a 115 r 82