Difference between revisions of "User:Eden/GSoC progress"

From Apertium
Jump to navigation Jump to search
Line 39: Line 39:
 
* To count stems in <code>lexc</code>, try:
 
* To count stems in <code>lexc</code>, try:
 
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
 
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
  +
  +
* To count stems in the bidix, try this:
  +
grep "<p" apertium-eng-lin.eng-lin.dix | wc -l
  +
  +
* To get WER and PER use <code>apertium-eval-translator-line</code>
  +
  +
* Coverage above is on [https://dumps.wikimedia.org/lnwiki/20190520/ 2019-05-20 Wikipedia dump].

Revision as of 19:51, 30 May 2019

Status table

Week Stems naïve coverage WER,PER Progress
dates lin lin-eng lin lin-eng lin→eng eng→lin Evaluation Notes
0 May 20 - May 26 727 139 61.95% 40.86% 86.79%,80.87% 75.27%,63.98%
1 May 27 - June 02

Notes

  • To count stems in lexc, try:
 grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
  • To count stems in the bidix, try this:
 grep "<p" apertium-eng-lin.eng-lin.dix  | wc -l
  • To get WER and PER use apertium-eval-translator-line