Difference between revisions of "User:Eden/GSoC progress"

From Apertium
Jump to navigation Jump to search
Line 69: Line 69:
| 61.71%
| 61.71%
| 61.52%
| 61.52%
| 85.65%,71.98%
| 78.82%,63.55%
| 81.72%,68.82%
| 81.72%,68.82%
|
|

Revision as of 05:54, 21 June 2019

Status table

Week Stems naïve coverage WER,PER Progress
dates lin lin-eng lin lin-eng lin→eng eng→lin Evaluation Notes
0 May 20 - May 26 727 139 61.95% 40.86% 86.79%,80.87% 75.27%,63.98%
1 May 27 - June 02 904 139 62.57% 40.86% 86.79%,80.87% 75.27%,63.98%
2 May 03 - June 09 1,154 1,416 63.17% 53.03% 87.02%,79.95% 74.46%,60.22%
3 June 10 - June 16 1,172 1,501 61.60% 91.57%,79.04% 75.85%,62.90% WER for 'lin-eng' went up because of an incomplete rule for verbs that creates unnecessary pronouns. Main work next week will be on rules to dramatically improve WER and PER.
4 June 17 - June 23 1,178 1,510 61.71% 61.52% 78.82%,63.55% 81.72%,68.82%

Notes

  • To count stems in lexc, try:
 grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
  • To count stems in the bidix, try this:
 grep "<p" apertium-eng-lin.eng-lin.dix  | wc -l
  • To get WER and PER use apertium-eval-translator-line