Difference between revisions of "User:Eden/GSoC progress"
Revision as of 02:28, 20 June 2019
Status table
Week | Dates | Stems (lin) | Stems (lin-eng) | Naïve coverage (lin) | Naïve coverage (lin-eng) | WER, PER (lin→eng) | WER, PER (eng→lin) | Evaluation | Notes
---|---|---|---|---|---|---|---|---|---
0 | May 20 - May 26 | 727 | 139 | 61.95% | 40.86% | 86.79%, 80.87% | 75.27%, 63.98% | |
1 | May 27 - June 02 | 904 | 139 | 62.57% | 40.86% | 86.79%, 80.87% | 75.27%, 63.98% | |
2 | June 03 - June 09 | 1,154 | 1,416 | 63.17% | 53.03% | 87.02%, 79.95% | 74.46%, 60.22% | |
3 | June 10 - June 16 | 1,172 | 1,501 | 61.60% | | 91.57%, 79.04% | 75.85%, 62.90% | | WER for 'lin-eng' went up because of an incomplete verb rule that generates unnecessary pronouns. The main work next week will be on rules to bring WER and PER down.
4 | June 17 - June 23 | 1,178 | 1,510 | 61.71% | 61.52% | 85.65%, 71.98% | 81.72%, 68.82% | |
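
To make the WER column easier to read, here is a minimal sketch of word error rate as it is usually defined: the word-level edit distance between a translation and its reference, divided by the number of reference words. The helper name wer is hypothetical and this is not the evaluation script's own code; its tokenisation and normalisation may differ.

```shell
# Illustrative only: WER for one hypothesis/reference sentence pair.
# "wer" is a hypothetical helper, not part of any Apertium tool.
wer() {
  awk -v hyp="$1" -v ref="$2" 'BEGIN {
    n = split(ref, r, " ")          # reference words
    m = split(hyp, h, " ")          # hypothesis words
    if (n == 0) { print "undefined"; exit }
    # Standard dynamic-programming edit distance over words.
    for (j = 0; j <= m; j++) d[0, j] = j
    for (i = 1; i <= n; i++) {
      d[i, 0] = i
      for (j = 1; j <= m; j++) {
        cost = ((r[i] "") == (h[j] "")) ? 0 : 1   # force string comparison
        best = d[i-1, j-1] + cost                 # match or substitution
        if (d[i-1, j] + 1 < best) best = d[i-1, j] + 1   # deletion
        if (d[i, j-1] + 1 < best) best = d[i, j-1] + 1   # insertion
        d[i, j] = best
      }
    }
    printf "%.2f%%\n", 100 * d[n, m] / n
  }'
}

# One spurious pronoun in a four-word reference costs one insertion.
wer "he he sees the dog" "he sees the dog"   # prints 25.00%
```

Each extra word counts as one insertion, so a rule that generates unnecessary pronouns pushes WER up even when the rest of the sentence is correct.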
Notes
- To count stems in lexc, try:
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
- To count stems in the bidix, try this:
grep "<p" apertium-eng-lin.eng-lin.dix | wc -l
- To get WER and PER, use apertium-eval-translator-line.
- Coverage above was measured on the 2019-05-20 Wikipedia dump.
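
The coverage figures can be reproduced along these lines. This is a minimal sketch, assuming naïve coverage means the share of tokens that receive at least one analysis, and that the analyser output is in Apertium stream format, where an unknown word appears as ^word/*word$. The function name naive_coverage and the sample tags are hypothetical.

```shell
# Hypothetical helper: reads Apertium stream-format analyses on stdin
# and prints the percentage of tokens that have at least one analysis.
naive_coverage() {
  # Put each ^...$ lexical unit on its own line, then count them.
  grep -o '\^[^$]*\$' | awk '
    { total++ }
    /\/\*/ { unknown++ }            # "/*" marks an unanalysed token
    END {
      if (total == 0) { print "0.00%"; exit }
      printf "%.2f%%\n", 100 * (total - unknown) / total
    }'
}

# Made-up sample: four tokens, one of them unknown.
printf '^mbote/mbote<ij>$ ^xyz/*xyz$ ^na/na<pr>$ ^ngai/ngai<prn>$\n' | naive_coverage   # prints 75.00%
```

In practice the input would be the corpus piped through the language's morphological analyser rather than a hand-written line.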