Difference between revisions of "User:Eden/GSoC progress"
Firespeaker (→Notes)
Revision as of 19:51, 30 May 2019
Status table
Week № | Dates | Stems (lin) | Stems (lin-eng) | Naïve coverage (lin) | Naïve coverage (lin-eng) | WER, PER (lin→eng) | WER, PER (eng→lin) | Progress: evaluation | Progress: notes |
---|---|---|---|---|---|---|---|---|---|
0 | May 20 - May 26 | 727 | 139 | 61.95% | 40.86% | 86.79%, 80.87% | 75.27%, 63.98% | | |
1 | May 27 - June 02 | | | | | | | | |
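As a reading aid for the WER, PER columns, here is a sketch of the usual word error rate definition. It assumes that apertium-eval-translator-line follows the standard edit-distance formulation, which this page does not confirm. S, D and I are the numbers of substituted, deleted and inserted words in a minimum edit-distance alignment between the translation and the reference, and N is the number of words in the reference.

```latex
\mathrm{WER} = \frac{S + D + I}{N} \times 100\%
```

PER (position-independent error rate) is the same kind of measure with word order ignored, so under the standard definitions it should not exceed WER for the same pair of texts. The week 0 figures are consistent with that: 80.87% against 86.79% for lin→eng, and 63.98% against 75.27% for eng→lin.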
Notes
- To count stems in lexc, try:
  grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
- To count stems in the bidix, try this:
  grep "<p" apertium-eng-lin.eng-lin.dix | wc -l
- To get WER and PER use apertium-eval-translator-line
- Coverage above is on the 2019-05-20 Wikipedia dump (https://dumps.wikimedia.org/lnwiki/20190520/).
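To see what the two counting commands above actually match, the following sketch runs them on tiny made-up files. The file names sample.lexc and sample.dix, the lexicon names, and the entries are all hypothetical and are not taken from apertium-lin or apertium-eng-lin; only the grep pipelines come from the notes.

```shell
#!/bin/sh
# Sanity check for the two counting pipelines on tiny, made-up sample files.
# All file contents below are hypothetical, not taken from the real dictionaries.
tmp=$(mktemp -d)

cat > "$tmp/sample.lexc" <<'EOF'
LEXICON Root
Nouns ;

LEXICON N-INFL
%<n%>%<sg%>: # ;
%<n%>%<pl%>:s # ;

LEXICON Nouns
mwana:mwana N-INFL ; ! "child"
ndako:ndako N-INFL ; ! "house"
EOF

cat > "$tmp/sample.dix" <<'EOF'
<dictionary>
  <section id="main" type="standard">
    <e><p><l>child<s n="n"/></l><r>mwana<s n="n"/></r></p></e>
    <e><p><l>house<s n="n"/></l><r>ndako<s n="n"/></r></p></e>
  </section>
</dictionary>
EOF

# lexc: keep "upper:lower ... ;" lines, then drop lines that contain tags (< or >),
# which removes the inflection entries and leaves only the stem entries.
lexc_stems=$(grep -E ":\w+.*;" "$tmp/sample.lexc" | grep -v "[<>]" | wc -l | tr -d ' ')

# bidix: count lines containing "<p" (assumes one entry per line).
bidix_stems=$(grep "<p" "$tmp/sample.dix" | wc -l | tr -d ' ')

echo "lexc stems:  $lexc_stems"
echo "bidix stems: $bidix_stems"
rm -rf "$tmp"
```

On this sample both counts are 2. Two caveats follow from the patterns themselves: the lexc pipeline skips any stem line that contains < or >, and the bidix pattern "<p" also matches lines with other tags that start with the same characters (for example <par or <pardef), so both numbers are best treated as approximations.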