Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

User:Eden/GSoC progress

From Apertium
< User:Eden(Difference between revisions)
Jump to: navigation, search
(Status table)
(Notes)
Line 39: Line 39:
 
* To count stems in <code>lexc</code>, try:
 
* To count stems in <code>lexc</code>, try:
 
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
 
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
  +
  +
* To count stems in the bidix, try this:
  +
grep "<p" apertium-eng-lin.eng-lin.dix | wc -l
  +
  +
* To get WER and PER use <code>apertium-eval-translator-line</code>
  +
  +
* Coverage above is on [https://dumps.wikimedia.org/lnwiki/20190520/ 2019-05-20 Wikipedia dump].

Revision as of 21:51, 30 May 2019

Status table

Week Stems naïve coverage WER,PER Progress
dates lin lin-eng lin lin-eng lin→eng eng→lin Evaluation Notes
0 May 20 - May 26 727 139 61.95% 40.86% 86.79%,80.87% 75.27%,63.98%
1 May 27 - June 02

Notes

  • To count stems in lexc, try:
 grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
  • To count stems in the bidix, try this:
 grep "<p" apertium-eng-lin.eng-lin.dix  | wc -l
  • To get WER and PER use apertium-eval-translator-line
Personal tools