Difference between revisions of "User:Eden/GSoC progress"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
Firespeaker (talk | contribs)  | 
				Firespeaker (talk | contribs)   (→Notes)  | 
				||
| Line 39: | Line 39: | ||
* To count stems in <code>lexc</code>,  try:  | 
  * To count stems in <code>lexc</code>,  try:  | 
||
  grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l  | 
    grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l  | 
||
* To count stems in the bidix, try this:  | 
|||
  grep "<p" apertium-eng-lin.eng-lin.dix  | wc -l  | 
|||
* To get WER and PER use <code>apertium-eval-translator-line</code>  | 
|||
* Coverage above is on [https://dumps.wikimedia.org/lnwiki/20190520/ 2019-05-20 Wikipedia dump].  | 
|||
Revision as of 19:51, 30 May 2019
Status table
| Week | Stems | naïve coverage | WER,PER | Progress | |||||
|---|---|---|---|---|---|---|---|---|---|
| № | dates | lin | lin-eng | lin | lin-eng | lin→eng | eng→lin | Evaluation | Notes | 
| 0 | May 20 - May 26 | 727 | 139 | 61.95% | 40.86% | 86.79%,80.87% | 75.27%,63.98% | ||
| 1 | May 27 - June 02 | ||||||||
Notes
- To count stems in 
lexc, try: 
grep -E ":\w+.*;" apertium-lin.lin.lexc | grep -v "[<>]" | wc -l
- To count stems in the bidix, try this:
 
grep "<p" apertium-eng-lin.eng-lin.dix | wc -l
- To get WER and PER use 
apertium-eval-translator-line 
- Coverage above is on 2019-05-20 Wikipedia dump.