Difference between revisions of "Chinese and Spanish"
		
		
		
		
		
		
		
				
		
		
		
		
		
		
		
	
Revision as of 08:28, 10 June 2013
Segmenters
| Name | Performance |
|---|---|
| LRLM | |
| Optimal coverage | |
| zhseg | |
| Stanford | |
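For orientation, the first strategy in the table can be sketched as follows. This is a minimal sketch that assumes LRLM stands for left-to-right longest match: at each position, take the longest dictionary entry that matches and continue after it. The function name `lrlm_segment`, the toy lexicon, and the fallback of emitting an unknown character as a single token are illustrative assumptions, not the project's actual code.

```python
def lrlm_segment(text, lexicon, max_len=None):
    """Greedy left-to-right longest-match segmentation (illustrative sketch)."""
    if max_len is None:
        # Longest entry in the lexicon bounds how far ahead we need to look.
        max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until one matches.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in lexicon:
                break
        else:
            # Assumption: an unknown character becomes a one-character token.
            j = i + 1
        tokens.append(text[i:j])
        i = j
    return tokens


# Hypothetical toy lexicon, for illustration only.
toy_lexicon = {"中", "中国", "中国人", "人", "民", "人民"}
print(lrlm_segment("中国人民", toy_lexicon))  # ['中国人', '民']
```

The example shows the typical weakness of the greedy strategy: it commits to 中国人 and leaves 民 alone, although 中国 + 人民 is also covered by the lexicon. That kind of difference is what a comparison between segmenters would measure.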
Work plan
| Week | Dates | Trimmed coverage | Testvoc | Evaluation | Notes | Achieved |
|---|---|---|---|---|---|---|
| 0 | 21/05–16/06 | 45% | | 500 words | Preliminary evaluation. Translate the story with total coverage and without diagnostics. Get a baseline WER. Create zho.dix by: (a) extracting word + POS from Wiktionary. Test and evaluate segmentation strategies and produce report. | WER: 85.55%, BLEU: 0.1184, Cov: ? |
| 1 | 17/06–23/06 | 50% | <num> | - | Numerals should be added and testvoc clean. | |
| 2 | 24/06–30/06 | 53% | <cnjcoo> <cnjadv> <cnjsub> | - | | |
| 3 | 01/07–07/07 | 59% | <adv> | 200 words | | |
| 4 | 08/07–14/07 | 63% | <prn> <det> | - | | |
| 5 | 15/07–21/07 | 68% | <adj> | - | | |
| 6 | 22/07–28/07 | 70% | <n> | 500 words | Midterm evaluation. | |
| 7 | 29/07–04/08 | 73% | - | - | | |
| 8 | 05/08–11/08 | 75% | - | - | | |
| 9 | 12/08–18/08 | 77% | - | 200 words | | |
| 10 | 19/08–25/08 | 80% | <vblex> | - | | |
| 11 | 26/08–01/09 | 82% | - | - | | |
| 12 | 02/09–08/09 | 83% | - | - | | |
| 13 | 09/09–15/09 | 85% | all categories clean | 500 words | Final evaluation. Tidying up, releasing. | |
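The WER figure tracked in the work plan is conventionally the word-level edit distance between the machine translation and a reference translation, divided by the number of words in the reference. The sketch below assumes that standard definition and whitespace tokenisation. The function name `word_error_rate` and the sample sentences are illustrative, and this is not necessarily the tool that produced the figures in the table.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by reference length (sketch)."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # reference word missing from hypothesis
                            curr[j - 1] + 1,      # extra word in hypothesis
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One substitution and one missing word against a four-word reference.
print(word_error_rate("el gato come pescado", "el perro come"))  # 0.5
```

Because the denominator is the reference length, WER can exceed 100% when the hypothesis is much longer than the reference.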