Difference between revisions of "Chinese and Spanish"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
| Line 21: | Line 21: | ||
{|class=wikitable  | 
  {|class=wikitable  | 
||
|-  | 
  |-  | 
||
! Week    !! Dates              !! Trimmed coverage !! Testvoc      !! Evaluation !! Notes !! Achieved   | 
  ! Week    !! Dates              !! Trimmed coverage !! Achieved !!  Testvoc      !! Evaluation !! Notes !! Achieved   | 
||
|-  | 
  |-  | 
||
| 0       || 21/05—16/06  ||   45%            ||    ||   500 words || '''Preliminary evaluation'''. Translate the story total coverage and without diagnostics. Get a baseline WER. Create <code>zho.dix</code> by: (a) extracting word + POS from Wiktionary. Test and evaluate segmentation strategies and produce report. || WER: 85.55%,<br/>BLEU: 0.1184,<br/>Cov: ?   | 
  | 0       || 21/05—16/06  ||   45%            || ?||    ||   500 words || '''Preliminary evaluation'''. Translate the story total coverage and without diagnostics. Get a baseline WER. Create <code>zho.dix</code> by: (a) extracting word + POS from Wiktionary. Test and evaluate segmentation strategies and produce report. || WER: 85.55%,<br/>BLEU: 0.1184,<br/>Cov: ?   | 
||
|-  | 
  |-  | 
||
| 1       || 17/06—23/06  ||   50%            ||  {{tag|num}}  || -     || Numerals should be added and testvoc clean. ||   | 
  | 1       || 17/06—23/06  ||   50%            || ?||   {{tag|num}}  || -     || Numerals should be added and testvoc clean. ||   | 
||
|-  | 
  |-  | 
||
| 2       || 24/06—30/06  ||   53%            ||  {{tag|cnjcoo}} {{tag|cnjadv}} {{tag|cnjsub}} || -     || ||   | 
  | 2       || 24/06—30/06  ||   53%            || ?||   {{tag|cnjcoo}} {{tag|cnjadv}} {{tag|cnjsub}} || -     || ||   | 
||
|-  | 
  |-  | 
||
| 3       || 01/07—07/07  ||   59%            ||  {{tag|adv}} || 200 words    || ||   | 
  | 3       || 01/07—07/07  ||   59%            ||||   {{tag|adv}} || 200 words    || ||   | 
||
|-  | 
  |-  | 
||
| 4       || 08/07—14/07  ||   63%            ||  {{tag|prn}} {{tag|det}} || -   || ||  | 
  | 4       || 08/07—14/07  ||   63%            || ||  {{tag|prn}} {{tag|det}} || -   || ||  | 
||
|-  | 
  |-  | 
||
| 5       || 15/07—21/07  ||   68%            ||  {{tag|adj}}  || -   || ||  | 
  | 5       || 15/07—21/07  ||   68%            ||||   {{tag|adj}}  || -   || ||  | 
||
|-  | 
  |-  | 
||
| 6       || 22/07—28/07  ||   70%            ||  {{tag|n}} || 500 words   || '''Midterm evaluation'''.||  | 
  | 6       || 22/07—28/07  ||   70%            ||||   {{tag|n}} || 500 words   || '''Midterm evaluation'''.||  | 
||
|-  | 
  |-  | 
||
| 7       || 29/07—04/08  ||   73%            ||  - || -   || ||  | 
  | 7       || 29/07—04/08  ||   73%            ||||   - || -   || ||  | 
||
|-  | 
  |-  | 
||
| 8       || 05/08—11/08  ||   75%            ||  - || -   || ||  | 
  | 8       || 05/08—11/08  ||   75%            ||||   - || -   || ||  | 
||
|-  | 
  |-  | 
||
| 9       || 12/08—18/08  ||   77%            ||  - || 200 words   ||  ||  | 
  | 9       || 12/08—18/08  ||   77%            || ||  - || 200 words   ||  ||  | 
||
|-  | 
  |-  | 
||
| 10      || 19/08—25/08  ||   80%            ||  {{tag|vblex}} || -  || ||  | 
  | 10      || 19/08—25/08  ||   80%            ||||   {{tag|vblex}} || -  || ||  | 
||
|-  | 
  |-  | 
||
| 11      || 26/08—01/09  ||   82%            || - || -  ||  ||  | 
  | 11      || 26/08—01/09  ||   82%            ||||  - || -  ||  ||  | 
||
|-  | 
  |-  | 
||
| 12      || 02/09—08/09  ||   83%            || - || -  ||  ||  | 
  | 12      || 02/09—08/09  ||   83%            ||||  - || -  ||  ||  | 
||
|-  | 
  |-  | 
||
| 13      || 09/09—15/09  ||   85%            || ''all categories clean'' || 500 words  || '''Final evaluation'''. Tidying up, releasing ||  | 
  | 13      || 09/09—15/09  ||   85%            ||||  ''all categories clean'' || 500 words  || '''Final evaluation'''. Tidying up, releasing ||  | 
||
|-  | 
  |-  | 
||
|}  | 
  |}  | 
||
Revision as of 13:01, 2 July 2013
Contents | 
Segmentadors
| Nom | Rendiment | 
|---|---|
| LRLM | |
| Cobertura òptima | |
| zhseg | |
| Stanford | 
Pla de treball
| Week | Dates | Trimmed coverage | Achieved | Testvoc | Evaluation | Notes | Achieved | 
|---|---|---|---|---|---|---|---|
| 0 | 21/05—16/06 | 45% | ? | 500 words | Preliminary evaluation. Translate the story total coverage and without diagnostics. Get a baseline WER. Create zho.dix by: (a) extracting word + POS from Wiktionary. Test and evaluate segmentation strategies and produce report. | 
WER: 85.55%, BLEU: 0.1184, Cov: ?  | |
| 1 | 17/06—23/06 | 50% | ? | <num> | 
- | Numerals should be added and testvoc clean. | |
| 2 | 24/06—30/06 | 53% | ? | <cnjcoo> <cnjadv> <cnjsub> | 
- | ||
| 3 | 01/07—07/07 | 59% | <adv> | 
200 words | |||
| 4 | 08/07—14/07 | 63% | <prn> <det> | 
- | |||
| 5 | 15/07—21/07 | 68% | <adj> | 
- | |||
| 6 | 22/07—28/07 | 70% | <n> | 
500 words | Midterm evaluation. | ||
| 7 | 29/07—04/08 | 73% | - | - | |||
| 8 | 05/08—11/08 | 75% | - | - | |||
| 9 | 12/08—18/08 | 77% | - | 200 words | |||
| 10 | 19/08—25/08 | 80% | <vblex> | 
- | |||
| 11 | 26/08—01/09 | 82% | - | - | |||
| 12 | 02/09—08/09 | 83% | - | - | |||
| 13 | 09/09—15/09 | 85% | all categories clean | 500 words | Final evaluation. Tidying up, releasing |