Chinese and Spanish

From Apertium
Jump to navigation Jump to search

Segmentadors

Nom Rendiment
LRLM
Cobertura òptima
zhseg
Stanford

Pla de treball

Week Dates Trimmed coverage Testvoc Evaluation Notes Achieved
0 21/05—16/06 45% 500 words Preliminary evaluation. Translate the story total coverage and without diagnostics. Get a baseline WER. Create zho.dix by: (a) extracting word + POS from Wiktionary. Test and evaluate segmentation strategies and produce report. WER: 85.55%,
BLEU: 0.1184,
Cov: ?
1 17/06—23/06 50% <num> - Numerals should be added and testvoc clean.
2 24/06—30/06 53% <cnjcoo> <cnjadv> <cnjsub> -
3 01/07—07/07 59% <adv> 200 words
4 08/07—14/07 63% <prn> <det> -
5 15/07—21/07 68% <adj> -
6 22/07—28/07 70% <n> 500 words Midterm evaluation.
7 29/07—04/08 73% - -
8 05/08—11/08 75% - -
9 12/08—18/08 77% - 200 words
10 19/08—25/08 80% <vblex> -
11 26/08—01/09 82% - -
12 02/09—08/09 83% - -
13 09/09—15/09 85% all categories clean 500 words Final evaluation. Tidying up, releasing