Catalan and Sardinian/Work plan
Revisions by Hectoralos
Latest revision as of 23:07, 25 August 2017
Note: The Sardinian → Italian part of the project is optional and subject to all goals being achieved for the Catalan → Sardinian part.
| Week | Dates | Evaluation | Planned bidix | Planned coverage (%) | Planned WER (%) | Testvoc | Evaluation | Actual bidix | Actual coverage (%) | WER (%) | Errors | Completed? |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Catalan > Sardinian | | | | | | | 2,877 | 77.7% | 34.8% | | ✓ |
| 1 | 30 May–4 June | | ~4,000 | ~79.0% | | | | 3,239 | 77.4% | | 488,474 | ✕ |
| 2 | 5 June–11 June | | ~6,000 | ~82.0% | | | | 4,918 | 86.4% | | 774,697 | ½ |
| 3 | 12 June–18 June | | ~8,000 | ~83.5% | | | | 6,532 | 88.4% | | 697,757 | ½ |
| 4 | 19 June–25 June | | ~10,000 | ~84.5% | | | | 8,694 | 90.1% | | 910,172 | ✓ |
| 5 | 26 June–2 July | 26 June–29 June | ~12,000 | ~85.5% | ~25% | pr, cnj*, adv | | 8,766 | 90.1% | 7.6% | 7,491 | |
| 6 | 3 July–9 July | | ~14,000 | ~86.5% | | prn, det | | 20,062 | 91.2% | | 9,302 | ✓ |
| 7 | 10 July–16 July | | ~16,000 | ~87.5% | | vblex | | 22,869 | 91.9% | | 3,149 | ✓ |
| 8 | 17 July–23 July | | ~18,000 | ~88.5% | | adj | | 23,915 | 92.3% | | 0 | ✓ |
| 9 | 24 July–30 July | 24 July–28 July | ~18,000 | ~89.5% | ~15% | n | | 24,528 | 92.6% | 13.9% | | ✓ |
| 10 | 31 July–6 August | | ~20,000 | ~90.0% | | | | 26,387 | 93.1% | | | |
| 13 | 21 August–27 August | 21 August–29 August | ~20,000 | ~90.0% | | | | 29,070 | 94.0% | | 0 | ✓ |
| 0 | Sardinian > Italian | | | | | | | 25,500 | 89.1% | 31.9% | | ✓ |
| 11 | 7 August–13 August | | ~25,500 | | | pr, cnj*, adv | | 25,504 | 88.7% | | 42,708 | |
| 12 | 14 August–20 August | | ~25,500 | ~89.1% | ~20% | prn, det, n, adj | | 27,093 | 88.5% (partly dialectal corpus); 92.6% (LSC literary corpus) | | 443,161 | ✕ |
| 13 | 21 August–27 August | 21 August–29 August | | | | vblex | | | | | | |
How to calculate the numbers
- Errors (calculate in apertium-cat-srd/dev or apertium-srd-ita)
$ bash dev/testvoc/generation.sh cat-srd | wc -l # in apertium-cat-srd
$ bash dev/testvoc/generation.sh srd-ita | wc -l # in apertium-srd-ita
- Bidix (calculate in apertium-cat-srd or apertium-srd-ita)
$ cat apertium-cat-srd.cat-srd.dix | grep '<l' | wc -l
$ cat apertium-srd-ita.srd-ita.dix | grep '<l' | wc -l
- Coverage (calculate in apertium-cat-srd or apertium-srd-ita)
$ cat ../apertium-cat/corpus/corpus_ca_wp100000.txt | apertium -d . cat-srd-morph | sed 's/\$\W*\^/$\n^/g' > /tmp/cat-srd.coverage.txt
$ calc `cat /tmp/cat-srd.coverage.txt | grep -v '\*' | wc -l `/`cat /tmp/cat-srd.coverage.txt | wc -l`
$ cat srd.crp.txt | apertium -d . srd-ita-morph | sed 's/\$\W*\^/$\n^/g' > /tmp/srd-ita.coverage.txt
$ calc `cat /tmp/srd-ita.coverage.txt | grep -v '\*' | wc -l `/`cat /tmp/srd-ita.coverage.txt | wc -l`
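The coverage figure is the share of analysed units that contain no `*` (the unknown-word mark). As a minimal sketch, assuming `calc` is not installed and that the file holds one lexical unit per line (as `/tmp/cat-srd.coverage.txt` does after the `sed` step), the same ratio can be computed with `awk` alone. The helper name `coverage_pct` is hypothetical and not part of Apertium; it prints a percentage with one decimal rather than the raw fraction that `calc` returns.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Apertium): percentage of lines without '*'.
# Reads the files given as arguments, or standard input if none are given.
coverage_pct() {
    # LC_ALL=C keeps the decimal separator a dot regardless of the user's locale.
    LC_ALL=C awk '
        { total++ }            # every line is one analysed unit
        !/\*/ { known++ }      # units without "*" were recognised
        END {
            if (total == 0) { print "0.0"; exit }
            printf "%.1f\n", 100 * known / total
        }
    ' "$@"
}

# Small self-check: three known units and one unknown unit.
printf '^a/a<n>$\n^b/*b$\n^c/c<n>$\n^d/d<n>$\n' | coverage_pct   # prints 75.0
```

On real data it would be called as `coverage_pct /tmp/cat-srd.coverage.txt` or `coverage_pct /tmp/srd-ita.coverage.txt`.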