Difference between revisions of "Occitan and French/Work plan"

From Apertium
Jump to navigation Jump to search
Line 2: Line 2:
   
 
{|class=wikitable
 
{|class=wikitable
! Setmana !! Dates !! Avaluació !! Bidix<br/>previst !!(%) Cobertura<br/>prevista !! (%) WER<br/>previst !! Testvoc !! Avaluació !! Bidix<br/>real !! (%) Cobertura<br/>real !! (%) WER !! Err. !! Complert ?
+
! Setmana !! Dates !! Descripció !! Bidix<br/>(sense np)<br/>previst !!(%) Cobertura<br/>prevista !! (%) WER<br/>previst !! Testvoc !! Avaluació !! Bidix<br/>real !! (%) Cobertura<br/>real !! (%) WER !! Err. !! Complert ?
 
|-
 
|-
 
| 0 || <b>français > occitan</b> || || || || || || || || || || ||
 
| 0 || <b>français > occitan</b> || || || || || || || || || || ||
Line 50: Line 50:
   
 
<pre>
 
<pre>
$ cat apertium-oci-fra.oci-fra.dix | grep '<l' | wc -l
+
$ cat apertium-oci-fra.oci-fra.dix | grep '<l' | grep -v '¨np"' | wc -l
 
</pre>
 
</pre>
   

Revision as of 06:30, 16 March 2018

Note: The French → Occitan part of the project is the main direction.

Setmana Dates Descripció Bidix
(sense np)
previst
(%) Cobertura
prevista
(%) WER
previst
Testvoc Avaluació Bidix
real
(%) Cobertura
real
(%) WER Err. Complert ?
0 français > occitan
1 14 mai—20 mai ~4000 ~79,0%
2 21 mai—11 mai ~6000 ~82,0%
3 28 mai—3 junh ~8000 ~83.5%
4 4 junh—10 junh ~10.000 ~84.5%
5 11 junh—17 junh 26 juny—29 juny ~12.000 ~85.5% ~25% pr, cnj*, adv
6 18 junh—24 junh ~14.000 ~86.5% prn, det
7 25 junh—1 julhet ~16.000 ~87.5% vblex
8 2 julhet—8 julhet ~18.000 ~88.5% adj
9 9 julhet—15 julhet 24 juliol—28 juliol ~18.000 ~89.5% ~15% n
10 16 julhet—22 julhet ~20.000 ~90.0%
13 23 julhet—29 julhet 21 agost—29 agost ~20.000 ~90.0%
0 occitan > français
11 30 julhet—5 agost ~25.500 pr, cnj*, adv
12 6 agost—14 agost avaluacion finala ~25.500 ~89,1% ~20% prn, det, n, adj
13 21 agost—27 agost 21 agost—29 agost vblex

Per a calcular els nombres

Errors (calcular en apertium-fra-oci/dev)
$ bash dev/testvoc/generation.sh fra-oci | wc -l  # en apertium-oci-fra
$ bash dev/testvoc/generation.sh oci-fra | wc -l  # en apertium-oci-fra
Bidix (calcular en apertium-oci-fra)
$ cat apertium-oci-fra.oci-fra.dix | grep '<l'  | grep -v '¨np"' | wc -l
Cobertura (calcular en apertium-oci-fra)
$ cat ../apertium-fra/corpus/corpus_fra_wp100000.txt | apertium -d . fra-oci-morph | sed 's/\$\W*\^/$\n^/g' > /tmp/fra-oci.coverage.txt
$ calc `cat /tmp/fra-oci.coverage.txt | grep -v '\*' | wc -l `/`cat /tmp/fra-oci.coverage.txt | wc -l`

$ cat ../apertium-cat/corpus/corpus_oci_wp100000.txt | apertium -d . oci-fra-morph | sed 's/\$\W*\^/$\n^/g' > /tmp/oci-fra.coverage.txt
$ calc `cat /tmp/oci-fra.coverage.txt | grep -v '\*' | wc -l `/`cat /tmp/oci-fra.coverage.txt | wc -l`

Veire tanben