Difference between revisions of "User:Aidana/Proposal/Working plan"

From Apertium
Jump to navigation Jump to search
(Created page with "==Corpora== ===Downloads=== * Bitextor: https://www.dropbox.com/s/ajy53y55toh0n40/corpus%20Lab%20IIS%20%285925%29.kz?dl=0 * GCI kaz corpus: https://www.dropbox.com/s/8krkzjs...")
 
Line 4: Line 4:


* Bitextor: https://www.dropbox.com/s/ajy53y55toh0n40/corpus%20Lab%20IIS%20%285925%29.kz?dl=0
* Bitextor: https://www.dropbox.com/s/ajy53y55toh0n40/corpus%20Lab%20IIS%20%285925%29.kz?dl=0
* Akorda corpus: https://www.dropbox.com/home/apertium/lexical%20selection/Akorda?preview=akorda-kaz-8651.txt
* GCI kaz corpus: https://www.dropbox.com/s/8krkzjs1ykxwdhk/1452262000_kaz.darkgaia.txt?dl=0
* GCI kaz corpus: https://www.dropbox.com/s/8krkzjs1ykxwdhk/1452262000_kaz.darkgaia.txt?dl=0
* 12500 words from wikipedia: https://www.dropbox.com/s/lrj7i639d3g7dhn/12500words.txt?dl=0

===Expanding vocabulary===

==Coverage targets==

{|class=wikitable
!rowspan=2| Date ||colspan=4| Target || ||colspan=4| Achieved ||rowspan=2| Target<br/>achieved||rowspan=2|Stems||rowspan=2| Notes
|-
! 5925 corpus!! Akorda!! GCI corpus!! Wiki 12500 words!! !! 5925 corpus!! Akorda!! GCI corpus!! Wiki 12500 words
|-
| 23-04-2014 || || 85.70% || 97.32% ||86.83% ||83.21% || ? || || || ||align=center| || || Initial value
|-
| 30-04-2014 || 86.00% || 97.50% ||87.00% || 83.50% || || || || || ||align=center| || ||
|-
| 07-05-2014 || 86.50% || 97.70% || 87.50% || 84.00% || || || || || ||align=center| || ||
|-
| 14-05-2014 ||87.00% || 98.00% || 87.70% || 84.50% || || || || || ||align=center| || ||
|-
| 21-05-2014 || 87.50% || 98.20% || 88.00% ||85.00% || || || || || ||align=center| || ||Official GSOC start date
|-
| 28-05-2014 || 79.00% || 80.00% || 78.00% ||79.00% || || 85.67% || 87.57% || ? || 83.83% ||align=center| √ || ||
|-
| 04-06-2014 || 80.00% || 81.00% || 79.00% ||80.00% || || 88.57% || 91.51% || 84.67% || 88.22% ||align=center| √ || 5,840 ||
|-
| 11-06-2014 || 82.00% || 82.00% || 81.00% ||82.00% || || 88.92% ||91.98% || || 88.59% || || ||
|-
| 18-06-2014 || 83.00% || 83.00% || 83.00% || 83.00%|| || || || || || || ||
|-
| 25-06-2014 || 85.00% || 85.00% || 85.00% || 85.00% || || || || || || || || Midterm evaluation
|-
| 02-07-2014 || 86.00% || 86.00% || 86.00% || 86.00% || || || || || || || ||
|-
| 09-07-2014 || 87.00% || 87.00% || 87.00% ||87.00% || || || || || || || ||
|-
| 16-07-2014 || 88.00% || 88.00% || 88.00% || 88.00% || || || || || || || ||
|-
| 23-07-2014 || 89.00% || 89.00% || 89.00% || 89.00% || || || || || || || ||
|-
| 30-07-2014 || 81.50% || 91.50% || 89.50% || 89.50% || || || || || || || ||
|-
| 06-08-2014 || 93.00% || 93.00% || 90.00% || 90.00% || || || || || || || ||
|-
| 13-08-2014 || 94.00% || 94.00% || 90.00% || 90.00% || || || || || || || ||
|-
| 22-08-2014 || 95.00% ||95.00% || 90.00% ||90.00% || || || || || || || || Final target
|-
|}

Revision as of 22:01, 23 April 2016

Corpora

Downloads

Expanding vocabulary

Coverage targets

Date Target Achieved Target
achieved
Stems Notes
5925 corpus Akorda GCI corpus Wiki 12500 words 5925 corpus Akorda GCI corpus Wiki 12500 words
23-04-2014 85.70% 97.32% 86.83% 83.21% ? Initial value
30-04-2014 86.00% 97.50% 87.00% 83.50%
07-05-2014 86.50% 97.70% 87.50% 84.00%
14-05-2014 87.00% 98.00% 87.70% 84.50%
21-05-2014 87.50% 98.20% 88.00% 85.00% Official GSOC start date
28-05-2014 79.00% 80.00% 78.00% 79.00% 85.67% 87.57% ? 83.83%
04-06-2014 80.00% 81.00% 79.00% 80.00% 88.57% 91.51% 84.67% 88.22% 5,840
11-06-2014 82.00% 82.00% 81.00% 82.00% 88.92% 91.98% 88.59%
18-06-2014 83.00% 83.00% 83.00% 83.00%
25-06-2014 85.00% 85.00% 85.00% 85.00% Midterm evaluation
02-07-2014 86.00% 86.00% 86.00% 86.00%
09-07-2014 87.00% 87.00% 87.00% 87.00%
16-07-2014 88.00% 88.00% 88.00% 88.00%
23-07-2014 89.00% 89.00% 89.00% 89.00%
30-07-2014 81.50% 91.50% 89.50% 89.50%
06-08-2014 93.00% 93.00% 90.00% 90.00%
13-08-2014 94.00% 94.00% 90.00% 90.00%
22-08-2014 95.00% 95.00% 90.00% 90.00% Final target