User:Aidana/Proposal/Working plan
Jump to navigation
Jump to search
Corpora
Downloads
- Bitextor: https://www.dropbox.com/s/ajy53y55toh0n40/corpus%20Lab%20IIS%20%285925%29.kz?dl=0
- Akorda corpus: https://www.dropbox.com/home/apertium/lexical%20selection/Akorda?preview=akorda-kaz-8651.txt
- GCI kaz corpus: https://www.dropbox.com/s/8krkzjs1ykxwdhk/1452262000_kaz.darkgaia.txt?dl=0
- 12500 words from wikipedia: https://www.dropbox.com/s/lrj7i639d3g7dhn/12500words.txt?dl=0
Expanding vocabulary
Coverage targets
Date | Target | Achieved | Target achieved |
Stems | Notes | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
5925 corpus | GCI corpus | Wiki 12500 words | Akorda | 5925 corpus | GCI corpus | Wiki 12500 words | Akorda | |||||
23-04-2016 | 85.70% | 97.32% | 85.83% | 83.21% | 86.32% | 97.49% | 87.01% | 84.88% | Initial value | |||
30-04-2016 | 86.00% | 97.50% | 86.80% | 83.50% | 88.22% | 97.69% | 88.14% | 86.48% | ||||
07-05-2016 | 86.50% | 97.70% | 87.50% | 84.00% | ||||||||
14-05-2016 | 87.00% | 98.00% | 87.70% | 84.50% | ||||||||
21-05-2016 | 87.50% | 98.20% | 88.00% | 85.00% | Official GSOC start date | |||||||
23-05-2016 | 87.70% | 98.20% | 88.00% | 85.00% | ||||||||
1-06-2016 | 88.00% | 98.50% | 88.30% | 85.50% | ||||||||
8-06-2016 | 88.30% | 98.70% | 88.50% | 85.70% | ||||||||
16-06-2016 | 88.50% | 99.00% | 88.70% | 86.00% | ||||||||
27-06-2016 | 89.00% | 99.20% | 89.00% | 86.50% | Midterm evaluation | |||||||
02-07-2016 | 89.30% | 99.30% | 89.30% | 86.80% | ||||||||
09-07-2016 | 89.70% | 99.40% | 89.70% | 87.00% | ||||||||
16-07-2016 | 90.00% | 99.40% | 90.00% | 87.30% | ||||||||
23-07-2016 | 90.50% | 99.50% | 90.30% | 87.70% | ||||||||
30-07-2016 | 90.70% | 99.60% | 90.70% | 88.00% | ||||||||
06-08-2016 | 91.00% | 99.70% | 91.00% | 88.50% | ||||||||
13-08-2016 | 91.50% | 99.80% | 91.50% | 89.00% | ||||||||
23-08-2016 | 92.00% | 99.90% | 92.00% | 90.00% | Final target |