Difference between revisions of "User:Elmurod1202/GSoC2020Progress"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) |
Firespeaker (talk | contribs) |
||
Line 27: | Line 27: | ||
| May 4 - May 31 |
| May 4 - May 31 |
||
| 34375 |
| 34375 |
||
| |
| 4359 |
||
| 90.80 % |
| 90.80 % |
||
| 81.60 % |
| 81.60 % |
||
Line 52: | Line 52: | ||
| June 29 - July 5 |
| June 29 - July 5 |
||
| 34373 |
| 34373 |
||
| 4360 |
|||
| |
|||
| |
| |
||
| |
| |
Revision as of 18:56, 31 July 2020
The original proposal can be seen here
Contents
Status table
Week | Stems | Tur-Uzb | Uzb-Tur | Naïve Coverage | Progress | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
№ | Dates | uzb | tur-uzb | WER | PER | WER | PER | uzb | tur-uzb | Evaluation | Notes |
0 | May 4 - May 31 | 34375 | 4359 | 90.80 % | 81.60 % | 97.01 % | 92.36 % | 89.57 % | 72.14 % | Initial evaluation | |
1 | June 1 - June 7 | ||||||||||
2 | May 8 - June 14 | ||||||||||
3 | June 15 - June 21 | ||||||||||
4 | June 22 - June 28 | ||||||||||
5 | June 29 - July 5 | 34373 | 4360 | 72.14 % | |||||||
6 | July 6 - July 12 | ||||||||||
7 | July 13 - July 19 | ||||||||||
8 | July 20 - July 26 | ||||||||||
9 | July 27 - Aug 2 | ||||||||||
10 | July 3 - Aug 9 | ||||||||||
11 | Aug 10 - Aug 16 | ||||||||||
12 | Aug 17 - Aug 23 |
To Do
Week 1-4
- Introducing apertium-separable to the tur-uzb pair
- Adding more stems to bilingual dictionary;
- Transfer rules refactoring;
- Increasing WER coverage;
- Running tests
- Updating documentation
- Preparing for the first evaluation
Ongoing
- Calculating initial naive coverage of monolingual apertium-uzb;
- Calculating initial naive coverage of bilingual apertium-tur-uzb;
Done
Community bonding period (May 4 - June 1):
- Getting closer with Apertium tools and community
- Finding out the current state of Uzbek language
- Finding out the availability of Uzbek resources
- Learning more about the HFST
- Doing coding challenge
- Begin interacting with Apertium's core system