User:Elmurod1202/GSoC2020Progress


The original proposal can be seen here

Status table

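Week | Dates             | Stems (uzb) | Stems (tur-uzb) | WER (tur-uzb) | PER (tur-uzb) | Naïve coverage (uzb) | Naïve coverage (tur-uzb) | Evaluation         | Notes
-----+-------------------+-------------+-----------------+---------------+---------------+----------------------+--------------------------+--------------------+------------------------
0    | May 4 - May 31    | 34375       | 2412            | 90.80 %       | 81.60 %       | 89.57 %              | 72.14 %                  | Initial evaluation | As of the end of May
1    | June 1 - June 7   |             |                 |               |               |                      |                          |                    |
2    | June 8 - June 14  |             |                 |               |               |                      |                          |                    |
3    | June 15 - June 21 |             |                 |               |               |                      |                          |                    |
4    | June 22 - June 28 |             |                 |               |               |                      |                          |                    |
5    | June 29 - July 5  | 34373       | 2445            | 84.45 %       | 76.80 %       | 90.23 %              | 72.14 %                  | First evaluation   | End of June - ~July 3
6    | July 6 - July 12  |             |                 |               |               |                      |                          |                    |
7    | July 13 - July 19 |             |                 |               |               |                      |                          |                    |
8    | July 20 - July 26 |             |                 |               |               |                      |                          |                    |
9    | July 27 - Aug 2   | 34424       | 4191            | 78.70 %       | 68.34 %       | 90.23 %              | 72.74 %                  | Second evaluation  | As of July 31 - Aug 1
10   | Aug 3 - Aug 9     |             |                 |               |               |                      |                          |                    |
11   | Aug 10 - Aug 16   |             |                 |               |               |                      |                          |                    |
12   | Aug 17 - Aug 23   |             |                 |               |               |                      |                          |                    |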
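
For readers unfamiliar with the WER column, the following is a minimal sketch of how a word error rate can be computed: the word-level edit distance between a reference translation and the system output, divided by the number of reference words. It is not the script used to produce the figures in the table; the function name `word_error_rate` and the whitespace tokenisation are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; tokens are split on whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = previous[j] + 1       # reference word missing in output
            insertion = current[j - 1] + 1   # extra word in output
            current.append(min(substitution, deletion, insertion))
        previous = current

    return previous[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of three reference words.
    print(f"{word_error_rate('men kitob oqidim', 'men kitobni oqidim'):.2%}")
```

A lower value is better, so the drop in the WER column across evaluations indicates that the output moved closer to the reference translation.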

To Do

Week 1-4

  • Introducing apertium-separable to the tur-uzb pair;
  • Adding more stems to the bilingual dictionary;
  • Refactoring the transfer rules;
  • Improving WER and coverage;
  • Running tests;
  • Updating the documentation;
  • Preparing for the first evaluation.

Ongoing

  • Calculating the initial naïve coverage of the monolingual apertium-uzb package;
  • Calculating the initial naïve coverage of the bilingual apertium-tur-uzb package.
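
As a rough illustration of what naïve coverage measures, the sketch below counts the share of tokens that receive at least one analysis in analyser output written in the Apertium stream format, where an unknown token appears as `^surface/*surface$`. The function name `naive_coverage`, the sample sentence, and its tags are assumptions made for illustration, and escaped characters in the stream are not handled.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
# Simplification: escaped characters such as \^ or \/ are not handled.
LEXICAL_UNIT = re.compile(r"\^([^/$^]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Fraction of lexical units that have at least one analysis."""
    total = 0
    known = 0
    for match in LEXICAL_UNIT.finditer(analysed_text):
        total += 1
        # An unknown token carries a single pseudo-analysis starting with '*'.
        if not match.group(2).startswith("*"):
            known += 1
    return known / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical analyser output; the tags are illustrative only.
    sample = (
        "^Men/men<prn><pers><p1><sg><nom>$ "
        "^kitob/kitob<n><nom>$ "
        "^xyzzy/*xyzzy$"
        "^./.<sent>$"
    )
    print(f"{naive_coverage(sample):.2%}")
```

The measure is called naïve because it only checks that some analysis exists for each token, not that the correct one is among them.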

Done

Community bonding period (May 4 - June 1):

  • Getting familiar with the Apertium tools and community
  • Finding out the current state of Uzbek language support
  • Finding out which Uzbek resources are available
  • Learning more about HFST
  • Completing the coding challenge
  • Beginning to interact with Apertium's core system

Notes