Difference between revisions of "User:Kamush/GSoC2021ProgresReport"

From Apertium
Jump to navigation Jump to search
(Created page with "Report is coming really soon")
 
(Starting progress report)
Line 1: Line 1:
==Progress Report==
Report is coming really soon

{| class="wikitable" border="1"
!Time Period
!Goal
!Details
|-
|Community Bonding Period
May 17-June 5
|
* Installing Apertium
* Initialize kaz-uzb pair
* Collect data in both languages
|
* Installing Apertium and necessary tools;
* Send the first PR that can translate a small sample text;
* Extract Uzbek and Kazakh wiki corpus;
* Collect Uzbek and Kazakh web(non-wiki) corpus;
* Collect Kazakh-Uzbek dictionary and parallel corpora;
|-
|Week 1
June 6-12
|Make Uzbek better
|
* Go through all Uzbek stems in uzb.lexc;
* Clean(deduplicate) and correct uzb stems;
* Improve Uzbek lexicon;
|-
|Week 2
June 13-19
| Expand bilingual dictionary
|
* Start adding bilingual dictionary elements;
|-
|Week 3
June 20-26
| More on .dix and .lrx
|
* Expand bilingual dictionary;
* Lexical selection rules;
|-
|Week 4
June 27-July 3
|Focus on transfer rules
|
* Expand bilingual dictionary;
* Lexical selection rules;
|-
|Week 5
July 4-10
|Test translator and expand more
|
* Test the kaz-uzb translator;
* Expand the Uzbek lexicon with missing words;
* Expand bilingual dictionary;
* Expand lexical selection rules;
|-
|Week 6
July 11-17
|Focus more on transfer rules
|
* Work more on transfer rules;
* More bilingual dictionary;
* More lexical section rules;
*
|-
|Week 7
July 18-24
|Test the kaz-uzb translator
|
* Test the kaz-uzb translator;
* Extend the Uzbek lexicon with missing words;
* Extend the Kazakh lexicon with missing words;
* Extend bilingual dictionary;
* Add more lexical selection rules;
|-
|Week 8
July 25-31
|Focus on transfer rules
|
* Add words, rules;
* Work on transfer rules;
* Start the testvoc;
|-
|Week 9
August 1-7
|Focus on testvoc
|
* Add words, rules;
* Transfer rules kaz-uzb;
* Testvoc kaz-uzb
|-
|Week 10
August 8-14
|Finalize work
|
* Test the kaz-uzb translator;
* Check the transfer rules;
* Check the testvoc;
* Write the final report;
|-
|}

Revision as of 10:00, 10 July 2021

Progress Report

Time Period Goal Details
Community Bonding Period

May 17-June 5

  • Installing Apertium
  • Initialize kaz-uzb pair
  • Collect data in both languages
  • Installing Apertium and necessary tools;
  • Send the first PR that can translate a small sample text;
  • Extract Uzbek and Kazakh wiki corpus;
  • Collect Uzbek and Kazakh web(non-wiki) corpus;
  • Collect Kazakh-Uzbek dictionary and parallel corpora;
Week 1

June 6-12

Make Uzbek better
  • Go through all Uzbek stems in uzb.lexc;
  • Clean(deduplicate) and correct uzb stems;
  • Improve Uzbek lexicon;
Week 2

June 13-19

Expand bilingual dictionary
  • Start adding bilingual dictionary elements;
Week 3

June 20-26

More on .dix and .lrx
  • Expand bilingual dictionary;
  • Lexical selection rules;
Week 4

June 27-July 3

Focus on transfer rules
  • Expand bilingual dictionary;
  • Lexical selection rules;
Week 5

July 4-10

Test translator and expand more
  • Test the kaz-uzb translator;
  • Expand the Uzbek lexicon with missing words;
  • Expand bilingual dictionary;
  • Expand lexical selection rules;
Week 6

July 11-17

Focus more on transfer rules
  • Work more on transfer rules;
  • More bilingual dictionary;
  • More lexical section rules;
Week 7

July 18-24

Test the kaz-uzb translator
  • Test the kaz-uzb translator;
  • Extend the Uzbek lexicon with missing words;
  • Extend the Kazakh lexicon with missing words;
  • Extend bilingual dictionary;
  • Add more lexical selection rules;
Week 8

July 25-31

Focus on transfer rules
  • Add words, rules;
  • Work on transfer rules;
  • Start the testvoc;
Week 9

August 1-7

Focus on testvoc
  • Add words, rules;
  • Transfer rules kaz-uzb;
  • Testvoc kaz-uzb
Week 10

August 8-14

Finalize work
  • Test the kaz-uzb translator;
  • Check the transfer rules;
  • Check the testvoc;
  • Write the final report;