User:Kamush/GSoC2021ProgresReport

From Apertium
Jump to navigation Jump to search

Progress Report

Time Period Goal Bidix Coverage WER,PER Details/Comments
kaz-uzb kaz-uzb kaz-uzb uzb-kaz
Community Bonding Period

May 17-June 5

  • Installing Apertium
  • Initialize kaz-uzb pair
  • Collect data in both languages
- - - -
  • Installing Apertium and necessary tools;
  • Send the first PR that can translate a small sample text;
  • Extract Uzbek and Kazakh wiki corpus;
  • Collect Uzbek and Kazakh web(non-wiki) corpus;
  • Collect Kazakh-Uzbek dictionary and parallel corpora;
Week 1

June 6-12

Make Uzbek better - - - -
  • Go through all Uzbek stems in uzb.lexc;
  • Clean(deduplicate) and correct uzb stems;
  • Improve Uzbek lexicon;
Week 2

June 13-19

Expand bilingual dictionary - - - -
  • Start adding bilingual dictionary elements;
Week 3

June 20-26

More on .dix and .lrx - - - -
  • Expand bilingual dictionary;
  • Lexical selection rules;
Week 4

June 27-July 3

Focus on transfer rules - - - -
  • Expand bilingual dictionary;
  • Lexical selection rules;
Week 5

July 4-10

Test translator and expand more - - - -
  • Test the kaz-uzb translator;
  • Expand the Uzbek lexicon with missing words;
  • Expand bilingual dictionary;
  • Expand lexical selection rules;
Week 6

July 11-17

Focus more on transfer rules - - - -
  • Work more on transfer rules;
  • More bilingual dictionary;
  • More lexical section rules;
Week 7

July 18-24

Test the kaz-uzb translator - - - -
  • Test the kaz-uzb translator;
  • Extend the Uzbek lexicon with missing words;
  • Extend the Kazakh lexicon with missing words;
  • Extend bilingual dictionary;
  • Add more lexical selection rules;
Week 8

July 25-31

Focus on transfer rules - - - -
  • Add words, rules;
  • Work on transfer rules;
  • Start the testvoc;
Week 9

August 1-7

Focus on testvoc - - - -
  • Add words, rules;
  • Transfer rules kaz-uzb;
  • Testvoc kaz-uzb
Week 10

August 8-14

Finalize work - - - -
  • Test the kaz-uzb translator;
  • Check the transfer rules;
  • Check the testvoc;
  • Write the final report;