Difference between revisions of "User:Aboelhamd/progress"
Line 1: | Line 1: | ||
'''GSOC 2019 : Extend weighted transfer rules '''progress'''''' |
'''GSOC 2019 : Extend weighted transfer rules '''progress''' ''' |
||
I will upload the code in this repo[https://github.com/aboelhamd/machine-translation].<br /> |
I will upload the code in this repo[https://github.com/aboelhamd/machine-translation].<br /> |
||
== Phase 1 (April 19 : May 16) == |
== Phase 1 (April 19 : May 16) == |
Revision as of 20:56, 20 April 2019
GSOC 2019 : Extend weighted transfer rules progress
I will upload the code in this repo[1].
Contents
- 1 Phase 1 (April 19 : May 16)
- 2 Phase 2 (June 28 : July 25)
- 3 Phase 3 (July 26 : August 19)
Phase 1 (April 19 : May 16)
From April 19 to May 16 and from June 21 to June 28.
It's shifted because of my exams.
Week 1 (April 19 : April 25)
Day 1 (Friday April 19)
Latest evaluation scores were lower than traditional apertium's LRLM resolution, by far, unfortunately.
Debugged the code to see what's the cause of such low evaluation score.
Figured out that there is a bug in normalizing the LM scores of the target ambiguous sentences. The LM score is a log base 10 the probability of the sentence, and as the magnitude gets higher, the sentence normalizing probability should get lower, and I was doing the inverse of that.
The easiest solution was to modify the score-sentences script, and instead of getting the score, will get its reciprocal.
Day 2 (Saturday April 20)
The evaluation results were better, but still not better than traditional apertium's LRLM resolution.
Debugged the code to see why the score still worse than apertium's.
Found a bug in generating the ambiguous combinations and still working on solving it.
The bug resulted some incomplete sentences like:
Sentence : Reciprocal of negative LM score
Resumption of the period of sessions : 0.0552026445652
Resumption of session period : 0.0739337147853
Resumption of the session period : 0.0753641871191
Resumption of the period of : 0.0757469108192
Resumption of period of : 0.0684245152522
Resumption of the period : 0.0809504851312
And as shown, the best score went to incomplete sentence, which is one of the reason why we got bad evaluation score.
Day 3 (Sunday April 21)
Day 4 (Monday April 22)
Day 5 (Tuesday April 23)
Day 6 (Wednesday April 24)
Day 7 (Thursday April 25)
Week 2 (April 26 : May 2)
Day 1 (Friday April 26)
Day 2 (Saturday April 27)
Day 3 (Sunday April 28)
Day 4 (Monday April 29)
Day 5 (Tuesday April 30)
Day 6 (Wednesday May 1)
Day 7 (Thursday May 2)
Week 3 (May 3 : May 9)
Day 1 (Friday May 3)
Day 2 (Saturday May 4)
Day 3 (Sunday May 5)
Day 4 (Monday May 6)
Day 5 (Tuesday May 7)
Day 6 (Wednesday May 8)
Day 7 (Thursday May 9)
Week 4 (May 10 : May 16)
Day 1 (Friday May 10)
Day 2 (Saturday May 11)
Day 3 (Sunday May 12)
Day 4 (Monday May 13)
Day 5 (Tuesday May 14)
Day 6 (Wednesday May 15)
Day 7 (Thursday May 16)
Week 5 (June 21 : June 27)
After my exams.
Day 1 (Friday June 21)
Day 2 (Saturday June 22)
Day 3 (Sunday June 23)
Day 4 (Monday June 24)
Day 5 (Tuesday June 25)
Day 6 (Wednesday June 26)
Day 7 (Thursday June 27)
Phase 2 (June 28 : July 25)
Week 1 (June 28 : July 4)
Day 1 (Friday June 28)
Day 2 (Saturday June 29)
Day 3 (Sunday June 30)
Day 4 (Monday July 1)
Day 5 (Tuesday July 2)
Day 6 (Wednesday July 3)
Day 7 (Thursday July 4)
Week 2 (July 5 : July 11)
Day 1 (Friday July 5)
Day 2 (Saturday July 6)
Day 3 (Sunday July 7)
Day 4 (Monday July 8)
Day 5 (Tuesday July 9)
Day 6 (Wednesday July 10)
Day 7 (Thursday July 11)
Week 3 (July 12 : July 18)
Day 1 (Friday July 12)
Day 2 (Saturday July 13)
Day 3 (Sunday July 14)
Day 4 (Monday July 15)
Day 5 (Tuesday July 16)
Day 6 (Wednesday July 17)
Day 7 (Thursday July 18)
Week 4 (July 19 : July 25)
Day 1 (Friday July 19)
Day 2 (Saturday July 20)
Day 3 (Sunday July 21)
Day 4 (Monday July 22)
Day 5 (Tuesday July 23)
Day 6 (Wednesday July 24)
Day 7 (Thursday July 25)