Difference between revisions of "Apertium-kaz-kir/Workplan"
Firespeaker (talk | contribs) (→Workplan: WER) |
Firespeaker (talk | contribs) |
||
Line 144: | Line 144: | ||
# clean testvoc for {{tag|adj}} {{tag|adj}}{{tag|advl}} |
# clean testvoc for {{tag|adj}} {{tag|adj}}{{tag|advl}} |
||
# trimmed coverage 68% |
# trimmed coverage 68% |
||
|{{Workeval5|3}} |
|||
| |
|||
|rowspan="3"| |
|rowspan="3"| |
||
# stems in dix: 5552 |
# stems in dix: 5552 |
||
# trimmed coverage: |
# trimmed coverage: 72%,67% |
||
# azattyq_24455849 WER: 18.01% |
# azattyq_24455849 WER: 18.01% |
||
|rowspan=" |
|rowspan="2"| |
||
* good improvement in dix |
|||
** should be checking for errors (e.g., extra spaces) |
|||
* not much progress with WER text |
|||
** simple lrx and t1x should be enough here |
|||
* no indication of progress with testvoc |
|||
* better communication and commit frequency, but could still improve |
|||
—[[User:Firespeaker|Firespeaker]] 18:21, 1 August 2013 (UTC) |
|||
|- |
|- |
||
! 7 |
! 7 |
||
Line 156: | Line 163: | ||
# total 6400 stems in dix |
# total 6400 stems in dix |
||
# trimmed coverage 70% |
# trimmed coverage 70% |
||
|{{Workeval5|2}} |
|||
| |
|||
|- |
|- |
||
!colspan="2" style="text-align: right"| [[Apertium-kaz-kir/TODO#By_midterm|midterm eval]]<br />2 August |
!colspan="2" style="text-align: right"| [[Apertium-kaz-kir/TODO#By_midterm|midterm eval]]<br />2 August |
||
Line 163: | Line 170: | ||
# 500-word evaluation, WER ~10% |
# 500-word evaluation, WER ~10% |
||
# trimmed coverage 72% |
# trimmed coverage 72% |
||
|{{Workeval5|2}} |
|||
| |
|||
|* overall progress has been mediocre |
|||
| |
|||
* among the lowest-performing students |
|||
| |
|||
* noticeable improvement in the last few weeks |
|||
* needs to improve more to pass the final |
|||
|- |
|- |
||
! 8 |
! 8 |
Revision as of 18:21, 1 August 2013
Major goals
- Good WER
- Clean testvoc
- 12'000 stems in bidix (~1000 stems per week, or ~200 per working day)
- Sort Adjective and Noun stems in kir.lexc into appropriate categories
- Trimmed coverage approaching 90%
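To make the WER goal concrete, here is a minimal sketch of how a word error rate could be computed. It assumes WER means the word-level edit distance between the translator's output and a post-edited reference, divided by the length of the reference. The function name `word_error_rate` is hypothetical and is not the evaluation tool used for the figures on this page.

```python
def word_error_rate(hypothesis: str, reference: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: tokens are separated by whitespace; no normalisation of
    case or punctuation is applied.
    """
    hyp = hypothesis.split()
    ref = reference.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the distance between the reference words seen so far
    # (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from the hypothesis
                curr[j - 1] + 1,     # extra word in the hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

With this definition, one wrong word in a 100-word reference gives a WER of 1%, so a target of roughly 10% on a 500-word text allows about 50 word-level edits.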
Schedule
Timeline
See the GSoC 2013 Timeline for the complete timeline. Important coding dates follow:
- June 17th: coding begins
- July 29th - August 2nd: midterm evaluations
- September 16th - September 23rd: pencils down
- September 27th: final evaluation
Workplan
week | dates | goals | eval | accomplishments | notes |
---|---|---|---|---|---|
post-application period | 3 - 24 May | | | | |
community bonding period | 27 May - 16 June | note: should be in IRC every day | | | —Firespeaker 02:28, 2 July 2013 (UTC) |
1 | 17 - 22 June | | | | |
2 | 23 - 29 June | | | | |
3 | 30 June - 6 July | | | | —Firespeaker 20:43, 8 July 2013 (UTC) |
4 | 7 - 13 July | | | | —Firespeaker 22:16, 22 July 2013 (UTC) |
5 | 14 - 20 July | | | | |
6 | 21 - 27 July | | | | —Firespeaker 18:21, 1 August 2013 (UTC) |
7 | 28 July - 3 August | | | | |
midterm eval | 2 August | | | | overall progress has been mediocre |
8 | 4 - 10 August | | | | |
9 | 11 - 17 August | | | | |
10 | 18 - 24 August | | | | |
11 | 25 - 31 August | | | | |
12 | 1 - 7 September | | | | |
13 | 8 - 15 September | | | | |
pencils-down week, final evaluation | 16 - 23 September | | | | |
Tips and Tricks
Adding stems quickly
- Add top stems from frequency lists of unknown forms
- Use spectie's dix-entries-to-be-checked script
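The first tip above can be sketched as follows. This is only an illustration and not spectie's script: it assumes the input is translator output in which unknown words carry a leading `*`, and the function names `unknown_form_frequencies` and `naive_coverage` are hypothetical. The coverage figure it returns is a naive token ratio, which may differ from how trimmed coverage is measured for this pair.

```python
import re
from collections import Counter

# A token is a run of word characters, optionally preceded by the
# asterisk that is assumed to mark unknown words.
TOKEN = re.compile(r"\*?\w+")


def unknown_form_frequencies(marked_text: str) -> Counter:
    """Count how often each unknown (asterisk-marked) form occurs."""
    tokens = TOKEN.findall(marked_text)
    return Counter(t[1:].lower() for t in tokens if t.startswith("*"))


def naive_coverage(marked_text: str) -> float:
    """Share of tokens that are not marked as unknown."""
    tokens = TOKEN.findall(marked_text)
    if not tokens:
        return 0.0
    known = sum(1 for t in tokens if not t.startswith("*"))
    return known / len(tokens)


if __name__ == "__main__":
    # Made-up sample line; the words are placeholders only.
    sample = "бұл *кітап жақсы , *кітап *үй"
    for form, count in unknown_form_frequencies(sample).most_common(10):
        print(count, form)
    print(f"coverage: {naive_coverage(sample):.0%}")
```

Working through the list from the most frequent unknown form downwards raises coverage fastest per stem added.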