Difference between revisions of "User:Firespeaker/GSoC2014/Workplan"
Line 111: | Line 111:
* Brought testvoc of apertium-tur-kir on SETimes corpus down from 0.22% to 0.04%
|
* Made and presented poster for Morphology Fest
|-
! 6 !! 22 - 28 June

Line 119: | Line 120:
* work on uzb CG and lrx
* continue testvoc nouns for all pairs
|{{Workeval5|0}}
|
|
* Moving and getting situated week
* (break for personal reasons)
|-
!colspan="2" style="text-align: right"|midterm eval<br />29 June

Line 132: | Line 134:
* tur(-uzb) trimmed coverage ≥80%
* uzb(-tur) trimmed coverage ≥80%
|{{Workeval5|3}}
|
|-
! 7 !! 29 June - 5 July
|
* get texts for kaz-kir translating
* finish testvoc nouns for all pairs
|-
! 8 !! 6 - 12 July
|
* clean up kir.lexc
* testvoc adjs for all pairs
|-
! 9 !! 13 - 19 July
|
*
* corpus testvoc for kaz-kir
|-
! 10 !! 20 - 26 July
|
*
* testvoc v.iv for all pairs
|-
! 11 !! 27 July - 2 August
|
*
* testvoc v.tv categories for all pairs
|-
! 12 !! 3 - 10 August
|
*
* testvoc adverbs for all pairs
* testvoc misc categories for all pairs
|-
!colspan="2" style="text-align: right"|pencils-down week<br />final evaluation<br />11 August - 18 August
Latest revision as of 18:50, 2 July 2014
Primary goals
- A production-ready release of kaz-kir
  - Translates kaz→kir and kir→kaz with consistently <10% WER
  - Trimmed coverage for kaz and kir ≥90%
- A production-ready release of tur-kir
  - Translates tur→kir and kir→tur with consistently <20% WER
  - Trimmed coverage for tur and kir ≥85%
- A stable release of uzb-tur
  - Translates tur→uzb and uzb→tur with consistently <25% WER
  - Trimmed coverage for tur and uzb ≥80%
- While bidix size is not built into the goals, the trimmed coverage numbers can be seen as a more relevant proxy for the same basic idea.
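The two metrics behind these goals can be illustrated with a minimal sketch. It is not the evaluation tooling used for the project: `word_error_rate` is a plain word-level edit distance divided by the reference length, and `coverage` assumes analyser output in the Apertium stream format, where each token appears as `^surface/analysis$` and an unknown token is marked as `^surface/*surface$`. Both function names are hypothetical.

```python
import re

# Assumption: analyser output uses the Apertium stream format,
# one lexical unit per ^...$ span, unknown words marked with "/*".
LEXICAL_UNIT = re.compile(r"\^(.*?)\$")


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def coverage(analysis_output):
    """Share of lexical units that received at least one analysis."""
    units = LEXICAL_UNIT.findall(analysis_output)
    if not units:
        raise ValueError("no lexical units found")
    known = sum(1 for unit in units if "/*" not in unit)
    return known / len(units)
```

Under these assumptions, a goal such as "<10% WER" corresponds to `word_error_rate(...) < 0.10` against a post-edited reference, and "trimmed coverage ≥90%" to `coverage(...) >= 0.90` on the output of the trimmed analyser.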
Plan
Schedule
See GSoC 2014 Timeline for complete timeline.
week | dates | goals | eval | accomplishments | notes |
---|---|---|---|---|---|
post-application period | 22 March - 20 April | | | | |
community bonding period | 21 April - 19 May | | | | |
1 | 19 - 24 May | | | | |
2 | 25 - 31 May | | | | |
3 | 1 - 7 June | | | | |
4 | 8 - 14 June | | | | |
5 | 15 - 21 June | | | | |
6 | 22 - 28 June | | | | |
midterm eval | 29 June | | | | |
7 | 29 June - 5 July | | | | |
8 | 6 - 12 July | | | | |
9 | 13 - 19 July | | | | |
10 | 20 - 26 July | | | | |
11 | 27 July - 2 August | | | | |
12 | 3 - 10 August | | | | |
pencils-down week, final evaluation | 11 August - 18 August | | | | |
Getting started
- make scripts for:
- get updated corpora for:
  - Uzbek
  - Turkish
Recurring
- The end of every week:
  - Update Progress
- Constantly:
  - Add good sentences to regression tests
  - Clean up lexc files
    - remove duplicate entries
    - alphabetise sections?
    - add glosses, etc.
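The "remove duplicate entries" step could be approached with a small checker along the following lines. This is a rough sketch rather than an existing Apertium script: the function name `duplicate_entries` is hypothetical, it assumes one lexc entry per line ending in `;`, and it treats an unescaped `!` as the start of a comment. It compares entries across the whole file, so identical entries that legitimately live in different `LEXICON` sections would also be reported.

```python
import re
from collections import Counter

# Assumption: "!" starts a lexc comment unless escaped as "%!".
COMMENT = re.compile(r"(?<!%)!.*$")


def duplicate_entries(lexc_text):
    """Return entries that occur more than once, ignoring comments and spacing."""
    counts = Counter()
    for line in lexc_text.splitlines():
        entry = COMMENT.sub("", line)
        entry = " ".join(entry.split())  # normalise whitespace
        if not entry.endswith(";"):
            continue  # skip blank lines and headers such as "LEXICON Nouns"
        counts[entry] += 1
    return sorted(entry for entry, n in counts.items() if n > 1)


if __name__ == "__main__":
    sample = (
        "LEXICON Nouns\n"
        'alma:alma N1 ; ! "apple"\n'
        "kitep:kitep N1 ;\n"
        'alma:alma   N1 ; ! "apple (repeated)"\n'
    )
    for entry in duplicate_entries(sample):
        print(entry)
```

Because glosses sit in comments, two entries that differ only in their gloss are still reported as duplicates, which is the case a manual clean-up would usually want to catch.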