Difference between revisions of "User:Firespeaker/GSoC2014/Workplan"
Firespeaker (talk | contribs)
Line 24 (this revision adds "(with kaz-like transducer)" to the three coverage goals):
!colspan="2" style="text-align: right"|post-application period<br />22 March - 20 April
|
* [[apertium-kir]] to 90% coverage (with kaz-like transducer)
* [[apertium-tur]] to 90% coverage (with kaz-like transducer)
* [[apertium-uzb]] to 90% coverage (with kaz-like transducer)
* build arsenal of texts with post-edited translations:
** four 200-word texts in each kaz, kir, tur, uzb
Revision as of 05:46, 13 March 2014
Major goals
- A production-ready release of kaz-kir
  - Translates kaz→kir and kir→kaz with consistently <10% WER
  - Trimmed coverage for kaz and kir ≥90%
- A production-ready release of tur-kir
  - Translates tur→kir and kir→tur with consistently <10% WER
  - Trimmed coverage for tur and kir ≥90%
- A stable release of uzb-tur
  - Translates tur→uzb and uzb→tur with consistently <20% WER
  - Trimmed coverage for tur and uzb ≥80%
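The WER targets above can be checked with a small script. The following is a minimal sketch, assuming WER means the word-level edit distance between the machine translation and a post-edited reference, divided by the number of words in the reference. The function name `word_error_rate` is hypothetical and is not part of any Apertium tool.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: `reference` is the post-edited translation and
    `hypothesis` is the raw machine translation output.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a 200-word text meets the <10% target when fewer than 20 word-level edits are needed to turn the output into the post-edited version.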
Schedule
See the GSoC 2014 Timeline for the complete timeline. Dates need to be verified.
week | dates | goals | eval | accomplishments | notes |
---|---|---|---|---|---|
post-application period | 22 March - 20 April | | | | |
community bonding period | 21 April - 19 May | | | | |
1 | 19 - 24 May | | | | |
2 | 25 - 31 May | | | | |
3 | 1 - 7 June | | | | |
4 | 8 - 14 June | | | | |
5 | 15 - 21 June | | | | |
6 | 22 - 28 June | | | | |
7 | 29 June - 5 July | | | | |
midterm eval | 6 July | | | | |
8 | 6 - 12 July | | | | |
9 | 13 - 19 July | | | | |
10 | 20 - 26 July | | | | |
11 | 27 July - 2 August | | | | |
12 | 3 - 9 August | | | | |
13 | 10 - 18 August | | | | |
pencils-down week, final evaluation | 18 August - 24 August | | | | |
Getting started
- make scripts for:
  - getting raw numbers for [[User:Firespeaker/GSoC2014/Workplan|Progress]]
  - doing regression tests
- get updated corpora for:
  - Uzbek
  - Turkish
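One of the raw numbers for Progress is coverage. The following is a minimal sketch of a coverage script, assuming the corpus has already been run through an Apertium analyser and the output is in the usual stream format, where each token looks like `^surface/analysis$` and an unknown token is marked as `^surface/*surface$`. The function name `naive_coverage` and the sample tags are illustrative only, and escaped characters inside tokens are not handled.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$^]*)/([^$]*)\$")


def naive_coverage(analysed: str) -> float:
    """Fraction of tokens that received at least one analysis.

    Assumption: unknown tokens are those whose first analysis
    starts with '*', as in ^foo/*foo$.
    """
    units = LEXICAL_UNIT.findall(analysed)
    if not units:
        raise ValueError("no lexical units found in analyser output")
    known = sum(1 for _surface, analyses in units
                if not analyses.startswith("*"))
    return known / len(units)
```

Running the same function on the output of a trimmed analyser would presumably give the trimmed coverage figure named in the major goals.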
Recurring
- At the end of every week:
  - Update Progress
- Constantly:
  - Add good sentences to regression tests
  - Clean up lexc files
    - remove duplicate entries
    - alphabetise sections?
    - add glosses, etc.
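Finding duplicate entries in lexc files can be partly automated. The following is a rough sketch, assuming each entry sits on its own line, `!` starts a comment unless escaped with `%`, and two entries count as duplicates when they are identical within the same `LEXICON` after comments and extra whitespace are removed. The helper names are hypothetical, and a `!` inside a quoted string is not treated specially.

```python
from collections import defaultdict


def strip_comment(line: str) -> str:
    """Drop a lexc comment: everything from an unescaped '!' onwards."""
    out = []
    escaped = False
    for ch in line:
        if escaped:
            out.append(ch)
            escaped = False
        elif ch == "%":      # '%' escapes the next character in lexc
            out.append(ch)
            escaped = True
        elif ch == "!":
            break
        else:
            out.append(ch)
    return "".join(out)


def find_duplicates(lexc_text: str) -> dict:
    """Map (lexicon, normalised entry) to the line numbers where it occurs,
    keeping only entries that occur more than once."""
    seen = defaultdict(list)
    lexicon = None
    for number, raw in enumerate(lexc_text.splitlines(), start=1):
        entry = " ".join(strip_comment(raw).split())
        if not entry:
            continue
        if entry.startswith("LEXICON "):
            lexicon = entry.split()[1]
            continue
        if lexicon is None:
            continue  # header material before the first LEXICON
        seen[(lexicon, entry)].append(number)
    return {key: lines for key, lines in seen.items() if len(lines) > 1}
```

The reported line numbers make it possible to review each duplicate by hand before deleting it, which matters when the two copies carry different glosses in their comments.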