Difference between revisions of "User:Firespeaker/GSoC2014/Workplan"


Revision as of 04:37, 13 March 2014

Major goals

Schedule

week | dates | goals | eval | accomplishments | notes
post-application period
22 March - 20 April
  • apertium-kir to 90% coverage
  • apertium-tur to 90% coverage
  • apertium-uzb to 90% coverage
  • build an arsenal of texts:
    • four 200-word texts in each of kaz, kir, tur, uzb
    • four 500-word texts in each of kaz, kir, tur, uzb
community bonding period
21 April - 19 May
  • tur-uzb bidix to 7000 stems
  • make some real CG for kir, uzb
  • one 200-word kaz-kir text to <10% WER
  • one 200-word kir-kaz text to <10% WER
  • one 200-word tur-kir text to <10% WER
  • one 200-word kir-tur text to <10% WER
  • one 200-word tur-uzb text to <10% WER
  • one 200-word uzb-tur text to <10% WER
1 19 - 24 May
  • one 200-word kir-tur text to <10% WER
  • one 200-word kir-kaz text to <10% WER
  • work on kir CG and lrx
2 25 - 31 May
  • one 200-word kaz-kir text to <10% WER
  • one 200-word tur-kir text to <10% WER
  • work on tur CG and lrx
3 1 - 7 June
  • one 200-word tur-uzb text to <10% WER
  • one 200-word uzb-tur text to <10% WER
  • work on uzb CG and lrx
4 8 - 14 June
  • one 500-word kir-tur text to <10% WER
  • one 500-word kir-kaz text to <10% WER
  • work on kir CG and lrx
5 15 - 21 June
  • one 500-word kaz-kir text to <10% WER
  • one 500-word tur-kir text to <10% WER
  • work on tur CG and lrx
6 22 - 28 June
  • one 500-word tur-uzb text to <10% WER
  • one 500-word uzb-tur text to <10% WER
  • work on uzb CG and lrx
7 29 June - 5 July
midterm eval: July 6
8 6 - 12 July
  • testvoc adjs for all pairs
9 13 - 19 July
  • testvoc nouns for all pairs
10 20 - 26 July
  • testvoc v.iv for all pairs
11 27 July - 2 August
  • testvoc v.tv categories for all pairs
12 3 - 9 August
  • testvoc adverbs for all pairs
13 10 - 18 August
  • testvoc misc categories for all pairs
pencils-down week
final evaluation
18 August - 24 August
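
The weekly goals above are stated as a word error rate (WER) threshold. As a rough illustration, WER can be sketched as the word-level edit distance between a reference translation and the system output, divided by the number of reference words. This is only a minimal sketch under that assumption: the workplan does not say which tool or tokenization is used, and the function names and the 10% default are illustrative, not part of Apertium.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference text must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def meets_wer_target(reference: str, hypothesis: str, target: float = 0.10) -> bool:
    """True if WER is strictly below the target (the '<10% WER' goal)."""
    return word_error_rate(reference, hypothesis) < target
```

Under this definition, a 200-word text meets the goal only if fewer than 20 word-level edits are needed to turn the system output into the reference.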

GSoC Timeline

See the GSoC 2014 Timeline for the complete timeline. Important coding dates follow:

  • March 10th - March 21st: application
  • April 21st - May 19th: community bonding
  • May 19th: coding begins
  •  ??: midterm evaluations
  • August 18th?: pencils down
  •  ??: final evaluation

Goals by time

  • Community bonding (4+ weeks):
    • apertium-kir, apertium-tur, apertium-uzb coverage to 90%
    • one 200-word text for each direction to <10% WER
    • make some real CG for kir, uzb
    • build an arsenal of four 200-word texts and four 500-word texts translated to all languages
    • tur-uzb bidix to 7000 stems
  • Coding period (13 weeks):
    • First half (7 weeks):
      • work on WER (one text per week)
      • beef up CG for each language
      • lrx, transfer as needed
    • Second half (6 weeks):
      • work on testvoc
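
The 90% coverage goal can be illustrated with a naive coverage count: the share of tokens for which the morphological analyser returns at least one analysis. The sketch below assumes the analyser output is in Apertium stream format, where each token appears as `^surface/analysis$` and an unknown token carries an asterisk, as in `^word/*word$`. The function name and the sample tags are hypothetical, and escaped `^`, `$` or `/` characters inside tokens are not handled.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^(.*?)\$")


def naive_coverage(analysed: str) -> float:
    """Fraction of lexical units with at least one non-unknown analysis."""
    units = LEXICAL_UNIT.findall(analysed)
    if not units:
        raise ValueError("no lexical units found in analyser output")

    known = 0
    for unit in units:
        parts = unit.split("/")
        # parts[0] is the surface form; unknown tokens have a single
        # "analysis" that starts with an asterisk.
        if len(parts) > 1 and not parts[1].startswith("*"):
            known += 1
    return known / len(units)


# Hypothetical analyser output: three known tokens, one unknown.
sample = "^a/a<n>$ ^b/*b$ ^c/c<v>$ ^d/d<adj>$"
print(f"coverage: {naive_coverage(sample):.0%}")
```

A figure computed this way counts tokens rather than distinct word forms, so frequent words weigh more heavily than rare ones.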