Difference between revisions of "Uighur and Turkish/Work plan"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
{| class="wikitable" |
{| class="wikitable" |
||
!Week |
!Week |
||
! Cov. |
! Cov. goal |
||
! CG |
! CG goal |
||
! Transfer |
! Transfer goal |
||
! Lexsel |
! Lexsel goal |
||
! Corpusvoc |
! Corpusvoc goal |
||
! Done? |
! Done? |
||
! Coverage |
! Coverage |
||
Line 17: | Line 17: | ||
| |
| |
||
| |
| |
||
| 200000 |
|||
| |
|||
|style="background-color: green"| '''✓''' |
|style="background-color: green"| '''✓''' |
||
| 45.5 |
| 45.5 |
||
| 197742 |
|||
| |
|||
| |
| |
||
| Good work! |
| Good work! |
||
Line 29: | Line 29: | ||
| |
| |
||
| |
| |
||
| 198000 |
|||
| |
|||
|style="background-color: yellow"| '''½''' |
|style="background-color: yellow"| '''½''' |
||
| 65.6 |
| 65.6 |
||
| |
| 197742 |
||
| |
| |
||
| Good coverage, insufficient CG rules |
| Good coverage, insufficient CG rules |
Revision as of 10:12, 7 May 2018
Week | Cov. goal | CG goal | Transfer goal | Lexsel goal | Corpusvoc goal | Done? | Coverage | Errors | Checkpoint | Comments |
---|---|---|---|---|---|---|---|---|---|---|
April 23-29 | 45% | 5 | 200000 | ✓ | 45.5 | 197742 | Good work! | |||
April 30 - May 6 | 65% | 10 | 198000 | ½ | 65.6 | 197742 | Good coverage, insufficient CG rules | |||
May 7-13 | 67% | |||||||||
May 14-20 | 70% | |||||||||
May 21-27 | 75% | |||||||||
May 28-June 3 | 78% | |||||||||
June 4-10 | 82% | 20 | ||||||||
June 11-17 | 84% | Eval 1 | ||||||||
June 18-24 | 85% | |||||||||
June 25 - July 1 | 85% | 20 | ||||||||
July 2-8 | 86% | |||||||||
July 9-15 | 88% | Eval 2 | ||||||||
July 16-22 | 89% | |||||||||
July 23-29 | 90% | |||||||||
July 30 - August 5 | 91% | |||||||||
August 6-14 | 92% | Final Evals |
Plan by Weeks
- 30% coverage
- Basic CG
- 40% coverage
- Transfer
- 50% coverage
- Transfer, lexical selection, 65% coverage
- CG, 80% coverage
- Transfer, lexsel, 84% coverage
- Transfer
- CG, Transfer
- Transfer, lexsel, 86% coverage
- Transfer, 88% coverage
- Preparing text for annotation
- Annotating the Uyghur corpus, %90 coverage
- Annotating the Uyghur corpus, %90 coverage
- Annotating the Uyghur corpus, %90 coverage
Plan Outline
- Post-application period:
- Facilitating MT of a text from Uyghur to Turkish.
- Community-bonding period:
- bidix words, up to 50%
- Month 1:
- Writing scripts
- Adding words to bidix, get coverage to around 80%
- Chunking
- Transfer rules
- Begin CG for UIG
- Month 2:
- POS tagging/constraint grammar
- Transfer rules
- Get CG rules up to 100, ~50% disambiguation
- >90% coverage
- Month 3:
- Creation of an Annotated Corpus