Difference between revisions of "Uighur and Turkish/Work plan"
Jump to navigation
Jump to search
(Created page with "*Post-application period: **Facilitating MT of a text from Uyghur to Turkish. *Community-bonding period: **bidix words, up to 50% *Month 1: **Writing scripts **Adding words ...") |
|||
(25 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
{| class="wikitable" |
|||
!Week |
|||
! Cov. goal |
|||
! CG goal |
|||
! Transfer goal |
|||
! Lexsel goal |
|||
! Corpusvoc goal |
|||
! Done? |
|||
! Coverage |
|||
! Errors |
|||
! Checkpoint |
|||
! Comments |
|||
|- |
|||
|April 23-29 |
|||
|45% |
|||
| 5 |
|||
| |
|||
| |
|||
| 200000 |
|||
|style="background-color: green"| '''✓''' |
|||
| 45.5 |
|||
| 197742 |
|||
| |
|||
| Good work! |
|||
|- |
|||
|April 30 - May 6 |
|||
|65% |
|||
| 10 |
|||
| |
|||
| |
|||
| 198000 |
|||
|style="background-color: yellow"| '''½''' |
|||
| 65.6 |
|||
| 197742 |
|||
| |
|||
| Good coverage, insufficient CG rules |
|||
|- |
|||
|May 7-13 |
|||
|67% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
|May 14-20 |
|||
|70% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| May 21-27 |
|||
|75% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| May 28-June 3 |
|||
|78% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| 81.42 |
|||
|18334 |
|||
| |
|||
| |
|||
|- |
|||
| June 4-10 |
|||
|82% |
|||
| 20 |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| June 11-17 |
|||
|84% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| Eval 1 |
|||
| |
|||
|- |
|||
| June 18-24 |
|||
|85% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| June 25 - July 1 |
|||
|85% |
|||
| 20 |
|||
| |
|||
| |
|||
| |
|||
|style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| July 2-8 |
|||
|86% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| July 9-15 |
|||
|88% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| Eval 2 |
|||
| |
|||
|- |
|||
| July 16-22 |
|||
|89% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| style="background-color: green"| '''✓''' |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| July 23-29 |
|||
|90% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| July 30 - August 5 |
|||
|91% |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
| |
|||
|- |
|||
| August 6-14 |
|||
|92% |
|||
| |
|||
| |
|||
| |
|||
| 0 |
|||
| |
|||
| |
|||
| |
|||
| Final Evals |
|||
| |
|||
|} |
|||
==Plan by Weeks== |
|||
# 45% coverage, adding new stems to bidix and monodix |
|||
# 65% coverage, adding new stems to bidix and monodix |
|||
# 67% coverage, Basic CG |
|||
# 70% coverage, Adding inflectional affixes to uig.lexc, writing twol rules for them |
|||
# 75% coverage, Adding derivational affixes to uig.lexc, writing twol rules for them |
|||
# 78% coverage, Transfer, CG |
|||
# 82% coverage, CG, lexsel |
|||
# 84% coverage, Transfer, lexsel |
|||
# 85% coverage, Transfer, lexsel |
|||
# 85% coverage, CG, Transfer |
|||
# 86% coverage, Transfer, lexsel |
|||
# 88% coverage, Transfer, CG |
|||
# Preparing text for annotation, evaluation |
|||
# Annotating the Uyghur corpus, %90 coverage |
|||
# Annotating the Uyghur corpus, %90 coverage, Writing paper |
|||
# Writing paper |
|||
== Plan Outline == |
|||
*Post-application period: |
*Post-application period: |
||
**Facilitating MT of a text from Uyghur to Turkish. |
**Facilitating MT of a text from Uyghur to Turkish. |
||
Line 16: | Line 242: | ||
*Month 3: |
*Month 3: |
||
**Creation of an Annotated Corpus |
**Creation of an Annotated Corpus |
||
==Plan by Weeks== |
|||
1. 30% coverage |
|||
2. Basic CG |
|||
3. 40% coverage |
|||
4. Transfer |
|||
5. 50% coverage |
|||
6. Transfer, lexical selection, 65% coverage |
|||
7. CG, 80% coverage |
|||
8. Transfer, lexsel, 84% coverage |
|||
9. Transfer |
|||
10. CG, Transfer |
|||
11. Transfer, lexsel, 86% coverage |
|||
12. Transfer, 88% coverage |
|||
13. Preparing text for annotation |
|||
14-16. Annotating the Uyghur corpus, %90 coverage |
Latest revision as of 08:30, 23 July 2018
Week | Cov. goal | CG goal | Transfer goal | Lexsel goal | Corpusvoc goal | Done? | Coverage | Errors | Checkpoint | Comments |
---|---|---|---|---|---|---|---|---|---|---|
April 23-29 | 45% | 5 | 200000 | ✓ | 45.5 | 197742 | Good work! | |||
April 30 - May 6 | 65% | 10 | 198000 | ½ | 65.6 | 197742 | Good coverage, insufficient CG rules | |||
May 7-13 | 67% | ✓ | ||||||||
May 14-20 | 70% | ✓ | ||||||||
May 21-27 | 75% | ✓ | ||||||||
May 28-June 3 | 78% | ✓ | 81.42 | 18334 | ||||||
June 4-10 | 82% | 20 | ✓ | |||||||
June 11-17 | 84% | ✓ | Eval 1 | |||||||
June 18-24 | 85% | ✓ | ||||||||
June 25 - July 1 | 85% | 20 | ✓ | |||||||
July 2-8 | 86% | ✓ | ||||||||
July 9-15 | 88% | ✓ | Eval 2 | |||||||
July 16-22 | 89% | ✓ | ||||||||
July 23-29 | 90% | |||||||||
July 30 - August 5 | 91% | |||||||||
August 6-14 | 92% | 0 | Final Evals |
Plan by Weeks[edit]
- 45% coverage, adding new stems to bidix and monodix
- 65% coverage, adding new stems to bidix and monodix
- 67% coverage, Basic CG
- 70% coverage, Adding inflectional affixes to uig.lexc, writing twol rules for them
- 75% coverage, Adding derivational affixes to uig.lexc, writing twol rules for them
- 78% coverage, Transfer, CG
- 82% coverage, CG, lexsel
- 84% coverage, Transfer, lexsel
- 85% coverage, Transfer, lexsel
- 85% coverage, CG, Transfer
- 86% coverage, Transfer, lexsel
- 88% coverage, Transfer, CG
- Preparing text for annotation, evaluation
- Annotating the Uyghur corpus, %90 coverage
- Annotating the Uyghur corpus, %90 coverage, Writing paper
- Writing paper
Plan Outline[edit]
- Post-application period:
- Facilitating MT of a text from Uyghur to Turkish.
- Community-bonding period:
- bidix words, up to 50%
- Month 1:
- Writing scripts
- Adding words to bidix, get coverage to around 80%
- Chunking
- Transfer rules
- Begin CG for UIG
- Month 2:
- POS tagging/constraint grammar
- Transfer rules
- Get CG rules up to 100, ~50% disambiguation
- >90% coverage
- Month 3:
- Creation of an Annotated Corpus