Difference between revisions of "Uighur and Turkish/Work plan"

From Apertium
Jump to navigation Jump to search
Line 13: Line 13:
! Coverage
! Coverage
! Checkpoint
! Checkpoint
! Comments
|-
|-
|April 23-29
|April 23-29
Line 22: Line 23:
| 58.9
| 58.9
|
|
| Good work!
|-
|-
|April 30 - May 6
|April 30 - May 6
Line 31: Line 33:
| 65.6
| 65.6
|
|
| Good coverage, insufficient CG rules
|-
|-
|May 7-13
|May 7-13
|67%
|67%
|
|
|
|
|
|
|
Line 44: Line 48:
|70%
|70%
|
|
|
|
|
|
|
Line 53: Line 58:
|75%
|75%
|
|
|
|
|
|
|
Line 61: Line 67:
| May 28-June 3
| May 28-June 3
|78%
|78%
|
|
|
|
|
Line 71: Line 78:
|82%
|82%
| 20
| 20
|
|
|
|
|
Line 85: Line 93:
|
|
| Eval 1
| Eval 1
|
|-
|-
| June 18-24
| June 18-24
|85%
|85%
|
|
|
|
|
|
|
Line 98: Line 108:
|85%
|85%
| 20
| 20
|
|
|
|
|
Line 106: Line 117:
| July 2-8
| July 2-8
|86%
|86%
|
|
|
|
|
Line 121: Line 133:
|
|
| Eval 2
| Eval 2
|
|-
|-
| July 16-22
| July 16-22
|89%
|89%
|
|
|
|
|
|
|
Line 134: Line 148:
|90%
|90%
|
|
|
|
|
|
|
Line 143: Line 158:
|91%
|91%
|
|
|
|
|
|
|
Line 157: Line 173:
|
|
| Final Evals
| Final Evals
|
|}
|}



Revision as of 09:58, 7 May 2018

As of April 25:

  • Coverage: 58.9 %
  • Trimmed coverage: 45.5 %


Week Cov. Goal CG Goal Transfer Goal Lexsel Goal Done? Coverage Checkpoint Comments
April 23-29 58% 5 58.9 Good work!
April 30 - May 6 65% 10 ½ 65.6 Good coverage, insufficient CG rules
May 7-13 67%
May 14-20 70%
May 21-27 75%
May 28-June 3 78%
June 4-10 82% 20
June 11-17 84% Eval 1
June 18-24 85%
June 25 - July 1 85% 20
July 2-8 86%
July 9-15 88% Eval 2
July 16-22 89%
July 23-29 90%
July 30 - August 5 91%
August 6-14 92% Final Evals

Plan by Weeks

  1. 30% coverage
  2. Basic CG
  3. 40% coverage
  4. Transfer
  5. 50% coverage
  6. Transfer, lexical selection, 65% coverage
  7. CG, 80% coverage
  8. Transfer, lexsel, 84% coverage
  9. Transfer
  10. CG, Transfer
  11. Transfer, lexsel, 86% coverage
  12. Transfer, 88% coverage
  13. Preparing text for annotation
  14. Annotating the Uyghur corpus, %90 coverage
  15. Annotating the Uyghur corpus, %90 coverage
  16. Annotating the Uyghur corpus, %90 coverage

Plan Outline

  • Post-application period:
    • Facilitating MT of a text from Uyghur to Turkish.
  • Community-bonding period:
    • bidix words, up to 50%
  • Month 1:
    • Writing scripts
    • Adding words to bidix, get coverage to around 80%
    • Chunking
    • Transfer rules
    • Begin CG for UIG
  • Month 2:
    • POS tagging/constraint grammar
    • Transfer rules
    • Get CG rules up to 100, ~50% disambiguation
    • >90% coverage
  • Month 3:
    • Creation of an Annotated Corpus