Uighur and Turkish/Work plan

From Apertium
Jump to navigation Jump to search
Week Coverage Done?
April 23-29 30%
April 30 - May 6 35%
May 7-13 45%
May 14-20 55%
May 21-27 65%
May 28-June 3 75%
June 4-10 80%
June 11-17 84%
June 18-24 85%
June 25 - July 1 85%
July 2-8 86%
July 9-15 88%
July 16-22 88%
July 23-29 88%
July 30 - August 5 90%
August 6-14 92%

Plan by Weeks

1. 30% coverage

2. Basic CG

3. 40% coverage

4. Transfer

5. 50% coverage

6. Transfer, lexical selection, 65% coverage

7. CG, 80% coverage

8. Transfer, lexsel, 84% coverage

9. Transfer

10. CG, Transfer

11. Transfer, lexsel, 86% coverage

12. Transfer, 88% coverage

13. Preparing text for annotation

14-16. Annotating the Uyghur corpus, %90 coverage

Plan Outline

  • Post-application period:
    • Facilitating MT of a text from Uyghur to Turkish.
  • Community-bonding period:
    • bidix words, up to 50%
  • Month 1:
    • Writing scripts
    • Adding words to bidix, get coverage to around 80%
    • Chunking
    • Transfer rules
    • Begin CG for UIG
  • Month 2:
    • POS tagging/constraint grammar
    • Transfer rules
    • Get CG rules up to 100, ~50% disambiguation
    • >90% coverage
  • Month 3:
    • Creation of an Annotated Corpus