Uighur and Turkish/Work plan
- Post-application period:
  - Facilitating MT of a text from Uyghur to Turkish
- Community-bonding period:
  - Adding words to bidix, up to 50%
- Month 1:
  - Writing scripts
  - Adding words to bidix to bring coverage to around 80%
  - Chunking
  - Transfer rules
  - Beginning CG for UIG
- Month 2:
  - POS tagging/constraint grammar
  - Transfer rules
  - Getting CG rules up to 100, ~50% disambiguation
  - >90% coverage
- Month 3:
  - Creation of an annotated corpus
Plan by Weeks
1. 30% coverage
2. Basic CG
3. 40% coverage
4. Transfer
5. 50% coverage
6. Transfer, lexical selection, 65% coverage
7. CG, 80% coverage
8. Transfer, lexsel, 84% coverage
9. Transfer
10. CG, Transfer
11. Transfer, lexsel, 86% coverage
12. Transfer, 88% coverage
13. Preparing text for annotation
14-16. Annotating the Uyghur corpus, 90% coverage
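
The coverage targets in this plan could be tracked with a short script. The following is only a minimal sketch, not the project's actual tooling. It assumes coverage is measured naively as the share of tokens that receive at least one analysis, and that the analyser output is in Apertium stream format, where an unknown word is echoed back with a leading asterisk (for example `^foo/*foo$`). The function name `naive_coverage` and the sample tags are hypothetical.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
# Assumption: escaped characters (\^, \/, \$) inside tokens are ignored here.
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Return the fraction of lexical units that have a known analysis."""
    total = 0
    known = 0
    for match in LEXICAL_UNIT.finditer(analysed_text):
        total += 1
        analyses = match.group(2)
        # Unknown words carry a single pseudo-analysis starting with "*".
        if not analyses.startswith("*"):
            known += 1
    # An empty input has no tokens, so report zero coverage.
    return known / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical analyser output: two known words, two unknown words.
    sample = (
        "^kitab/kitab<n><nom>$ ^foo/*foo$ "
        "^men/men<prn><pers><p1><sg><nom>$ ^bar/*bar$"
    )
    print(f"{naive_coverage(sample):.0%}")  # prints 50%
```

Run over the analysed form of a corpus each week, the resulting figure can be compared against the weekly targets above.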