Difference between revisions of "Uighur and Turkish/Work plan"

From Apertium
Jump to navigation Jump to search
Line 15: Line 15:
 
|-
 
|-
 
|April 23-29
 
|April 23-29
|30%
+
|60%
 
|
 
|
 
|
 
|
Line 24: Line 24:
 
|-
 
|-
 
|April 30 - May 6
 
|April 30 - May 6
|35%
+
|65%
 
| 10
 
| 10
 
|
 
|
Line 33: Line 33:
 
|-
 
|-
 
|May 7-13
 
|May 7-13
|45%
+
|67%
 
|
 
|
 
|
 
|
Line 42: Line 42:
 
|-
 
|-
 
|May 14-20
 
|May 14-20
|55%
+
|70%
 
|
 
|
 
|
 
|
Line 51: Line 51:
 
|-
 
|-
 
| May 21-27
 
| May 21-27
|65%
+
|75%
 
|
 
|
 
|
 
|
Line 60: Line 60:
 
|-
 
|-
 
| May 28-June 3
 
| May 28-June 3
|75%
+
|78%
 
|
 
|
 
|
 
|
Line 69: Line 69:
 
|-
 
|-
 
| June 4-10
 
| June 4-10
|80%
+
|82%
 
| 20
 
| 20
 
|
 
|
Line 161: Line 161:
 
==Plan by Weeks==
 
==Plan by Weeks==
   
1. 30% coverage
+
# 30% coverage
 
# Basic CG
 
 
# 40% coverage
2. Basic CG
 
 
# Transfer
 
3. 40% coverage
+
# 50% coverage
 
# Transfer, lexical selection, 65% coverage
 
 
# CG, 80% coverage
4. Transfer
 
 
# Transfer, lexsel, 84% coverage
 
 
# Transfer
5. 50% coverage
 
 
# CG, Transfer
 
6. Transfer, lexical selection, 65% coverage
+
# Transfer, lexsel, 86% coverage
  +
# Transfer, 88% coverage
 
 
# Preparing text for annotation
7. CG, 80% coverage
 
 
# Annotating the Uyghur corpus, %90 coverage
 
8. Transfer, lexsel, 84% coverage
+
# Annotating the Uyghur corpus, %90 coverage
  +
# Annotating the Uyghur corpus, %90 coverage
 
9. Transfer
 
 
10. CG, Transfer
 
 
11. Transfer, lexsel, 86% coverage
 
 
12. Transfer, 88% coverage
 
 
13. Preparing text for annotation
 
 
14-16. Annotating the Uyghur corpus, %90 coverage
 
   
 
== Plan Outline ==
 
== Plan Outline ==

Revision as of 09:51, 7 May 2018

As of April 25:

  • Coverage: 58.9 %
  • Trimmed coverage: 45.5 %


Week Cov. Goal CG Goal Transfer Goal Lexsel Goal Done? Coverage Checkpoint
April 23-29 60%
April 30 - May 6 65% 10
May 7-13 67%
May 14-20 70%
May 21-27 75%
May 28-June 3 78%
June 4-10 82% 20
June 11-17 84% Eval 1
June 18-24 85%
June 25 - July 1 85% 20
July 2-8 86%
July 9-15 88% Eval 2
July 16-22 88%
July 23-29 88%
July 30 - August 5 90%
August 6-14 92% Final Evals

Plan by Weeks

  1. 30% coverage
  2. Basic CG
  3. 40% coverage
  4. Transfer
  5. 50% coverage
  6. Transfer, lexical selection, 65% coverage
  7. CG, 80% coverage
  8. Transfer, lexsel, 84% coverage
  9. Transfer
  10. CG, Transfer
  11. Transfer, lexsel, 86% coverage
  12. Transfer, 88% coverage
  13. Preparing text for annotation
  14. Annotating the Uyghur corpus, %90 coverage
  15. Annotating the Uyghur corpus, %90 coverage
  16. Annotating the Uyghur corpus, %90 coverage

Plan Outline

  • Post-application period:
    • Facilitating MT of a text from Uyghur to Turkish.
  • Community-bonding period:
    • bidix words, up to 50%
  • Month 1:
    • Writing scripts
    • Adding words to bidix, get coverage to around 80%
    • Chunking
    • Transfer rules
    • Begin CG for UIG
  • Month 2:
    • POS tagging/constraint grammar
    • Transfer rules
    • Get CG rules up to 100, ~50% disambiguation
    • >90% coverage
  • Month 3:
    • Creation of an Annotated Corpus