Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Uighur and Turkish/Work plan

From Apertium
< Uighur and Turkish(Difference between revisions)
Jump to: navigation, search
Line 1: Line 1:
 
{| class="wikitable"
 
{| class="wikitable"
 
!Week
 
!Week
! Cov. Goal
+
! Cov. goal
! CG Goal
+
! CG goal
! Transfer Goal
+
! Transfer goal
! Lexsel Goal
+
! Lexsel goal
! Corpusvoc
+
! Corpusvoc goal
 
! Done?
 
! Done?
 
! Coverage
 
! Coverage
Line 17: Line 17:
 
|
 
|
 
|
 
|
|
+
| 200000
 
|style="background-color: green"| '''✓'''
 
|style="background-color: green"| '''✓'''
 
| 45.5
 
| 45.5
|
+
| 197742
 
|
 
|
 
| Good work!
 
| Good work!
Line 29: Line 29:
 
|
 
|
 
|
 
|
|
+
| 198000
 
|style="background-color: yellow"| '''½'''
 
|style="background-color: yellow"| '''½'''
 
| 65.6
 
| 65.6
|
+
| 197742
 
|
 
|
 
| Good coverage, insufficient CG rules
 
| Good coverage, insufficient CG rules

Revision as of 12:12, 7 May 2018

Week Cov. goal CG goal Transfer goal Lexsel goal Corpusvoc goal Done? Coverage Errors Checkpoint Comments
April 23-29 45% 5 200000 45.5 197742 Good work!
April 30 - May 6 65% 10 198000 ½ 65.6 197742 Good coverage, insufficient CG rules
May 7-13 67%
May 14-20 70%
May 21-27 75%
May 28-June 3 78%
June 4-10 82% 20
June 11-17 84% Eval 1
June 18-24 85%
June 25 - July 1 85% 20
July 2-8 86%
July 9-15 88% Eval 2
July 16-22 89%
July 23-29 90%
July 30 - August 5 91%
August 6-14 92% Final Evals

Plan by Weeks

  1. 30% coverage
  2. Basic CG
  3. 40% coverage
  4. Transfer
  5. 50% coverage
  6. Transfer, lexical selection, 65% coverage
  7. CG, 80% coverage
  8. Transfer, lexsel, 84% coverage
  9. Transfer
  10. CG, Transfer
  11. Transfer, lexsel, 86% coverage
  12. Transfer, 88% coverage
  13. Preparing text for annotation
  14. Annotating the Uyghur corpus, %90 coverage
  15. Annotating the Uyghur corpus, %90 coverage
  16. Annotating the Uyghur corpus, %90 coverage

Plan Outline

  • Post-application period:
    • Facilitating MT of a text from Uyghur to Turkish.
  • Community-bonding period:
    • bidix words, up to 50%
  • Month 1:
    • Writing scripts
    • Adding words to bidix, get coverage to around 80%
    • Chunking
    • Transfer rules
    • Begin CG for UIG
  • Month 2:
    • POS tagging/constraint grammar
    • Transfer rules
    • Get CG rules up to 100, ~50% disambiguation
    • >90% coverage
  • Month 3:
    • Creation of an Annotated Corpus
Personal tools