Difference between revisions of "Crimean Tatar and Turkish/Work plan"
Jump to navigation
Jump to search
| Line 1: | Line 1: | ||
What [[User:Ilnar.salimzyan|selimcan]] expects: |
What [[User:Ilnar.salimzyan|selimcan]] expects: |
||
* '''a bidirectional Crimean Tatar-Turkish translator for translating Wikipedia articles''', with: |
|||
* [[Calculating coverage| |
** >90% [[Calculating coverage|bidix-trimmed coverage]] on both Wikipedias, |
||
* [[Testvoc#Corpus testvoc|Corpus testvoc]] clean on all corpora. |
|||
** single-stem-per-lexicon-testvoc and [[Testvoc#Corpus testvoc|Wikipedia-corpus-testvoc]] clean in both directions, |
|||
* Tests in [[Crimean Tatar and Turkish/Pending tests|Pending tests]] pass and thus are moved to [[Crimean Tatar and Turkish/Regression tests|Regression tests]]. |
|||
** [[WER]] < 25% in both directions. |
|||
{|class=wikitable |
|||
|- |
|||
!rowspan="2"| Week !!rowspan="2"| Dates !!colspan="2"| Target !! !!colspan="2"| Achieved !!rowspan="2"| Evaluation !!rowspan="2"| Notes |
|||
|- |
|||
! crh-tur cov. !! tur-crh cov. !! !! crh-tur cov. !! tur-crh cov. |
|||
|- |
|||
| 1 || 07/06—11/06 |
|||
| 65% || 65% || || || || || |
|||
|- |
|||
|- |
|||
| 12 || 21/08—27/08 |
|||
| 90% || 90% || || || || || |
|||
|} |
|||
{|class=wikitable |
{|class=wikitable |
||
Revision as of 23:27, 6 June 2017
What selimcan expects:
- a bidirectional Crimean Tatar-Turkish translator for translating Wikipedia articles, with:
- >90% bidix-trimmed coverage on both Wikipedias,
- single-stem-per-lexicon-testvoc and Wikipedia-corpus-testvoc clean in both directions,
- WER < 25% in both directions.
| Week | Dates | Target | Achieved | Evaluation | Notes | |||
|---|---|---|---|---|---|---|---|---|
| crh-tur cov. | tur-crh cov. | crh-tur cov. | tur-crh cov. | |||||
| 1 | 07/06—11/06 | 65% | 65% | |||||
| 12 | 21/08—27/08 | 90% | 90% | |||||
| Week | Dates | Coverage | Achieved | Evaluation |
|---|---|---|---|---|
| 3 | 22nd May — 28th May | 40% | 43.9% | |
| * Add all non-inflecting words | ||||
| * Finish challenge text (no *,#) | ||||
| * Do baseline evaluation (WER) | ||||
| Official start | ||||
| 4 | 29th May — 4th June | 40% | ||
| * Break | ||||
| 5 | 5th June — 11th June | 65% | ||
| * ? | ||||
| 6 | 12th June — 18th June | 70% | ||
| * ? | ||||
| * ? | ||||
| 7 | 19th June — 25th June | 80% | ||
| Phase 1 evaluation | ||||
| Deliverable: All closed classes + numerals testvoc clean | ||||
| 8 | 26th June — 2nd July | 84% | ||
| * ? | ||||
| * ? | ||||
| 9 | 3rd July — 9th July | 82% | ||
| * ? | ||||
| 10 | 10th July — 16th July | 84% | ||
| * ? | ||||
| * ? | ||||
| 11 | 17th July — 23rd July | 86% | ||
| Phase 2 evaluation | ||||
| Deliverable: Nouns, adjectives testvoc clean | ||||
| * ? | ||||
| 12 | 24th July — 30th July | 88% | ||
| * ? | ||||
| 13 | 1st August — 6th August | 89% | ||
| * ? | ||||
| 14 | 7th August — 13th August | 90% | ||
| * ? | ||||
| 15 | 14th August — 20th August | 91% | ||
| * ? | ||||
| 16 | 21th August — 27th August | 92% | ||
| Final evaluation | ||||
| Final deliverable: Full MT system, testvoc clean. | ||||
| * Evaluation | ||||
| * Write paper | ||||
| 17 | 28th August — 3rd September | |||
| * Write paper | ||||
| 18 | 4th September — 6th September | |||
| * Write paper | ||||