Difference between revisions of "Indonesian and Malaysian/Work plan"
Jump to navigation
Jump to search
Line 9: | Line 9: | ||
| 1 || 21/05—27/05 || Working on Indonesian analyzer/generator. || - || - || - |
| 1 || 21/05—27/05 || Working on Indonesian analyzer/generator. || - || - || - |
||
|- |
|- |
||
| 2 || 28/05—03/06 || Working on Indonesian analyzer/generator.<br/>Translating Malaysian wikipedia articles to Indonesian to get a parallel corpus. Bilingual dictionaries will be [[Extracting bilingual dictionaries with Giza++|extracted]] from the corpus. || 72.9%, 29 |
| 2 || 28/05—03/06 || Working on Indonesian analyzer/generator.<br/>Translating Malaysian wikipedia articles to Indonesian to get a parallel corpus. Bilingual dictionaries will be [[Extracting bilingual dictionaries with Giza++|extracted]] from the corpus. || 72.9%, 29.9% || - || - |
||
|- |
|- |
||
| 3 || 04/06—10/06 || Translating Malaysian wikipedia articles to Indonesian.<br/>Working on Malaysian analyzer/generator. || 74.9%, 46.4% || - || - |
| 3 || 04/06—10/06 || Translating Malaysian wikipedia articles to Indonesian.<br/>Working on Malaysian analyzer/generator. || 74.9%, 46.4% || - || - |
Revision as of 20:30, 10 June 2012
This is a workplan for development efforts for the Indonesian and Malaysian translator in Google Summer of Code 2012.
Week | Dates | Activities | Coverage reached (wp) | Evaluation | WER reached |
---|---|---|---|---|---|
0 | 23/04—21/05 | Translating the story to get a baseline WER. | - | 500 words | 4.68% |
1 | 21/05—27/05 | Working on Indonesian analyzer/generator. | - | - | - |
2 | 28/05—03/06 | Working on Indonesian analyzer/generator. Translating Malaysian wikipedia articles to Indonesian to get a parallel corpus. Bilingual dictionaries will be extracted from the corpus. |
72.9%, 29.9% | - | - |
3 | 04/06—10/06 | Translating Malaysian wikipedia articles to Indonesian. Working on Malaysian analyzer/generator. |
74.9%, 46.4% | - | - |
4 | 11/06—17/06 | Working on Malaysian analyzer/generator and bidix. | - | - |