Difference between revisions of "User:Ilnar.salimzyan/GSoC2014/Application"
Jump to navigation
Jump to search
Line 4: | Line 4: | ||
* work on the 'James and Mary' translation |
* work on the 'James and Mary' translation |
||
** get rid of the debugging symbols |
** <s>get rid of the debugging symbols</s> |
||
** get the baseline WER |
** get the baseline WER |
||
* get permission to use one of the modern government-funded Tatar-Russian dictionaries under a free license and digitize it or fall back to one of the dictionaries in the public domain and scan that |
* get permission to use one of the modern government-funded Tatar-Russian dictionaries under a free license and digitize it or fall back to one of the dictionaries in the public domain and scan that |
Revision as of 12:40, 15 April 2014
You can find my proposal for GSoC 2014 here:
Post-application period
- work on the 'James and Mary' translation
get rid of the debugging symbols- get the baseline WER
- get permission to use one of the modern government-funded Tatar-Russian dictionaries under a free license and digitize it or fall back to one of the dictionaries in the public domain and scan that
- read documentation on chunking based-transfer and papers describing other Apertium pairs for distant languages
Chunking,Chunking: A full example, sme-nob paper, eus-eng paper, eng-kaz paper.