Difference between revisions of "Apertium-kaz-tat/paper"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) m (→Ilnar) |
Firespeaker (talk | contribs) (→Ilnar) |
||
Line 7: | Line 7: | ||
=== Ilnar === |
=== Ilnar === |
||
* Development corpus |
|||
* Work on increasing <s>coverage (via lexc) and</s> trimmed coverage (via dix) to 90% |
** Work on increasing <s>coverage (via lexc) and</s> trimmed coverage (via dix) to 90% |
||
* Work on making sure testvoc passes |
** Work on making sure testvoc passes |
||
⚫ | |||
* Test corpus (don't base rules on this!) |
|||
** Make a gold standard translation |
|||
* Paper |
|||
⚫ | |||
* Later: correct some tests for [[Evaluation|error-rate testing]] |
* Later: correct some tests for [[Evaluation|error-rate testing]] |
||
Revision as of 23:34, 6 April 2013
We're submitting a paper on apertium-kaz-tat to MT Summit 2013. DEADLINE: APRIL 20.
TODO
Ideal benchmarks:
- document rules in the rlx with example sentences
- more like 100-150 (currently ~40) disambiguation rules in -kaz
Ilnar
- Development corpus
- Work on increasing
coverage (via lexc) andtrimmed coverage (via dix) to 90% - Work on making sure testvoc passes
- Work on increasing
- Test corpus (don't base rules on this!)
- Make a gold standard translation
- Paper
- Add affiliation to paper
- Later: correct some tests for error-rate testing
Fran
- Delegate out error-rate testing tasks
- new version of Table 1
JNW
- Work on last few issues in -tat twol
- Write up background
- Contrastive analysis of Kazakh and Tatar
- phonological differences (a generalised summary, 2 or 3 small specific examples)
- orthographical differences (a generalised summary, 1 or 2 small specific examples)
- lexical and morphological differences (2 or 3 specific examples)
- morphotactic differences (2 or 3 specific examples)
- syntactic differences (2 or 3 specific examples)