Difference between revisions of "Apertium-kaz-tat/paper"
Jump to navigation
Jump to search
Firespeaker (talk | contribs) m (→Ilnar) |
m (→Ilnar) |
||
Line 8: | Line 8: | ||
=== Ilnar === |
=== Ilnar === |
||
* Development corpus (lots and lots of text) |
* Development corpus (lots and lots of text) |
||
** Work on increasing |
** <s>Work on increasing coverage (via lexc) and trimmed coverage (via dix) to 90%<s/> |
||
** Work on making sure testvoc passes |
** Work on making sure testvoc passes |
||
** add rules — disambiguation (CG), lexical selection, and transfer. |
** add rules — disambiguation (CG), lexical selection, and transfer. |
Revision as of 15:56, 10 April 2013
We're submitting a paper on apertium-kaz-tat to MT Summit 2013. DEADLINE: APRIL 20.
TODO
Ideal benchmarks:
- document rules in the rlx with example sentences
- more like 100-150 (currently ~40) disambiguation rules in -kaz
Ilnar
- Development corpus (lots and lots of text)
Work on increasing coverage (via lexc) and trimmed coverage (via dix) to 90%- Work on making sure testvoc passes
- add rules — disambiguation (CG), lexical selection, and transfer.
- Test corpus (about 10 pages; don't base rules on this text!)
- Make a gold standard translation/correct some tests for error-rate testing
- Paper
- Add affiliation to paper
- Help JNW come up with some more contrastive stuff (see below / FIXME: Ilnars in paper)
Fran
- Delegate out error-rate testing tasks
- new version of Table 1
JNW
- Work on last few issues in -tat twol
Write up background- Contrastive analysis of Kazakh and Tatar
phonological differences (a generalised summary, 2 or 3 small specific examples)orthographical differences (a generalised summary, 1 or 2 small specific examples)- lexical and morphological differences (2 or 3 specific examples)
- morphotactic differences (2 or 3 specific examples)
- syntactic differences (2 or 3 specific examples)