Difference between revisions of "Apertium-kaz-tat/paper"

From Apertium
Jump to navigation Jump to search
Line 7: Line 7:


=== Ilnar ===
=== Ilnar ===
* Development corpus
* Work on increasing <s>coverage (via lexc) and</s> trimmed coverage (via dix) to 90%
** Work on increasing <s>coverage (via lexc) and</s> trimmed coverage (via dix) to 90%
* Work on making sure testvoc passes
** Work on making sure testvoc passes
* Add affiliation to paper
* Test corpus (don't base rules on this!)
** Make a gold standard translation
* Paper
** Add affiliation to paper
* Later: correct some tests for [[Evaluation|error-rate testing]]
* Later: correct some tests for [[Evaluation|error-rate testing]]



Revision as of 23:34, 6 April 2013

We're submitting a paper on apertium-kaz-tat to MT Summit 2013. DEADLINE: APRIL 20.

TODO

Ideal benchmarks:

  • document rules in the rlx with example sentences
  • more like 100-150 (currently ~40) disambiguation rules in -kaz

Ilnar

  • Development corpus
    • Work on increasing coverage (via lexc) and trimmed coverage (via dix) to 90%
    • Work on making sure testvoc passes
  • Test corpus (don't base rules on this!)
    • Make a gold standard translation
  • Paper
    • Add affiliation to paper
  • Later: correct some tests for error-rate testing

Fran

  • Delegate out error-rate testing tasks
  • new version of Table 1

JNW

  • Work on last few issues in -tat twol
  • Write up background
  • Contrastive analysis of Kazakh and Tatar
    • phonological differences (a generalised summary, 2 or 3 small specific examples)
    • orthographical differences (a generalised summary, 1 or 2 small specific examples)
    • lexical and morphological differences (2 or 3 specific examples)
    • morphotactic differences (2 or 3 specific examples)
    • syntactic differences (2 or 3 specific examples)