Apertium Turkic/TODO

This is a general to-do list for the Apertium Turkic working group.

Website

This section outlines what's left to get http://turkic.apertium.com/ up and running.

software infrastructure

optional: spell checker and language detection

what to include

make the following pairs and transducers available on the site:

  • pairs: kaz-tat, tur-kir, kaz-kir, tat-bak, kaz-kaa, tuk-tur?, tur-uzb?, kaz-eng?
  • transducers: kaz, tat, kir, tur, bak, chv, kum, nog, kaa, uzb?, tuk?

prettifying

future

  • consider including the web concordancer on the site (and consider what corpora to provide search access to...)

Things that need to be figured out

Issues introduced by new build process

  • How can we do single-category testvoc now?
  • How can we make vanilla transducers (without MT-specific "wrong" POSes)?
  • How can we count trimmed stems?