Difference between revisions of "Apertium Turkic/TODO"

From Apertium
Jump to navigation Jump to search
Line 21: Line 21:
 
=== what to include ===
 
=== what to include ===
 
make the following pairs available to the site:
 
make the following pairs available to the site:
* pairs: '''kaz-tat''', '''tur-kir''', kaz-kir, tat-bak, kaz-kaa, tuk-tur?, tur-uzb?
+
* pairs: '''kaz-tat''', '''tur-kir''', kaz-kir, tat-bak, kaz-kaa, tuk-tur?, tur-uzb?, kaz-eng?
 
* transducers: '''kaz''', '''tat''', kir, tur, bak, chv, kum, nog, kaa, uzb?, tuk?
 
* transducers: '''kaz''', '''tat''', kir, tur, bak, chv, kum, nog, kaa, uzb?, tuk?
   

Revision as of 23:06, 4 January 2014

This is a general to-do list for the Apertium Turkic working group.

Website

This section outlines what's left to get http://turkic.apertium.com/ up and running.

software infrastructure

optional: spell checker and language detection stuff

what to include

make the following pairs available to the site:

  • pairs: kaz-tat, tur-kir, kaz-kir, tat-bak, kaz-kaa, tuk-tur?, tur-uzb?, kaz-eng?
  • transducers: kaz, tat, kir, tur, bak, chv, kum, nog, kaa, uzb?, tuk?

prettifying

future

  • consider including the web concordancer on the site (and consider what corpora to provide search access to...)

Things that need to be figured out

Issues introduced by new build process

  • How can we do single-category testvoc now?
  • How can we make vanilla transducers (without MT-specific "wrong" POSes)
  • How can we count trimmed stems?