Difference between revisions of "Apertium Turkic/TODO"

From Apertium
Jump to navigation Jump to search
Line 9: Line 9:
 
* <s>Get apertium-apy working stably</s>
 
* <s>Get apertium-apy working stably</s>
 
* [http://www.google-melange.com/gci/task/view/google/gci2013/5827136816414720 merge simple-html and html-tools so that simple-html can be automatically extracted from html-tools]
 
* [http://www.google-melange.com/gci/task/view/google/gci2013/5827136816414720 merge simple-html and html-tools so that simple-html can be automatically extracted from html-tools]
* [http://www.google-melange.com/gci/task/view/google/gci2013/6612775656751104 apache forwarding for html-tools]
+
* <s>[http://www.google-melange.com/gci/task/view/google/gci2013/6612775656751104 apache forwarding for html-tools]</s> (unnecessary!)
 
* [http://www.google-melange.com/gci/task/view/google/gci2013/5833268486209536 init scripts] and [http://www.google-melange.com/gci/task/view/google/gci2013/5346872029872128 cron testers] for apertium-html-tools, gateway, and apertium-apy
 
* [http://www.google-melange.com/gci/task/view/google/gci2013/5833268486209536 init scripts] and [http://www.google-melange.com/gci/task/view/google/gci2013/5346872029872128 cron testers] for apertium-html-tools, gateway, and apertium-apy
 
** find some way to have it retry restarting if it fails because the port is still reserved by the OS
 
** find some way to have it retry restarting if it fails because the port is still reserved by the OS

Revision as of 18:32, 3 January 2014

This is a general to-do list for the Apertium Turkic working group.

Website

This section outlines what's left to get http://turkic.apertium.com/ up and running.

software infrastructure

optional: spell checker and language detection stuff

what to include

make the following pairs available to the site:

  • pairs: kaz-tat, tur-kir, kaz-kir, tat-bak, kaz-kaa, tuk-tur?, tur-uzb?
  • transducers: kaz, tat, kir, tur, bak, chv, kum, nog, kaa, uzb?, tuk?

prettifying

future

  • consider including the web concordancer on the site (and consider what corpora to provide search access to...)

Things that need to be figured out

Issues introduced by new build process

  • How can we do single-category testvoc now?
  • How can we make vanilla transducers (without MT-specific "wrong" POSes)
  • How can we count trimmed stems?