Apertium Turkic/Use/MT

From Apertium
Jump to navigation Jump to search

The dictionary languages/apertium-kaz/apertium-kaz.kaz.lexc has a bunch of lines with the comment ! Use/MT eng-kaz. These lines are only to appear in machine translation pairs, not when creating a "vanilla" transducer. It's easy to grep them out, but harder to get this into the current build system where language pairs depend on the pre-built att.gz of languages/apertium-kaz and trim that.

Some options:

  1. . Make two binaries in languages/apertium-kaz, one vanilla and one MT.
    • Cons: takes a lot longer to make apertium-kaz, installed size doubles, looks messy
  1. . Have Use/MT as a visible tag, removed by CG or similar
    • Cons: Ugggly
  1. . Have Use/MT as a compiler-tag, paths with this tag are removed by twol when compiling for vanilla, while the tag itself is removed when compiling for MT
  1. . Redundant make steps in pairs
  1. . Create a transducer with only the grep lines. Subtract that from vanilla after creating the att.gz
  1. . Two transducers in langs, but run one to att.gz, vanilla to full binary.