User:Ilnar.salimzyan

From Apertium
Revision as of 12:41, 16 March 2014 by Ilnar.salimzyan (talk | contribs)

My name is Ilnar Salimzyanov (Илнар Сәлимҗан) and I maintain the Kazakh-Tatar language pair.

My IRC nick is "selimcan".

Responsibilities as the maintainer of the Kazakh-Tatar pair

Making sure that the following tests pass:

  • All (=100%) of the regression tests should pass: ./wiki-tests.sh Regression kaz tat update
  • The corpus test shouldn't leave any debugging symbols in the output: after ./qa.sh kaz-tat-corp, the command grep -c '[*@#]' corpa/kaz-tat-nova.txt should output 0.

The same applies to the opposite direction:

  • ./wiki-tests.sh Regression tat kaz update => 100%
  • ./qa.sh tat-kaz-corp && grep -c '[*@#]' corpa/tat-kaz-nova.txt => 0
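
The corpus check above could be wrapped in a small helper, roughly as sketched below. The function names count_debug_lines and corpus_is_clean are made up for this illustration. One detail worth knowing when scripting this: grep -c prints 0 when nothing matches, but it also exits with status 1 in that case, so the exit status of the plain command is the opposite of what a script would expect.

```shell
# Print the number of lines containing a debugging symbol (*, @ or #).
# Hypothetical helper, not part of the Apertium scripts.
count_debug_lines() {
    # grep -c prints the count even when it is 0, but then exits with
    # status 1, so the status is neutralised here.
    grep -c '[*@#]' "$1" || true
}

# Succeed only if the translated corpus contains no debugging symbols.
# A missing file yields an empty count and is treated as "not clean".
corpus_is_clean() {
    [ "$(count_debug_lines "$1")" = "0" ]
}

# Possible use after running the corpus test:
#   ./qa.sh kaz-tat-corp && corpus_is_clean corpa/kaz-tat-nova.txt
```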

If and only if all test cases pass, and I am not actively working on another pair, I work towards achieving those goals and take care of documentation.

TODOs related to the above

A good idea is to set up a bot that compiles the apertium-kaz, apertium-tat and apertium-kaz-tat modules whenever somebody commits to them, runs the four tests above and emails me if any of them fail.
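
The core of such a bot might look something like the sketch below. It is only an outline under several assumptions: run_checks is a made-up name, each check is assumed to be wrapped so that its exit status reflects success (the regression script may need its output parsed for that), and the mail command and the address in the usage comment are placeholders.

```shell
# Run every command given as an argument, print the ones that fail,
# and return non-zero if at least one of them failed.
# Hypothetical helper for illustration only.
run_checks() {
    failed=0
    for check in "$@"; do
        # Each check runs in its own shell; its output is discarded and
        # only the exit status is used.
        if ! sh -c "$check" >/dev/null 2>&1; then
            echo "FAILED: $check"
            failed=1
        fi
    done
    return "$failed"
}

# Possible use from a post-commit hook or cron job (address is a placeholder,
# and the pair is assumed to be compiled before the checks run):
#   if ! report=$(run_checks \
#           "./qa.sh kaz-tat-corp && ! grep -q '[*@#]' corpa/kaz-tat-nova.txt" \
#           "./qa.sh tat-kaz-corp && ! grep -q '[*@#]' corpa/tat-kaz-nova.txt"); then
#       echo "$report" | mail -s "kaz-tat checks failed" maintainer@example.org
#   fi
```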

Another important thing to check automatically is that WER decreases, or at least stays the same, after every commit. That is, the WER numbers output by the commands

kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-nova.txt -ref corpa/corpus.tat.txt

and

kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-nova.txt -ref corpa/corpus.kaz.txt

should be less than or equal to the numbers output by

kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-origina.txt -ref corpa/corpus.tat.txt

and

kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-origina.txt -ref corpa/corpus.kaz.txt
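
The comparison described above could be automated along these lines. This is a rough sketch: the function names extract_wer and wer_not_worse are made up for illustration, and the pattern assumes the evaluator prints a line of the form "Word error rate (WER): 12.34 %", which should be checked against its actual output. If several such lines are printed, only the first one is used.

```shell
# Print the first WER percentage found on standard input.
# Assumes a line such as "Word error rate (WER): 12.34 %".
extract_wer() {
    awk '/\(WER\)/ {
        # Take the first purely numeric field on the matching line.
        for (i = 1; i <= NF; i++)
            if ($i ~ /^[0-9]+(\.[0-9]+)?$/) { print $i; exit }
    }'
}

# Succeed if the new WER ($1) is less than or equal to the old WER ($2).
wer_not_worse() {
    # Adding 0 forces a numeric comparison, so 9.5 counts as less than 10.
    awk -v new="$1" -v old="$2" 'BEGIN { exit !(new + 0 <= old + 0) }'
}

# Possible use in the kaz-tat directory:
#   new=$(perl ../apertium-eval-translator/apertium-eval-translator.pl \
#           -test corpa/kaz-tat-nova.txt -ref corpa/corpus.tat.txt | extract_wer)
#   old=$(perl ../apertium-eval-translator/apertium-eval-translator.pl \
#           -test corpa/kaz-tat-origina.txt -ref corpa/corpus.tat.txt | extract_wer)
#   wer_not_worse "$new" "$old" || echo "WER got worse: $old -> $new"
```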