Difference between revisions of "User:Ilnar.salimzyan"

From Apertium
Jump to navigation Jump to search
Line 5: Line 5:
== Responsibilities as the maintainer of the Kazakh-Tatar pair ==
== Responsibilities as the maintainer of the Kazakh-Tatar pair ==


Making sure that the following tests pass:
Making sure that the following things are constant:


* All (=100%) of the regression tests should pass: <code>./wiki-tests.sh Regression kaz tat update</code>
# All (=100%) of the regression tests pass: <code>./wiki-tests.sh Regression kaz tat update</code>
# Corpus test doesn't produce any debugging symbols: <code>./qa.sh kaz-tat-corp && grep -c '[*@#]' corpa/kaz-tat-nova.txt</code> outputs 0.

# WER continuosly decreases or at least stays the same after every commit. So, the WER numbers output by the command <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-nova.txt -ref corpa/corpus.tat.txt</pre> are '''less than or equal to''' numbers output by <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-origina.txt -ref corpa/corpus.tat.txt</pre>.
* Corpus test shouldn't produce any debugging symbols: <code>./qa.sh kaz-tat-corp && grep -c '[*@#]' corpa/kaz-tat-nova.txt</code> should output 0.


The same applies for the opposite direction:
The same applies for the opposite direction:


* <code>./wiki-tests.sh Regression tat kaz update</code> => 100%
# <code>./wiki-tests.sh Regression tat kaz update</code> => 100%
# <code>./qa.sh tat-kaz-corp && grep -c '[*@#]' corpa/tat-kaz-nova.txt</code> => 0
# The numbers output by <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-nova.txt -ref corpa/corpus.kaz.txt</pre> are '''less than or equal to''' numbers output by <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-origina.txt -ref corpa/corpus.kaz.txt</pre>.


If and only if all test cases pass, and I don't actively work on an other pair, I work towards achieving those [[Kazakh_and_Tatar/TODO#Goals|goals]] and take care of documentation.
* <code>./qa.sh tat-kaz-corp && grep -c '[*@#]' corpa/tat-kaz-nova.txt</code> => 0

If and only if all test cases pass, and I don't actively work on an other pair, I work towards achiving those [[Kazakh_and_Tatar/TODO#Goals|goals]] and take care of documentation.


=== TODO's related to the above ===
=== TODO's related to the above ===


A good idea is to set up some bot which compiles apertium-kaz, apertium-tat and apertium-kaz-tat modules if somebody commits to them, runs the four
<code>./qa.sh all</code> in <code>apertium-kaz-tat</code> will compile the transcducers and the pair and run all the tests described above (note that you have to <code>mv corpa/kaz-tat-nova.txt corpa/kaz-tat-origina.txt</code> before you run tests if you want to see how the last commit affected them (not how the last+second_to_last commits did that). A good idea would be, however, to set up some bot which compiles apertium-kaz, apertium-tat and apertium-kaz-tat modules if somebody commits to them, runs the tests above and emails me if they don't pass.
tests above and emails me if they don't pass.

Another important thing to take care of automatically is that WER continuosly decreases or at least stays the same after every commit. So, the WER numbers output by the commands <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-nova.txt -ref corpa/corpus.tat.txt</pre> and <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-nova.txt -ref corpa/corpus.kaz.txt</pre> are '''less than or equal to''' numbers output by <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-origina.txt -ref corpa/corpus.tat.txt</pre> and <pre>kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-origina.txt -ref corpa/corpus.kaz.txt</pre>


[[Category:Users]]
[[Category:Users]]

Revision as of 00:21, 21 March 2014

My name is Ilnar Salimzyanov (Илнар Сәлимҗан) and I maintain the Kazakh-Tatar language pair.

My IRC nick is "selimcan".

Responsibilities as the maintainer of the Kazakh-Tatar pair

Making sure that the following things are constant:

  1. All (=100%) of the regression tests pass: ./wiki-tests.sh Regression kaz tat update
  2. Corpus test doesn't produce any debugging symbols: ./qa.sh kaz-tat-corp && grep -c '[*@#]' corpa/kaz-tat-nova.txt outputs 0.
  3. WER continuosly decreases or at least stays the same after every commit. So, the WER numbers output by the command
    kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-nova.txt -ref corpa/corpus.tat.txt
    are less than or equal to numbers output by
    kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/kaz-tat-origina.txt -ref corpa/corpus.tat.txt
    .

The same applies for the opposite direction:

  1. ./wiki-tests.sh Regression tat kaz update => 100%
  2. ./qa.sh tat-kaz-corp && grep -c '[*@#]' corpa/tat-kaz-nova.txt => 0
  3. The numbers output by
    kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-nova.txt -ref corpa/corpus.kaz.txt
    are less than or equal to numbers output by
    kaz-tat$ perl ../apertium-eval-translator/apertium-eval-translator.pl -test corpa/tat-kaz-origina.txt -ref corpa/corpus.kaz.txt
    .

If and only if all test cases pass, and I don't actively work on an other pair, I work towards achieving those goals and take care of documentation.

TODO's related to the above

./qa.sh all in apertium-kaz-tat will compile the transcducers and the pair and run all the tests described above (note that you have to mv corpa/kaz-tat-nova.txt corpa/kaz-tat-origina.txt before you run tests if you want to see how the last commit affected them (not how the last+second_to_last commits did that). A good idea would be, however, to set up some bot which compiles apertium-kaz, apertium-tat and apertium-kaz-tat modules if somebody commits to them, runs the tests above and emails me if they don't pass.