Difference between revisions of "Apertium-quality/Application Documentation"

From Apertium
Jump to navigation Jump to search
(Created page with '== aq-ambtest == <span style='font-weight: bold; font-size: 12pt;'>Help output</span> <pre> usage: aq-ambtest [-h] [-X [STATFILE]] dictionary Get average ambiguity. positional …')
 
 
(2 intermediate revisions by one other user not shown)
Line 1: Line 1:
== aq-ambtest ==
== Usage Information ==
=== aq-ambtest &mdash; Ambiguity Testing ===
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
==== Help output ====
<pre>
<pre>
usage: aq-ambtest [-h] [-X [STATFILE]] dictionary
usage: aq-ambtest [-h] [-X [STATFILE]] dictionary
Line 12: Line 13:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-ambtest apertium-mt-he.mt.dix -X
<!-- Add usage cases here! -->
aq-ambtest apertium-mt-he.he.dix -X</pre>


== aq-wikicrp ==
=== aq-wikicrp &mdash; Wikipedia Corpus Extractor ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-wikicrp [-h] [-c COUNT] [-C CORES] [-t TOKENISER] [-q QUEUE] [-x]
usage: aq-wikicrp [-h] [-c COUNT] [-C CORES] [-t TOKENISER] [-q QUEUE] [-x]
Line 44: Line 46:


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>wget http://dumps.wikimedia.org/mtwiki/latest/mtwiki-latest-pages-articles.xml.bz2 && bunzip2 mtwiki-latest-pages-articles.xml.bz2
<!-- Add usage cases here! -->
aq-wikicrp mtwiki-latest-pages-articles.xml mt.wikipedia.crp.txt</pre>


== aq-covtest ==
=== aq-covtest &mdash; Coverage Testing ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-covtest [-h] [-X [STATFILE]] [-H] corpus dictionary
usage: aq-covtest [-h] [-X [STATFILE]] [-H] corpus dictionary
Line 61: Line 64:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-H, --hfst HFST mode
-H, --hfst HFST mode


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-covtest mt.news.crp.txt mt-he.automorf.bin -X</pre>
<!-- Add usage cases here! -->


== aq-htmlgen ==
=== aq-htmlgen &mdash; HTML Generation for Statistics ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-htmlgen [-h] [-t [TITLE]] statistics outdir
usage: aq-htmlgen [-h] [-t [TITLE]] statistics outdir
Line 85: Line 88:


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-htmlgen quality-stats.xml output</pre>
<!-- Add usage cases here! -->


== aq-autotest ==
=== aq-autotest &mdash; Automatic Testing using AQX files ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-autotest [-h] [-c] [-X [STATS]] [-o [OUTDIR]] aqx
usage: aq-autotest [-h] [-c] [-X [STATS]] [-o [OUTDIR]] aqx
Line 102: Line 105:
-c, --colour Colours the output
-c, --colour Colours the output
-X [STATS], --statistics [STATS]
-X [STATS], --statistics [STATS]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-o [OUTDIR], --html [OUTDIR]
-o [OUTDIR], --html [OUTDIR]
Output directory for HTML content
Output directory for HTML content


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-autotest -X stats.xml -o output quality.aqx</pre>
<!-- Add usage cases here! -->


== aq-gentest ==
=== aq-gentest &mdash; Generation Testing ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-gentest [-h] [-X [STATFILE]] [-d [DIRECTORY]] mode corpus
usage: aq-gentest [-h] [-X [STATFILE]] [-d [DIRECTORY]] mode corpus
Line 124: Line 127:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-d [DIRECTORY], --dict [DIRECTORY]
-d [DIRECTORY], --dict [DIRECTORY]
Directory of dictionary (Default: current directory)
Directory of dictionary (Default: current directory)


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-gentest -d . mt-he mt.crp.txt -X</pre>
<!-- Add usage cases here! -->


== aq-dixtest ==
=== aq-dixtest &mdash; Dictionary tests (rule count, etc) ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-dixtest [-h] [-X [STATFILE]] [-d [DICTDIR]] langpair
usage: aq-dixtest [-h] [-X [STATFILE]] [-d [DICTDIR]] langpair
Line 145: Line 148:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-d [DICTDIR], --dict [DICTDIR]
-d [DICTDIR], --dict [DICTDIR]
Directory of dictionary (Default: current directory)
Directory of dictionary (Default: current directory)


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-dixtest -d . mt-he </pre>
<!-- Add usage cases here! -->


== aq-regtest ==
=== aq-regtest &mdash; Regression Testing ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-regtest [-h] [-X [STATFILE]] [-d [DICTDIR]] mode wikiurl
usage: aq-regtest [-h] [-X [STATFILE]] [-d [DICTDIR]] mode wikiurl
Line 167: Line 170:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-d [DICTDIR], --dict [DICTDIR]
-d [DICTDIR], --dict [DICTDIR]
Directory of dictionary (Default: current directory)
Directory of dictionary (Default: current directory)


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-regtest -d . mt-he http://wiki.apertium.org/wiki/Special:Export/Maltese_and_Hebrew/Regression_tests -X
<!-- Add usage cases here! -->
aq-regtest -d . mt-he Regression_tests.xml -X</pre>


== aq-voctest ==
=== aq-voctest &mdash; Vocabulary Testing ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-voctest [-h] [-X [STATFILE]] [-a [ANADIX]] [-g [GENBIN]]
usage: aq-voctest [-h] [-X [STATFILE]] [-a [ANADIX]] [-g [GENBIN]]
Line 190: Line 194:
-h, --help show this help message and exit
-h, --help show this help message and exit
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-a [ANADIX], --anadix [ANADIX]
-a [ANADIX], --anadix [ANADIX]
Analysis .dix file (Default: apertium-aa-ab.aa-ab.dix)
Analysis .dix file (Default: apertium-aa-ab.aa-ab.dix)
Line 201: Line 205:
Dictionary direction (lr, rl)
Dictionary direction (lr, rl)
-o [OUTPUT], --output [OUTPUT]
-o [OUTPUT], --output [OUTPUT]
Output file for arrows output
Output file for arrows output (Default: voctest.txt)


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-voctest mt-he -X</pre>
<!-- Add usage cases here! -->


== aq-morftest ==
=== aq-morftest &mdash; Morph Testing (HFST, etc) ===
==== Help output ====
<span style='font-weight: bold; font-size: 12pt;'>Help output</span>
<pre>
<pre>
usage: aq-morftest [-h] [-c] [-X [STATFILE]] [-C] [-i] [-s] [-l] [-f] [-p]
usage: aq-morftest [-h] [-c] [-X [STATFILE]] [-C] [-i] [-s] [-l] [-f] [-p]
Line 225: Line 229:
-c, --colour Colours the output
-c, --colour Colours the output
-X [STATFILE], --statistics [STATFILE]
-X [STATFILE], --statistics [STATFILE]
XML file that statistics are to be stored in
XML file that statistics are to be stored in (Default: quality-stats.xml)
-C, --compact Makes output more compact
-C, --compact Makes output more compact
-i, --ignore-extra-analyses
-i, --ignore-extra-analyses
Line 247: Line 251:


</pre>
</pre>
==== Usage ====
<span style='font-weight: bold; font-size: 12pt;'>Usage</span>
<pre>aq-morftest tgl.yaml -X</pre>
<!-- Add usage cases here! -->

Latest revision as of 11:02, 30 August 2011

Usage Information[edit]

aq-ambtest — Ambiguity Testing[edit]

Help output[edit]

usage: aq-ambtest [-h] [-X [STATFILE]] dictionary

Get average ambiguity.

positional arguments:
  dictionary            DIX file

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)

Usage[edit]

aq-ambtest apertium-mt-he.mt.dix -X
aq-ambtest apertium-mt-he.he.dix -X

aq-wikicrp — Wikipedia Corpus Extractor[edit]

Help output[edit]

usage: aq-wikicrp [-h] [-c COUNT] [-C CORES] [-t TOKENISER] [-q QUEUE] [-x]
                  wikidump outfile

Extract a usable corpus from a Wikipedia dump.

positional arguments:
  wikidump              Wikipedia XML dump
  outfile               Output filename

optional arguments:
  -h, --help            show this help message and exit
  -c COUNT, --count COUNT
                        Maximum sentences to store in corpus output (default:
                        unlimited)
  -C CORES, --cores CORES
                        Limit how many cores to use for generation
  -t TOKENISER, --tokeniser TOKENISER
                        Tokeniser to use
  -q QUEUE, --queue QUEUE
                        Set queue size (for advanced users)
  -x, --xml             Output corpora in XML format

Usage[edit]

wget http://dumps.wikimedia.org/mtwiki/latest/mtwiki-latest-pages-articles.xml.bz2 && bunzip2 mtwiki-latest-pages-articles.xml.bz2
aq-wikicrp mtwiki-latest-pages-articles.xml mt.wikipedia.crp.txt

aq-covtest — Coverage Testing[edit]

Help output[edit]

usage: aq-covtest [-h] [-X [STATFILE]] [-H] corpus dictionary

Test coverage.

positional arguments:
  corpus                Corpus text file
  dictionary            Binary dictionary (.bin, .fst, etc)

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -H, --hfst            HFST mode

Usage[edit]

aq-covtest mt.news.crp.txt mt-he.automorf.bin -X

aq-htmlgen — HTML Generation for Statistics[edit]

Help output[edit]

usage: aq-htmlgen [-h] [-t [TITLE]] statistics outdir

Generate webpage and related files.

positional arguments:
  statistics            Statistics file
  outdir                Output directory

optional arguments:
  -h, --help            show this help message and exit
  -t [TITLE], --title [TITLE]
                        Directory of dictionary (Default: current directory)

Usage[edit]

aq-htmlgen quality-stats.xml output

aq-autotest — Automatic Testing using AQX files[edit]

Help output[edit]

usage: aq-autotest [-h] [-c] [-X [STATS]] [-o [OUTDIR]] aqx

Attempt all tests with default settings.

positional arguments:
  aqx                   Apertium Quality XML configuration file

optional arguments:
  -h, --help            show this help message and exit
  -c, --colour          Colours the output
  -X [STATS], --statistics [STATS]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -o [OUTDIR], --html [OUTDIR]
                        Output directory for HTML content

Usage[edit]

aq-autotest -X stats.xml -o output quality.aqx

aq-gentest — Generation Testing[edit]

Help output[edit]

usage: aq-gentest [-h] [-X [STATFILE]] [-d [DIRECTORY]] mode corpus

Test generation.

positional arguments:
  mode                  Language mode (eg, br-fr)
  corpus                Corpus text file

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -d [DIRECTORY], --dict [DIRECTORY]
                        Directory of dictionary (Default: current directory)

Usage[edit]

aq-gentest -d . mt-he mt.crp.txt -X

aq-dixtest — Dictionary tests (rule count, etc)[edit]

Help output[edit]

usage: aq-dixtest [-h] [-X [STATFILE]] [-d [DICTDIR]] langpair

Get general dictionary statistics.

positional arguments:
  langpair              Language pair (eg aa-ab)

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -d [DICTDIR], --dict [DICTDIR]
                        Directory of dictionary (Default: current directory)

Usage[edit]

aq-dixtest -d . mt-he 

aq-regtest — Regression Testing[edit]

Help output[edit]

usage: aq-regtest [-h] [-X [STATFILE]] [-d [DICTDIR]] mode wikiurl

Test for regressions directly from Apertium wiki.

positional arguments:
  mode                  Mode of operation (eg. br-fr)
  wikiurl               URL to regression tests

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -d [DICTDIR], --dict [DICTDIR]
                        Directory of dictionary (Default: current directory)

Usage[edit]

aq-regtest -d . mt-he http://wiki.apertium.org/wiki/Special:Export/Maltese_and_Hebrew/Regression_tests -X
aq-regtest -d . mt-he Regression_tests.xml -X

aq-voctest — Vocabulary Testing[edit]

Help output[edit]

usage: aq-voctest [-h] [-X [STATFILE]] [-a [ANADIX]] [-g [GENBIN]]
                  [-d [DICTDIR]] [-D [DIRECTION]] [-o [OUTPUT]]
                  langpair

Test vocabulary for generation errors.

positional arguments:
  langpair              Language pair (eg, br-fr)

optional arguments:
  -h, --help            show this help message and exit
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -a [ANADIX], --anadix [ANADIX]
                        Analysis .dix file (Default: apertium-aa-ab.aa-ab.dix)
  -g [GENBIN], --genbin [GENBIN]
                        Generation .bin file (Default: apertium-aa-
                        ab.autogen.bin)
  -d [DICTDIR], --dict [DICTDIR]
                        Directory of dictionary (Default: current directory)
  -D [DIRECTION], --direction [DIRECTION]
                        Dictionary direction (lr, rl)
  -o [OUTPUT], --output [OUTPUT]
                        Output file for arrows output (Default: voctest.txt)

Usage[edit]

aq-voctest mt-he -X

aq-morftest — Morph Testing (HFST, etc)[edit]

Help output[edit]

usage: aq-morftest [-h] [-c] [-X [STATFILE]] [-C] [-i] [-s] [-l] [-f] [-p]
                   [-S SECTION] [-t TEST] [-v] [--app APP] [--gen GEN]
                   [--morph MORPH]
                   test_file

Test morphological transducers for consistency. `hfst-lookup` (or Xerox'
`lookup` with argument -x) must be available on the PATH.

positional arguments:
  test_file             YAML file with test rules

optional arguments:
  -h, --help            show this help message and exit
  -c, --colour          Colours the output
  -X [STATFILE], --statistics [STATFILE]
                        XML file that statistics are to be stored in (Default: quality-stats.xml)
  -C, --compact         Makes output more compact
  -i, --ignore-extra-analyses
                        Ignore extra analyses when there are more than
                        expected, will PASS if the expected one is found.
  -s, --surface         Surface input/analysis tests only
  -l, --lexical         Lexical input/generation tests only
  -f, --hide-fails      Suppresses passes to make finding failures easier
  -p, --hide-passes     Suppresses failures to make finding passes easier
  -S SECTION, --section SECTION
                        The section to be used for testing (default is `hfst`)
  -t TEST, --test TEST  Which test to run (Default: all). TEST = test ID, e.g.
                        'Noun - gåetie' (remember quotes if the ID contains
                        spaces)
  -v, --verbose         More verbose output.
  --app APP             Override application used for test
  --gen GEN             Override generation transducer used for test
  --morph MORPH         Override morph transducer used for test

Will run all tests in the test_file by default.

Usage[edit]

aq-morftest tgl.yaml -X