Difference between revisions of "Apertium-tat"


Revision as of 19:15, 27 August 2014

Apertium-tat is a morphological analyser/generator and CG tagger for Tatar, currently under development. It is intended to be compatible with transducers for other Turkic languages, so that text can be translated between them. It is used in the following language pairs:

Installation

apertium-tat is located in the languages module of the Apertium SVN repository.

You will need HFST, lttoolbox, apertium and vislcg installed on your computer to be able to use it.
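
Before building, it can help to confirm that the required tools are actually on your PATH. The following is a minimal sketch: `check_tools` is a hypothetical helper, not part of Apertium, and the list of binaries passed to it (hfst-lexc, hfst-twolc, lt-comp, apertium, cg-comp) is an assumption about which commands the build calls.

```shell
# Hypothetical helper: report any of the named commands that are not on PATH.
# Returns 0 if all were found, 1 otherwise.
check_tools() {
    missing=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            missing=1
        fi
    done
    return "$missing"
}

# Assumed set of binaries provided by HFST, lttoolbox, apertium and CG-3.
check_tools hfst-lexc hfst-twolc lt-comp apertium cg-comp \
    || echo "install the missing tools before building"
```

If nothing is printed, all of the listed commands were found.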

If you are on a Debian-based distro, the easiest way to get those dependencies is to install them with apt-get from Tino Didriksen's repository:

wget http://apertium.projectjj.com/apt/install-nightly.sh -O - | sudo bash
sudo apt-get -f install locales build-essential automake subversion pkg-config \
 gawk apertium lttoolbox libapertium3-3.3-dev liblttoolbox3-3.3-dev apertium-lex-tools \
 cg3 hfst libhfst36-dev

Then you can check out apertium-tat from our svn repository and compile it:

svn co http://svn.code.sf.net/p/apertium/svn/languages/apertium-tat/
cd apertium-tat
./autogen.sh
make

Current State

  • Number of stems: 55,702
  • Disambiguation rules: 123
  • Coverage: ~91%
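
The page does not say how coverage is measured. One common approach is naive coverage: the share of tokens that receive at least one analysis. The sketch below assumes that definition and assumes analyser output in the Apertium stream format, where an unknown token appears as ^word/*word$. The function name `naive_coverage` is hypothetical.

```shell
# Naive coverage: percentage of tokens that received at least one analysis.
# Reads Apertium stream-format analyser output on stdin. Each token looks
# like ^surface/analysis$, and an unknown token looks like ^surface/*surface$.
naive_coverage() {
    # Put each ^...$ lexical unit on its own line, then count.
    grep -o '\^[^$]*\$' | awk '
        { total++ }
        /\/\*/ { unknown++ }    # an analysis starting with "*" marks an unknown token
        END {
            if (total == 0) { print "no tokens"; exit 1 }
            printf "%.1f\n", 100 * (total - unknown) / total
        }'
}
```

From the compiled apertium-tat directory it might be used as `apertium -d . tat-morph < corpus.txt | naive_coverage`, where the mode name tat-morph and the file corpus.txt are assumptions.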

corpus          words   coverage
quran           165K    ~89.2%
NewTestament    137K    ~94.2%
aytmatov        5K      ~93.4%
wp2013          128K    ~87.3%
tatnews2005/11  4.6M    ~90.7%