ATT format

From Apertium
Revision as of 14:18, 10 March 2014 by Unhammer (talk | contribs)
Jump to navigation Jump to search

ATT format is a transducer format based on a four-column layout. It is a tab separated four-column format.

Both lttoolbox and HFST can read ATT format as input to compile dictionaries (lt-comp, hfst-txt2fst), and print compiled dictionaries to ATT format (lt-print, hfst-fst2txt).

Example

Say we want to represent the following transducer:

Test att.png

We can do it thusly:

$ cat test.dix 
<dictionary>
  <alphabet>abcdefghijklmnopqrstuvwxyz</alphabet>
  <sdefs>
    <sdef n="n"/>
  </sdefs>
  <section id="main" type="standard">
    <e><p><l>test</l><r>foo</r></p></e>
  </section>
</dictionary>


$ lt-comp lr test.dix test.bin
main@standard 5 4


$ lt-print test.bin 
0	1	t	f	
1	2	e	o	
2	3	s	o	
3	4	t	ε	
4

See also