Difference between revisions of "ATT format"

Revision as of 14:18, 10 March 2014

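ATT format is a tab-separated text format for finite-state transducers. Each transition line has four columns: source state, target state, input symbol and output symbol. A line containing only a state number marks that state as final.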

Both lttoolbox and HFST can read ATT format as input to compile dictionaries (lt-comp, hfst-txt2fst), and print compiled dictionaries to ATT format (lt-print, hfst-fst2txt).

Example

Say we want to represent the following transducer:

We can do it thusly:

$ cat test.dix 
<dictionary>
  <alphabet>abcdefghijklmnopqrstuvwxyz</alphabet>
  <sdefs>
    <sdef n="n"/>
  </sdefs>
  <section id="main" type="standard">
    <e><p><l>test</l><r>foo</r></p></e>
  </section>
</dictionary>


$ lt-comp lr test.dix test.bin
main@standard 5 4


$ lt-print test.bin 
0	1	t	f	
1	2	e	o	
2	3	s	o	
3	4	t	ε	
4
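To make the column layout concrete, here is a minimal Python sketch that reads the output above and runs the transducer on an input string. It is an illustration only, not part of lttoolbox or HFST. The function names `parse_att` and `transduce` are hypothetical, the start state is assumed to be `0`, and the set of epsilon spellings (`ε` as printed above, plus `@0@` and `@_EPSILON_SYMBOL_@`) is an assumption about what the tools emit. Weights and epsilon loops are not handled.

```python
from collections import defaultdict

# Assumed spellings of the empty symbol; adjust for your toolchain.
EPSILON = {"ε", "@0@", "@_EPSILON_SYMBOL_@"}

# The lt-print output from the example, with tabs written explicitly.
ATT_TEXT = "0\t1\tt\tf\t\n1\t2\te\to\t\n2\t3\ts\to\t\n3\t4\tt\tε\t\n4\n"


def parse_att(text):
    """Return (transitions, finals) parsed from ATT-format text."""
    transitions = defaultdict(list)  # source -> [(target, input, output)]
    finals = set()
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        fields = line.split("\t")
        if len(fields) >= 4:
            # Transition line: source, target, input symbol, output symbol.
            src, dst, isym, osym = fields[:4]
            transitions[src].append((dst, isym, osym))
        else:
            # Final-state line: the state number (any weight is ignored).
            finals.add(fields[0])
    return transitions, finals


def transduce(transitions, finals, word, start="0"):
    """Return every output string the transducer produces for word."""
    results = []
    stack = [(start, 0, "")]  # (state, position in word, output so far)
    while stack:
        state, pos, out = stack.pop()
        if pos == len(word) and state in finals:
            results.append(out)
        for dst, isym, osym in transitions.get(state, []):
            emitted = "" if osym in EPSILON else osym
            if isym in EPSILON:
                # Epsilon input consumes nothing from the word.
                stack.append((dst, pos, out + emitted))
            elif word.startswith(isym, pos):
                stack.append((dst, pos + len(isym), out + emitted))
    return results


transitions, finals = parse_att(ATT_TEXT)
print(transduce(transitions, finals, "test"))  # prints ['foo']
```

The search keeps a stack of partial paths so that a transducer with several matching paths would return all of their outputs; for the single-path example here it returns one result.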

@@ Line 1: / Line 1: @@
 '''ATT format''' is a transducer format based on a four-column layout. It is a tab separated four-column format.
+Both lttoolbox and HFST can read ATT format as input to compile dictionaries (lt-comp, hfst-txt2fst), and print compiled dictionaries to ATT format (lt-print, hfst-fst2txt).
-==Example output==
+==Example==
 Say we want to represent the following transducer:
@@ Line 8: / Line 9: @@
 We can do it thusly:
 <pre>
 $ cat test.dix
 <dictionary>
@@ Line 33: / Line 32: @@
 	4	t	ε
 </pre>
