Difference between revisions of "Hfst documentation"

From Apertium
Hfst consists of a large number of smaller programs, with different functions:

* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCalculate hfst-calculate]
** Compiles SFST files into HFST transducers
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompare hfst-compare]
** Compares two transducers, checking for equivalence
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompose hfst-compose]
** Composes two transducers
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstComposeIntersect hfst-compose-intersect]
** Performs intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer)
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConcatenate hfst-concatenate]
** Concatenates two transducers
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-conjunct]
** Conjuncts two transducers
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstDeterminize hfst-determinize]
* hfst-diff-test
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-disjunct]
** Disjuncts two transducers
* hfst-duplicate
* hfst-foma-wrapper.sh
* hfst-format
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Fst hfst-fst2fst]
** Converts between Hfst, OpenFst, SFST and Foma transducers
* hfst-fst2pairstrings
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Strings hfst-fst2strings]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Txt hfst-fst2txt]
** Prints transducers in AT&T tabular format
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstHead hfst-head]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstInvert hfst-invert]
** Inverts a transducer, swapping its input and output sides
* '''hfst-lexc'''
** Compiles a lexc file into a finite-state transducer
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLexc2Fst hfst-lexc2fst]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLookUp '''hfst-lookup''']
** Looks up word forms and returns their ''lemma+analysis''
* hfst-lookup-optimize
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstMinimize hfst-minimize]
* hfst-name
* hfst-omor-evaluate
* hfst-pair-test
* hfst-preprocess-for-optimized-lookup-format
* hfst-proc
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstProject hfst-project]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstPushWeights hfst-push-weights]
* hfst-regexp2fst
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRemoveEpsilons hfst-remove-epsilons]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRepeat hfst-repeat]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstReverse hfst-reverse]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSplit hfst-split]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstStrings2Fst hfst-strings2fst]
** Compiles string-pairs and pair-strings into transducers
* hfst-strip-header
* hfst-substitute
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSubtract hfst-subtract]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSummarize hfst-summarise]
** Calculates the properties of a transducer
* hfst-symbols
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTail hfst-tail]
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTwolC '''hfst-twolc''']
** Compiles a twol (two-level morphophonology) file into an Hfst transducer
* hfst-twolc-loc
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTxt2Fst hfst-txt2fst]
** Converts AT&T tabular format into binary transducers
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstXfst '''hfst-xfst''']
** Compiles xfst files into Hfst transducers
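
To make the terms in the list more concrete, here is a small Python sketch of what composing, inverting and looking up mean. It is only a toy model and makes several assumptions: a transducer is treated as a finite set of string pairs, the function names <code>compose</code>, <code>invert</code> and <code>lookup</code> are made up for illustration and are not the Hfst API, and the example data is hypothetical. It shows plain composition, not the intersecting composition done by hfst-compose-intersect, and real Hfst transducers are state machines that can encode infinite relations.

```python
# Toy model: a transducer viewed as a finite set of (input, output) string pairs.
# This is an illustration of the concepts only, not how Hfst stores transducers.

def invert(relation):
    """Swap the two sides of every pair (the idea behind hfst-invert)."""
    return {(out, inp) for inp, out in relation}

def compose(first, second):
    """Keep (a, c) whenever first maps a to b and second maps that same b to c."""
    return {(a, c) for a, b in first for b2, c in second if b == b2}

def lookup(relation, word):
    """Return every output paired with the given input, in sorted order."""
    return sorted(out for inp, out in relation if inp == word)

# Hypothetical data: the lexicon pairs analyses with intermediate forms,
# the rules pair intermediate forms with surface forms.
lexicon = {("cat+N+Pl", "cat^s"), ("fox+N+Pl", "fox^s")}
rules = {("cat^s", "cats"), ("fox^s", "foxes")}

generator = compose(lexicon, rules)  # analysis -> surface form
analyser = invert(generator)         # surface form -> analysis

print(lookup(analyser, "foxes"))     # ['fox+N+Pl']
```

The same chain (build a generator by composition, invert it to get an analyser, then look up word forms) is roughly what the corresponding command-line tools are combined to do, although the exact commands and options should be taken from each tool's own documentation.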



Revision as of 17:57, 18 July 2011

Hfst is a compiler for finite-state transducers. The best documentation for writing such transducers is still Beesley and Karttunen (2003), Finite State Morphology. There are, however, some important differences between Hfst and the compilers described in that book.


The Hfst subprograms

Hfst consists of a large number of smaller programs, with different functions:

See also