Difference between revisions of "Hfst documentation"
Jump to navigation
Jump to search
(Should we deeplink to the kitwiki documentation?) |
(links) |
||
Line 6: | Line 6: | ||
Hfst consists of a large number of smaller programs, with different functions: |
Hfst consists of a large number of smaller programs, with different functions: |
||
* hfst-calculate |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCalculate hfst-calculate] |
||
** Compiles SFST files into HFST transducers |
** Compiles SFST files into HFST transducers |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompare hfst-compare] |
|||
* hfst-compare |
|||
** Compares two transducers, checking for equivalence |
** Compares two transducers, checking for equivalence |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompose hfst-compose] |
|||
* hfst-compose |
|||
** Composes two transducers |
** Composes two transducers |
||
* hfst-compose-intersect |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstComposeIntersect hfst-compose-intersect] |
||
** Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer) |
** Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer) |
||
* hfst-concatenate |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConcatenate hfst-concatenate] |
||
** Concatenates two transducers |
** Concatenates two transducers |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-conjunct] |
|||
* hfst-conjunct |
|||
** Conjuncts two transducers |
** Conjuncts two transducers |
||
* hfst-determinize |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstDeterminize hfst-determinize] |
||
** |
** |
||
* hfst-diff-test |
* [ hfst-diff-test] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-disjunct] |
|||
* hfst-disjunct |
|||
** Disjuncts two transducers |
** Disjuncts two transducers |
||
* hfst-duplicate |
* [ hfst-duplicate] |
||
** |
** |
||
* hfst-foma-wrapper.sh |
* [ hfst-foma-wrapper.sh] |
||
** |
** |
||
* hfst-format |
* [ hfst-format] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Fst hfst-fst2fst] |
|||
* hfst-fst2fst |
|||
** Converts between Hfst, OpenFst, SFST and Foma transducers |
** Converts between Hfst, OpenFst, SFST and Foma transducers |
||
* hfst-fst2pairstrings |
* [ hfst-fst2pairstrings] |
||
** |
** |
||
* hfst-fst2strings |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Strings hfst-fst2strings] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Txt hfst-fst2txt] |
|||
* hfst-fst2txt |
|||
** Prints transducers in AT&T tabular format |
** Prints transducers in AT&T tabular format |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstHead hfst-head] |
|||
* hfst-head |
|||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstInvert hfst-invert] |
|||
* hfst-invert |
|||
** Turn a transducer upside down. |
** Turn a transducer upside down. |
||
* '''hfst-lexc''' |
* [ '''hfst-lexc'''] |
||
** Compile a lexc file into a finite-state transducer |
** Compile a lexc file into a finite-state transducer |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLexc2Fst hfst-lexc2fst] |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLexc2Fst hfst-lexc2fst]] |
||
** |
** |
||
* '''hfst-lookup''' |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLookUp '''hfst-lookup'''] |
||
** lookup, gives ''lemma+analysis'' of wordforms |
** lookup, gives ''lemma+analysis'' of wordforms |
||
* hfst-lookup-optimize |
* [ hfst-lookup-optimize] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstMinimize hfst-minimize] |
|||
* hfst-minimize |
|||
** |
** |
||
* hfst-name |
* [ hfst-name] |
||
** |
** |
||
* hfst-omor-evaluate |
* [ hfst-omor-evaluate] |
||
** |
** |
||
* hfst-pair-test |
* [ hfst-pair-test] |
||
** |
** |
||
* hfst-preprocess-for-optimized-lookup-format |
* [ hfst-preprocess-for-optimized-lookup-format] |
||
** |
** |
||
* hfst-proc |
* [ hfst-proc] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstProject hfst-project] |
|||
* hfst-project |
|||
** |
** |
||
* hfst-push-weights |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstPushWeights hfst-push-weights] |
||
** |
** |
||
* hfst-regexp2fst |
* [ hfst-regexp2fst] |
||
** |
** |
||
* hfst-remove-epsilons |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRemoveEpsilons hfst-remove-epsilons] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRepeat hfst-repeat] |
|||
* hfst-repeat |
|||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstReverse hfst-reverse] |
|||
* hfst-reverse |
|||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSplit hfst-split] |
|||
* hfst-split |
|||
** |
** |
||
* hfst-strings2fst |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstStrings2Fst hfst-strings2fst] |
||
** Compiles string-pairs and pair-strings into transducers |
** Compiles string-pairs and pair-strings into transducers |
||
* hfst-strip-header |
* [ hfst-strip-header] |
||
** |
** |
||
* hfst-substitute |
* [ hfst-substitute] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSubtract hfst-subtract] |
|||
* hfst-subtract |
|||
** |
** |
||
* hfst-summarise |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSummarize hfst-summarise] |
||
** Calculates the properties of a transducer |
** Calculates the properties of a transducer |
||
* hfst-symbols |
* [ hfst-symbols] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTail hfst-tail] |
|||
* hfst-tail |
|||
** |
** |
||
* '''hfst-twolc''' |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTwolC '''hfst-twolc'''] |
||
** Compiles a twol (two-level morphophonology) file into an Hfst transducer |
** Compiles a twol (two-level morphophonology) file into an Hfst transducer |
||
* hfst-twolc-loc |
* [ hfst-twolc-loc] |
||
** |
** |
||
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTxt2Fst hfst-txt2fst] |
|||
* hfst-txt2fst |
|||
** Converts AT&T tabular format into binary transducers |
** Converts AT&T tabular format into binary transducers |
||
* '''hfst-xfst''' |
* [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstXfst '''hfst-xfst'''] |
||
** Compiles xfst files into Hfst transducers |
** Compiles xfst files into Hfst transducers |
||
Revision as of 17:57, 18 July 2011
Hfst is a compiler for finite state transducers. The best documentation for writing such transducers is still Beesley and Karttunen (2003): Finite State Morphology. There still are some important differences between Hfst and the compilers described in the B&K book.
The Hfst subprograms
Hfst consists of a large number of smaller programs, with different functions:
- hfst-calculate
- Compiles SFST files into HFST transducers
- hfst-compare
- Compares two transducers, checking for equivalence
- hfst-compose
- Composes two transducers
- hfst-compose-intersect
- Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer)
- hfst-concatenate
- Concatenates two transducers
- hfst-conjunct
- Conjuncts two transducers
- hfst-determinize
- [ hfst-diff-test]
- hfst-disjunct
- Disjuncts two transducers
- [ hfst-duplicate]
- [ hfst-foma-wrapper.sh]
- [ hfst-format]
- hfst-fst2fst
- Converts between Hfst, OpenFst, SFST and Foma transducers
- [ hfst-fst2pairstrings]
- hfst-fst2strings
- hfst-fst2txt
- Prints transducers in AT&T tabular format
- hfst-head
- hfst-invert
- Turn a transducer upside down.
- [ hfst-lexc]
- Compile a lexc file into a finite-state transducer
- hfst-lexc2fst]
- hfst-lookup
- lookup, gives lemma+analysis of wordforms
- [ hfst-lookup-optimize]
- hfst-minimize
- [ hfst-name]
- [ hfst-omor-evaluate]
- [ hfst-pair-test]
- [ hfst-preprocess-for-optimized-lookup-format]
- [ hfst-proc]
- hfst-project
- hfst-push-weights
- [ hfst-regexp2fst]
- hfst-remove-epsilons
- hfst-repeat
- hfst-reverse
- hfst-split
- hfst-strings2fst
- Compiles string-pairs and pair-strings into transducers
- [ hfst-strip-header]
- [ hfst-substitute]
- hfst-subtract
- hfst-summarise
- Calculates the properties of a transducer
- [ hfst-symbols]
- hfst-tail
- hfst-twolc
- Compiles a twol (two-level morphophonology) file into an Hfst transducer
- [ hfst-twolc-loc]
- hfst-txt2fst
- Converts AT&T tabular format into binary transducers
- hfst-xfst
- Compiles xfst files into Hfst transducers