Difference between revisions of "Hfst documentation"
Jump to navigation
Jump to search
(Should we deeplink to the kitwiki documentation?) |
(links) |
||
Line 6: | Line 6: | ||
Hfst consists of a large number of smaller programs, with different functions: |
Hfst consists of a large number of smaller programs, with different functions: |
||
− | * hfst-calculate |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCalculate hfst-calculate] |
** Compiles SFST files into HFST transducers |
** Compiles SFST files into HFST transducers |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompare hfst-compare] |
||
− | * hfst-compare |
||
** Compares two transducers, checking for equivalence |
** Compares two transducers, checking for equivalence |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstCompose hfst-compose] |
||
− | * hfst-compose |
||
** Composes two transducers |
** Composes two transducers |
||
− | * hfst-compose-intersect |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstComposeIntersect hfst-compose-intersect] |
** Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer) |
** Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer) |
||
− | * hfst-concatenate |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConcatenate hfst-concatenate] |
** Concatenates two transducers |
** Concatenates two transducers |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-conjunct] |
||
− | * hfst-conjunct |
||
** Conjuncts two transducers |
** Conjuncts two transducers |
||
− | * hfst-determinize |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstDeterminize hfst-determinize] |
** |
** |
||
− | * hfst-diff-test |
+ | * [ hfst-diff-test] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstConjunct hfst-disjunct] |
||
− | * hfst-disjunct |
||
** Disjuncts two transducers |
** Disjuncts two transducers |
||
− | * hfst-duplicate |
+ | * [ hfst-duplicate] |
** |
** |
||
− | * hfst-foma-wrapper.sh |
+ | * [ hfst-foma-wrapper.sh] |
** |
** |
||
− | * hfst-format |
+ | * [ hfst-format] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Fst hfst-fst2fst] |
||
− | * hfst-fst2fst |
||
** Converts between Hfst, OpenFst, SFST and Foma transducers |
** Converts between Hfst, OpenFst, SFST and Foma transducers |
||
− | * hfst-fst2pairstrings |
+ | * [ hfst-fst2pairstrings] |
** |
** |
||
− | * hfst-fst2strings |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Strings hfst-fst2strings] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstFst2Txt hfst-fst2txt] |
||
− | * hfst-fst2txt |
||
** Prints transducers in AT&T tabular format |
** Prints transducers in AT&T tabular format |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstHead hfst-head] |
||
− | * hfst-head |
||
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstInvert hfst-invert] |
||
− | * hfst-invert |
||
** Turn a transducer upside down. |
** Turn a transducer upside down. |
||
− | * '''hfst-lexc''' |
+ | * [ '''hfst-lexc'''] |
** Compile a lexc file into a finite-state transducer |
** Compile a lexc file into a finite-state transducer |
||
− | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLexc2Fst hfst-lexc2fst] |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLexc2Fst hfst-lexc2fst]] |
** |
** |
||
− | * '''hfst-lookup''' |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstLookUp '''hfst-lookup'''] |
** lookup, gives ''lemma+analysis'' of wordforms |
** lookup, gives ''lemma+analysis'' of wordforms |
||
− | * hfst-lookup-optimize |
+ | * [ hfst-lookup-optimize] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstMinimize hfst-minimize] |
||
− | * hfst-minimize |
||
** |
** |
||
− | * hfst-name |
+ | * [ hfst-name] |
** |
** |
||
− | * hfst-omor-evaluate |
+ | * [ hfst-omor-evaluate] |
** |
** |
||
− | * hfst-pair-test |
+ | * [ hfst-pair-test] |
** |
** |
||
− | * hfst-preprocess-for-optimized-lookup-format |
+ | * [ hfst-preprocess-for-optimized-lookup-format] |
** |
** |
||
− | * hfst-proc |
+ | * [ hfst-proc] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstProject hfst-project] |
||
− | * hfst-project |
||
** |
** |
||
− | * hfst-push-weights |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstPushWeights hfst-push-weights] |
** |
** |
||
− | * hfst-regexp2fst |
+ | * [ hfst-regexp2fst] |
** |
** |
||
− | * hfst-remove-epsilons |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRemoveEpsilons hfst-remove-epsilons] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstRepeat hfst-repeat] |
||
− | * hfst-repeat |
||
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstReverse hfst-reverse] |
||
− | * hfst-reverse |
||
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSplit hfst-split] |
||
− | * hfst-split |
||
** |
** |
||
− | * hfst-strings2fst |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstStrings2Fst hfst-strings2fst] |
** Compiles string-pairs and pair-strings into transducers |
** Compiles string-pairs and pair-strings into transducers |
||
− | * hfst-strip-header |
+ | * [ hfst-strip-header] |
** |
** |
||
− | * hfst-substitute |
+ | * [ hfst-substitute] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSubtract hfst-subtract] |
||
− | * hfst-subtract |
||
** |
** |
||
− | * hfst-summarise |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstSummarize hfst-summarise] |
** Calculates the properties of a transducer |
** Calculates the properties of a transducer |
||
− | * hfst-symbols |
+ | * [ hfst-symbols] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTail hfst-tail] |
||
− | * hfst-tail |
||
** |
** |
||
− | * '''hfst-twolc''' |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTwolC '''hfst-twolc'''] |
** Compiles a twol (two-level morphophonology) file into an Hfst transducer |
** Compiles a twol (two-level morphophonology) file into an Hfst transducer |
||
− | * hfst-twolc-loc |
+ | * [ hfst-twolc-loc] |
** |
** |
||
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstTxt2Fst hfst-txt2fst] |
||
− | * hfst-txt2fst |
||
** Converts AT&T tabular format into binary transducers |
** Converts AT&T tabular format into binary transducers |
||
− | * '''hfst-xfst''' |
+ | * [https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstXfst '''hfst-xfst'''] |
** Compiles xfst files into Hfst transducers |
** Compiles xfst files into Hfst transducers |
||
Revision as of 17:57, 18 July 2011
Hfst is a compiler for finite state transducers. The best documentation for writing such transducers is still Beesley and Karttunen (2003): Finite State Morphology. There still are some important differences between Hfst and the compilers described in the B&K book.
The Hfst subprograms
Hfst consists of a large number of smaller programs, with different functions:
- hfst-calculate
- Compiles SFST files into HFST transducers
- hfst-compare
- Compares two transducers, checking for equivalence
- hfst-compose
- Composes two transducers
- hfst-compose-intersect
- Perform intersecting composition on two transducers (typically the morphotactic transducer/lexicon and the morphophonological transducer)
- hfst-concatenate
- Concatenates two transducers
- hfst-conjunct
- Conjuncts two transducers
- hfst-determinize
- [ hfst-diff-test]
- hfst-disjunct
- Disjuncts two transducers
- [ hfst-duplicate]
- [ hfst-foma-wrapper.sh]
- [ hfst-format]
- hfst-fst2fst
- Converts between Hfst, OpenFst, SFST and Foma transducers
- [ hfst-fst2pairstrings]
- hfst-fst2strings
- hfst-fst2txt
- Prints transducers in AT&T tabular format
- hfst-head
- hfst-invert
- Turn a transducer upside down.
- [ hfst-lexc]
- Compile a lexc file into a finite-state transducer
- hfst-lexc2fst]
- hfst-lookup
- lookup, gives lemma+analysis of wordforms
- [ hfst-lookup-optimize]
- hfst-minimize
- [ hfst-name]
- [ hfst-omor-evaluate]
- [ hfst-pair-test]
- [ hfst-preprocess-for-optimized-lookup-format]
- [ hfst-proc]
- hfst-project
- hfst-push-weights
- [ hfst-regexp2fst]
- hfst-remove-epsilons
- hfst-repeat
- hfst-reverse
- hfst-split
- hfst-strings2fst
- Compiles string-pairs and pair-strings into transducers
- [ hfst-strip-header]
- [ hfst-substitute]
- hfst-subtract
- hfst-summarise
- Calculates the properties of a transducer
- [ hfst-symbols]
- hfst-tail
- hfst-twolc
- Compiles a twol (two-level morphophonology) file into an Hfst transducer
- [ hfst-twolc-loc]
- hfst-txt2fst
- Converts AT&T tabular format into binary transducers
- hfst-xfst
- Compiles xfst files into Hfst transducers