Difference between revisions of "Lttoolbox"
Line 2: | Line 2: | ||
'''lttoolbox''' is a toolbox for lexical processing, [[morphological analysis]] and generation of words. The analysis is the process of splitting of words splitting a word (e.g. cats) into its lemma 'cat' and the grammatical information <code><n><pl></code>. The generation is the opposite process. |
'''lttoolbox''' is a toolbox for lexical processing, [[morphological analysis]] and generation of words. The analysis is the process of splitting of words splitting a word (e.g. cats) into its lemma 'cat' and the grammatical information <code><n><pl></code>. The generation is the opposite process. |
||
The package is split into three programs, <code>lt-comp</code>, the compiler, <code>lt-proc</code>, the processor, and <code>lt-expand</code>, which generates all possible mappings between surface |
The package is split into three programs, <code>lt-comp</code>, the compiler, <code>lt-proc</code>, the processor, and <code>lt-expand</code>, which generates all possible mappings between [[surface form]]s and [[lexical form]]s in the dictionary. |
||
==Creation== |
|||
{{main|Monodix basics}} |
|||
Morphological analyser specification files, or morphological dictionaries may be found in all of our [[language pair]] packages, from the [[incubator]], or you may elect to create your own (more instructions at the page ''[[Monodix basics]]''). |
|||
==Compilation== |
==Compilation== |
||
Compilation into the binary format is achieved by means of the <code>lt-comp</code> program. You can compile a given <code>.dix</code> from left-to-right (<code>LR</code>), or from right-to-left (<code>RL</code>). Compiling <code>LR</code> usually creates an ''analyser'', compiling <code>RL</code> usually creates a generator.<ref>In all current linguistic packages, left to right is analysis, and right to left is generation. This is not however a software restriction.</ref> |
|||
==Processing== |
==Processing== |
||
Line 13: | Line 20: | ||
==Expansion== |
==Expansion== |
||
==Notes== |
|||
<references/> |
|||
==See also== |
==See also== |
Revision as of 20:52, 5 February 2008
lttoolbox is a toolbox for lexical processing, morphological analysis and generation of words. The analysis is the process of splitting of words splitting a word (e.g. cats) into its lemma 'cat' and the grammatical information <n><pl>
. The generation is the opposite process.
The package is split into three programs, lt-comp
, the compiler, lt-proc
, the processor, and lt-expand
, which generates all possible mappings between surface forms and lexical forms in the dictionary.
Creation
- Main article: Monodix basics
Morphological analyser specification files, or morphological dictionaries may be found in all of our language pair packages, from the incubator, or you may elect to create your own (more instructions at the page Monodix basics).
Compilation
Compilation into the binary format is achieved by means of the lt-comp
program. You can compile a given .dix
from left-to-right (LR
), or from right-to-left (RL
). Compiling LR
usually creates an analyser, compiling RL
usually creates a generator.[1]
Processing
Analysis
Generation
Expansion
Notes
- ↑ In all current linguistic packages, left to right is analysis, and right to left is generation. This is not however a software restriction.