Difference between revisions of "Constraint Grammar"
Jump to navigation
Jump to search
Line 9: | Line 9: | ||
::Apertium equivalent: <code>^word<n><pl>$ |
::Apertium equivalent: <code>^word<n><pl>$ |
||
* ''wordform'' — a [[surface form]] of a word. |
* ''wordform'' — a [[surface form]] of a word. |
||
==Languages using CG in Apertium== |
|||
* [[Breton]] |
|||
* [[Welsh]] |
|||
* [[Norwegian Nynorsk and Norwegian Bokmål]] |
|||
* [[Irish Gaelic]] |
|||
==See also== |
==See also== |
Revision as of 08:52, 17 February 2010
Constraint Grammar is a tool that can be used to POS-tag ambiguous text. There are free constraint grammars developed outside the Apertium project for: Norwegian (the Oslo-Bergen tagger), Sámi languages (from Giellatekno) and Faroese (also from Giellatekno).
Terminology
- See also: Apertium stream format
- cohort — a surface form of a word, along with its analyses (possible lexical units), an ambiguous lexical unit.
- Apertium equivalent:
^words/word<n><pl>/word<vblex><pres><p3><sg>$
- Apertium equivalent:
- baseform — the lemma of a word.
- reading — a single analysis of a word.
- Apertium equivalent:
^word<n><pl>$
- Apertium equivalent:
- wordform — a surface form of a word.
Languages using CG in Apertium
See also
- Apertium and Constraint Grammar -- installation and use
- Constructing a TSX file with a Constraint Grammar
External links