Difference between revisions of "Constraint Grammar"
Jump to navigation
Jump to search
m |
m (missing </code>) |
||
Line 8: | Line 8: | ||
* ''baseform'' — the [[lemma]] of a word. |
* ''baseform'' — the [[lemma]] of a word. |
||
* ''reading'' — a single analysis of a word. |
* ''reading'' — a single analysis of a word. |
||
::Apertium equivalent: <code>^word<n><pl>$ |
::Apertium equivalent: <code>^word<n><pl>$</code> |
||
* ''wordform'' — a [[surface form]] of a word. |
* ''wordform'' — a [[surface form]] of a word. |
||
Revision as of 13:38, 24 March 2010
Constraint Grammar is a tool that can be used to POS-tag ambiguous text. There are free constraint grammars developed outside the Apertium project for: Norwegian (the Oslo-Bergen tagger), Sámi languages (from Giellatekno) and Faroese (also from Giellatekno).
Terminology
- See also: Apertium stream format
- cohort — a surface form of a word, along with its analyses (possible lexical units), an ambiguous lexical unit.
- Apertium equivalent:
^words/word<n><pl>/word<vblex><pres><p3><sg>$
- Apertium equivalent:
- baseform — the lemma of a word.
- reading — a single analysis of a word.
- Apertium equivalent:
^word<n><pl>$
- Apertium equivalent:
- wordform — a surface form of a word.
Languages using CG in Apertium
See also
- Apertium and Constraint Grammar -- installation and use
- Introduksjon til føringsgrammatikk -- a HOWTO, in Norwegian bokmål
External links
- VISL CG-3 Development Information + documentation and downloads
- Basic Tutorial for VISL CG-3
- cg-mode for emacs, gives basic syntax highlighting and indentation