Difference between revisions of "Speling format"
Jump to navigation
Jump to search
(New page: The '''Speling format''' is a way of representing "full form" vocabulary lists in a way that makes generating paradigms, and lemma-paradigm pairs for Apertium monodices easy. ...) |
|||
Line 18: | Line 18: | ||
wolf; wolf; sg; n |
wolf; wolf; sg; n |
||
wolf; wolves; sg; n |
wolf; wolves; sg; n |
||
</pre> |
|||
Or in Spanish: |
|||
<pre> |
|||
casa; casa; sg; n.f |
|||
casa; casas; pl; n.f |
|||
... |
|||
</pre> |
</pre> |
||
Revision as of 19:42, 22 March 2008
The Speling format is a way of representing "full form" vocabulary lists in a way that makes generating paradigms, and lemma-paradigm pairs for Apertium monodices easy.
The format is broadly organised as follows:
lemma; surface form; sub-category; part-of-speech
So for example in English we might see some noun inflection represented as:
house; house; sg; n house; houses; pl; n computer; computer; sg; n computer; computers; sg; n bird; bird; sg; n bird; birds; sg; n wolf; wolf; sg; n wolf; wolves; sg; n
Or in Spanish:
casa; casa; sg; n.f casa; casas; pl; n.f ...
From lists such as these it is fairly straightforward in languages with basic inflectional morphology to generate paradigms, or alternatively to generate partial paradigms for languages with richer morphology.
The format is named after speling.org who collect full form lists for spell-checkers in several Germanic languages and Finnish.