Difference between revisions of "List of symbols"
(columns and a little update) |
|||
Line 1: | Line 1: | ||
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags. |
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags. |
||
[[Liste |
[[Liste de symboles|En français]] · [[Список символов|по-русски]] |
||
{{TOCD}} |
{{TOCD}} |
||
Line 33: | Line 33: | ||
| <code>preadv</code> || Pre-adverb || |
| <code>preadv</code> || Pre-adverb || |
||
|- |
|- |
||
| <code>postadv</code> |
| <code>postadv</code> || Post-adverb || |
||
|- |
|- |
||
| <code>mod</code> || Модальное слово || [http://dic.academic.ru/dic.nsf/lingvistic/749] |
| <code>mod</code> || Модальное слово || [http://dic.academic.ru/dic.nsf/lingvistic/749] |
||
|- |
|- |
||
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia] |
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia] |
||
Line 55: | Line 55: | ||
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia] |
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia] |
||
|- |
|- |
||
| <code>sent</code> || Sentence-ending punctuation |
| <code>sent</code> || Sentence-ending punctuation || e.g. full stop, question mark |
||
|- |
|- |
||
| <code>cm</code> || Comma punctuation || , |
| <code>cm</code> || Comma punctuation || , |
||
|- |
|- |
||
| <code>lquot</code> || Left quote || « |
| <code>lquot</code> || Left quote || « |
||
⚫ | |||
⚫ | |||
|- |
|- |
||
⚫ | |||
⚫ | |||
|} |
|} |
||
Line 80: | Line 80: | ||
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages |
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages |
||
|- |
|- |
||
| <code>mi</code> || Masculine (inanimate) |
| <code>mi</code> || Masculine (inanimate) || Mostly in Slavic languages |
||
|- |
|- |
||
| <code>mp</code> || Masculine (personal) |
| <code>mp</code> || Masculine (personal) || in Polish |
||
|- |
|- |
||
| <code>mn</code> || Masculine and Neuter |
| <code>mn</code> || Masculine and Neuter || |
||
|- |
|- |
||
| <code>fn</code> || Feminine and Neuter |
| <code>fn</code> || Feminine and Neuter || |
||
|- |
|- |
||
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages. |
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages. |
||
Line 92: | Line 92: | ||
| <code>mf</code> || Masculine , feminine || This is used where the gender can be either masculine or feminine |
| <code>mf</code> || Masculine , feminine || This is used where the gender can be either masculine or feminine |
||
|- |
|- |
||
| <code>mfn</code> || Masculine , feminine , neuter |
| <code>mfn</code> || Masculine , feminine , neuter || This is used where the gender can be either masculine, feminine or neuter |
||
|- |
|- |
||
| <code>un</code> || Common, neuter || As above, only common or neuter |
| <code>un</code> || Common, neuter || As above, only common or neuter |
||
|- |
|- |
||
| <code>GD</code> || Gender to be determined || |
| <code>GD</code> || Gender to be determined || |
||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
|- |
|- |
||
|} |
|} |
||
Line 122: | Line 111: | ||
| <code>du</code> || Dual || |
| <code>du</code> || Dual || |
||
|- |
|- |
||
| <code>ct</code> || Count |
| <code>ct</code> || Count || see mk-bg |
||
|- |
|- |
||
| <code>coll</code> |
| <code>coll</code> || Collective || |
||
|- |
|- |
||
| <code>sp</code> || Singular , plural || |
| <code>sp</code> || Singular , plural || |
||
Line 130: | Line 119: | ||
| <code>ND</code> || Number to be determined || |
| <code>ND</code> || Number to be determined || |
||
|- |
|- |
||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
|- |
|||
|} |
|} |
||
Line 149: | Line 149: | ||
| <code>voc</code> || Vocative || |
| <code>voc</code> || Vocative || |
||
|- |
|- |
||
| <code>abl</code> || Ablative || |
| <code>abl</code> || Ablative || [http://en.wikipedia.org/wiki/Ablative wikipedia] |
||
|- |
|- |
||
| <code>ins</code> || Instrumental || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia] |
| <code>ins</code> || Instrumental || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia] |
||
|- |
|- |
||
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia] |
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia] |
||
⚫ | |||
⚫ | |||
|- |
|- |
||
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia] |
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia] |
||
|- |
|- |
||
| <code>tra</code> || Translative |
| <code>tra</code> || Translative || |
||
|- |
|- |
||
| <code>ill</code> || Illative || |
| <code>ill</code> || Illative || |
||
|- |
|- |
||
| <code>ine</code> || Inessive || |
| <code>ine</code> || Inessive || |
||
|- |
|- |
||
| <code>ade</code> || Adessive || |
| <code>ade</code> || Adessive || |
||
|- |
|- |
||
| <code>all</code> || Allative || |
| <code>all</code> || Allative || |
||
|- |
|- |
||
| <code>abe</code> || Abessive || |
| <code>abe</code> || Abessive || |
||
|- |
|- |
||
| <code>ess</code> || Essive || |
| <code>ess</code> || Essive || |
||
|- |
|- |
||
| <code>par</code> || Partitive || |
| <code>par</code> || Partitive || |
||
|- |
|- |
||
| <code>dis</code> || Distributive |
| <code>dis</code> || Distributive || |
||
|- |
|- |
||
| <code>com</code> || Comitative || |
| <code>com</code> || Comitative || |
||
|- |
|- |
||
| <code>soc</code> || Sociative || |
| <code>soc</code> || Sociative || |
||
|- |
|- |
||
| <code>prl</code> || Prolative || |
| <code>prl</code> || Prolative || |
||
|} |
|} |
||
Line 191: | Line 189: | ||
| <code>actv</code> || Active voice || |
| <code>actv</code> || Active voice || |
||
|- |
|- |
||
| |
| <code>pass</code> || Passive voice || is more used in Turkic. |
||
⚫ | |||
⚫ | |||
|- |
|- |
||
| <code>midv</code> || Middle voice || |
| <code>midv</code> || Middle voice || |
||
Line 254: | Line 254: | ||
|} |
|} |
||
=== |
===Person=== |
||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes |
||
|- |
|- |
||
| <code> |
| <code>p1</code> || First person || |
||
|- |
|- |
||
| <code> |
| <code>p2</code> || Second person || |
||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
|- |
|- |
||
|} |
|} |
||
===Derivations=== |
|||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
===Possession=== |
===Possession=== |
||
Line 268: | Line 282: | ||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes |
||
|- |
|- |
||
| <code>px1sg</code> || First person singular possessive |
| <code>px1sg</code> || First person singular possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px2sg</code> || Second person singular possessive |
| <code>px2sg</code> || Second person singular possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px3sg</code> || Third person singular possessive |
| <code>px3sg</code> || Third person singular possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px1pl</code> || First person plural possessive |
| <code>px1pl</code> || First person plural possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px2pl</code> || Second person plural possessive |
| <code>px2pl</code> || Second person plural possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px3pl</code> || Third person plural possessive |
| <code>px3pl</code> || Third person plural possessive || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
| <code>px3sp</code> || Third person possessive singular/plural |
| <code>px3sp</code> || Third person possessive singular/plural || e.g. in [[Turkic languages]] |
||
|- |
|- |
||
|} |
|} |
||
Line 300: | Line 314: | ||
|- |
|- |
||
| <code>al</code> || Altres || Other, misc. |
| <code>al</code> || Altres || Other, misc. |
||
⚫ | |||
===Person=== |
|||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
⚫ | |||
|} |
|} |
||
Line 346: | Line 345: | ||
|- |
|- |
||
| <code>pred</code> || Predicative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
| <code>pred</code> || Predicative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
||
⚫ | |||
| <code>preadj</code> || Pre-adjective || for languages where most of adjectives are after the noun (ex: French in eo->fr bidix) |
|||
⚫ | |||
| <code>preadj_nh</code> || Pre-adjective if not human || according to the noun, the adjective is before or after |
|||
|- |
|- |
||
|} |
|} |
||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes |
||
|- |
|- |
||
| <code>tn</code> || Tónico |
| <code>tn</code> || Tónico || |
||
|- |
|- |
||
| <code>detnt</code> || Neuter determiner |
| <code>detnt</code> || Neuter determiner || |
||
|- |
|- |
||
| <code>predet</code> || Pre determiner |
| <code>predet</code> || Pre determiner || |
||
|- |
|- |
||
| <code>atn</code> || Atónico |
| <code>atn</code> || Atónico || |
||
|- |
|- |
||
| <code>qnt</code> || Quantifier |
| <code>qnt</code> || Quantifier || |
||
|- |
|- |
||
| <code>ord</code> || Ordinal |
| <code>ord</code> || Ordinal || |
||
|- |
|- |
||
| <code>obj</code> || Object |
| <code>obj</code> || Object || |
||
|- |
|- |
||
| <code>subj</code> || Subject |
| <code>subj</code> || Subject || |
||
|- |
|- |
||
| <code>pro</code> || Proclitic |
| <code>pro</code> || Proclitic || |
||
|- |
|- |
||
| <code>enc</code> || Enclitic |
| <code>enc</code> || Enclitic || |
||
|- |
|- |
||
| <code>acr</code> || Acronym |
| <code>acr</code> || Acronym || |
||
|- |
|- |
||
| <code>rel</code> || Relative |
| <code>rel</code> || Relative || |
||
|- |
|- |
||
| <code>ind</code> || Indefinite |
| <code>ind</code> || Indefinite || |
||
|- |
|- |
||
| <code>itg</code> || Interrogative |
| <code>itg</code> || Interrogative || |
||
|- |
|- |
||
| <code>dem</code> || Demonstrative |
| <code>dem</code> || Demonstrative || |
||
|- |
|- |
||
| <code>def</code> || Definite |
| <code>def</code> || Definite || |
||
|- |
|- |
||
| <code>pos</code> || Possesive |
| <code>pos</code> || Possesive || |
||
|- |
|- |
||
| <code>ref</code> || Reflexive |
| <code>ref</code> || Reflexive || |
||
|- |
|- |
||
| <code>prx</code> || Proximate |
| <code>prx</code> || Proximate || |
||
|- |
|- |
||
| <code>dst</code> || Distal |
| <code>dst</code> || Distal || |
||
|} |
|} |
||
Line 414: | Line 417: | ||
{|class=wikitable |
{|class=wikitable |
||
! XML tag |
! XML tag !! Means !! Appears in XML tags / notes / examples |
||
|- |
|- |
||
| <code><dictionary></code> |
| <code><dictionary></code> || Mono- or bilingual dictionary || In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix |
||
|- |
|- |
||
| <code><alphabet></code> |
| <code><alphabet></code> || Set of characters in the language|| In <code><dictionary></code> |
||
|- |
|- |
||
| <code><sdefs></code> |
| <code><sdefs></code> || Symbol definitions || In <code><dictionary></code> |
||
|- |
|- |
||
| <code><sdef></code> || Symbol definition || In <code><sdefs></code>. Ex: <code><sdef n="noun"/></code> |
| <code><sdef></code> || Symbol definition || In <code><sdefs></code>. Ex: <code><sdef n="noun"/></code> |
||
|- |
|- |
||
| <code><pardefs></code> |
| <code><pardefs></code> || Paradigm definitions || In <code><dictionary></code>. |
||
|- |
|- |
||
| <code><pardef></code> |
| <code><pardef></code> || Paradigm definition || In <code><pardefs></code>. |
||
|- |
|- |
||
| <code><section> </code> |
| <code><section> </code> || A section of the dictionary || In <code><dictionary></code>. Ex: <code><section id="main" type="standard"></code> |
||
|- |
|- |
||
| <code><e></code> || A dictionary entry (a word) |
| <code><e></code> || A dictionary entry (a word) || In <code><section></code> and in <code><pardef></code>. |
||
|- |
|- |
||
| <code><i></code> || Invariant (left and right side) |
| <code><i></code> || Invariant (left and right side) || In <code><e></code>. Ex.: <code><i>beer</i></code> |
||
|- |
|- |
||
| <code><p></code> || A pair || In <code><e></code>. |
| <code><p></code> || A pair || In <code><e></code>. |
||
Line 469: | Line 472: | ||
{|class=wikitable |
{|class=wikitable |
||
! XML attribute value |
! XML attribute value !! Means !! Appears in attribute || Notes |
||
|- |
|- |
||
| <code>whole</code> || lemma and grammatical symbols |
| <code>whole</code> || lemma and grammatical symbols || part |
||
|- |
|- |
||
| <code>lem</code> || lemma || part |
| <code>lem</code> || lemma || part |
||
|- |
|- |
||
| <code>lemh</code> || (inflected) head word of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] |
| <code>lemh</code> || (inflected) head word of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part |
||
|- |
|- |
||
| <code>lemq</code> || following queue of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part |
| <code>lemq</code> || following queue of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part |
||
|- |
|- |
||
|} |
|} |
Revision as of 21:46, 11 June 2016
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags.
This is meant to be a glossary of symbol names in alphabetical order with notes. Some of these names are specific to particular packages or language pairs, as not all languages have the same grammatical features (most don't have spatial distinction in articles for example).
If you were wondering what the symbols #, /, @, +, ~ or * mean, read Apertium stream format.
Part-of-speech Categories
Symbol | Gloss | Notes |
---|---|---|
n |
Noun | see 'np' for proper noun |
vblex |
Standard verb | see also: vbser, vbhaver, vbmod, vaux |
vbmod |
Modal verb | |
vbser |
Verb "to be" | from ser (to be) |
vbhaver |
Verb "to have" | from haver (to have) |
vaux |
Auxilliary verb | wikipedia |
adj |
Adjective | |
post |
Postposition | |
adv |
Adverb | |
preadv |
Pre-adverb | |
postadv |
Post-adverb | |
mod |
Модальное слово | [1] |
det |
Determiner | wikipedia |
prn |
Pronoun | wikipedia |
pr |
Preposition | wikipedia |
num |
Numeral | |
np |
Proper noun | From nom propi wikipedia |
ij |
Interjection | wikipedia |
cnjcoo |
Co-ordinating conjunction | wikipedia |
cnjsub |
Sub-ordinating conjunction | |
cnjadv |
Conjunctive adverb | wikipedia |
sent |
Sentence-ending punctuation | e.g. full stop, question mark |
cm |
Comma punctuation | , |
lquot |
Left quote | « |
rquot |
Right quote | » |
Part-of-speech Sub-categories
Gender
Symbol | Gloss | Notes |
---|---|---|
f |
Feminine | |
m |
Masculine | |
nt |
Neuter | |
ma |
Masculine (animate) | Mostly in Slavic languages |
mi |
Masculine (inanimate) | Mostly in Slavic languages |
mp |
Masculine (personal) | in Polish |
mn |
Masculine and Neuter | |
fn |
Feminine and Neuter | |
ut |
Common | From utrum, found in Scandinavian languages. |
mf |
Masculine , feminine | This is used where the gender can be either masculine or feminine |
mfn |
Masculine , feminine , neuter | This is used where the gender can be either masculine, feminine or neuter |
un |
Common, neuter | As above, only common or neuter |
GD |
Gender to be determined |
Number
Symbol | Gloss | Notes |
---|---|---|
sg |
Singular | |
pl |
Plural | |
du |
Dual | |
ct |
Count | see mk-bg |
coll |
Collective | |
sp |
Singular , plural | |
ND |
Number to be determined |
Count/Mass
Symbol | Gloss | Notes |
---|---|---|
cnt |
Countable | |
unc |
Uncountable (mass) |
Case
Symbol | Gloss | Notes |
---|---|---|
nom |
Nominative | |
acc |
Accusative | |
dat |
Dative | |
gen |
Genitive | |
dg |
Dative and Genitive | in ro-es, discouraged in new developments |
voc |
Vocative | |
abl |
Ablative | wikipedia |
ins |
Instrumental | wikipedia |
loc |
Locative | wikipedia |
prp |
Prepositional | wikipedia |
tra |
Translative | |
ill |
Illative | |
ine |
Inessive | |
ade |
Adessive | |
all |
Allative | |
abe |
Abessive | |
ess |
Essive | |
par |
Partitive | |
dis |
Distributive | |
com |
Comitative | |
soc |
Sociative | |
prl |
Prolative |
Voice
Symbol | Gloss | Notes |
---|---|---|
actv |
Active voice | |
pass |
Passive voice | is more used in Turkic. |
pasv |
Passive voice | is more used in Germanic. |
midv |
Middle voice | |
nactv |
Non-active voice | See Albanian. |
Tense and mode
Symbol | Gloss | Notes |
---|---|---|
pres |
Present | |
pret |
Preterite | |
past |
Past | |
imp |
Imperative | |
inf |
Infinitive | |
pp |
Past participle | wikipedia |
pp2 |
Past participle (???) | It's at least used in the Esperanto dictionaries for future active participles, ont (seems quite odd) |
pp3 |
Past participle (???) | It's at least used in the Esperanto dictionaries for past active participles, int (seems quite odd) |
pprs |
Present participle | Also appears as ppres (deprecated)
|
ger |
Gerund | wikipedia |
supn |
Supine | wikipedia |
pri |
Present indicative | see also: pres. wikipedia |
pii |
Imperfect | from Pretério imperfecto de indicativo |
fti |
Future indicative | |
fts |
Future subjunctive | |
cni |
Conditional | |
plu |
Pluperfect | In cy-en
|
pmp |
Pluperfect | In es-gl (from Pluscamperfecto)
|
prs |
Present subjunctive | wikipedia |
pis |
Imperfect subjunctive | |
ifi |
Past definite | from Pretério perfecto o indefinido |
aff |
Affirmative | |
itg |
Interrogative | |
neg |
Negative |
Person
Symbol | Gloss | Notes |
---|---|---|
p1 |
First person | |
p2 |
Second person | |
p3 |
Third person | |
impers |
Impersonal | Sometimes called 'autonomous' |
Derivations
Possession
Symbol | Gloss | Notes |
---|---|---|
caus |
Causative | |
ingr |
Ingressive | https://nn.wikipedia.org/w/index.php?title=Ingressiv |
Symbol | Gloss | Notes |
---|---|---|
px1sg |
First person singular possessive | e.g. in Turkic languages |
px2sg |
Second person singular possessive | e.g. in Turkic languages |
px3sg |
Third person singular possessive | e.g. in Turkic languages |
px1pl |
First person plural possessive | e.g. in Turkic languages |
px2pl |
Second person plural possessive | e.g. in Turkic languages |
px3pl |
Third person plural possessive | e.g. in Turkic languages |
px3sp |
Third person possessive singular/plural | e.g. in Turkic languages |
Proper nouns
Symbol | Gloss | Notes |
---|---|---|
ant |
Anthroponym | wikipedia |
top |
Toponym | In some language pairs without the locative case this may be loc. Although this should be changed. wikipedia |
hyd |
Hydronym | wikipedia |
cog |
Cognomen | In normal use, surnames |
org |
Organisation | |
al |
Altres | Other, misc. |
Animacy
Symbol | Gloss | Notes |
---|---|---|
aa |
Animate | |
an |
Animate / inanimate | |
nn |
Inanimate |
Adjectives
Symbol | Gloss | Notes |
---|---|---|
sint |
Synthetic | "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. wikipedia |
pst |
Positive | |
comp |
Comparative | wikipedia |
sup |
Superlative | wikipedia |
attr |
Attributive | wikipedia |
pred |
Predicative | wikipedia |
preadj |
Pre-adjective | for languages where most of adjectives are after the noun (ex: French in eo->fr bidix) |
preadj_nh |
Pre-adjective if not human | according to the noun, the adjective is before or after |
Symbol | Gloss | Notes |
---|---|---|
tn |
Tónico | |
detnt |
Neuter determiner | |
predet |
Pre determiner | |
atn |
Atónico | |
qnt |
Quantifier | |
ord |
Ordinal | |
obj |
Object | |
subj |
Subject | |
pro |
Proclitic | |
enc |
Enclitic | |
acr |
Acronym | |
rel |
Relative | |
ind |
Indefinite | |
itg |
Interrogative | |
dem |
Demonstrative | |
def |
Definite | |
pos |
Possesive | |
ref |
Reflexive | |
prx |
Proximate | |
dst |
Distal |
Chunk tags
Tag | Description |
---|---|
<SN> |
Noun phrase / noun group (sintagma nominal) |
<SA> |
Adjective phrase / adjective group |
<SV> |
Verb phrase / verb group (sintagma verbal) |
XML tags
Note: All XML tags are explained in depth in the PDF documentation, see also the dix.dtd/dix.rng files in lttoolbox (svn).
XML tag | Means | Appears in XML tags / notes / examples |
---|---|---|
<dictionary> |
Mono- or bilingual dictionary | In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix |
<alphabet> |
Set of characters in the language | In <dictionary>
|
<sdefs> |
Symbol definitions | In <dictionary>
|
<sdef> |
Symbol definition | In <sdefs> . Ex: <sdef n="noun"/>
|
<pardefs> |
Paradigm definitions | In <dictionary> .
|
<pardef> |
Paradigm definition | In <pardefs> .
|
<section> |
A section of the dictionary | In <dictionary> . Ex: <section id="main" type="standard">
|
<e> |
A dictionary entry (a word) | In <section> and in <pardef> .
|
<i> |
Invariant (left and right side) | In <e> . Ex.: <i>beer</i>
|
<p> |
A pair | In <e> .
|
<l> |
Left side (surface form) | In <p> . Ex.: <l>beer</l>
|
<r> |
Right side (lexical unit) | In <p> . Ex.: <r>beer<s n="noun"/><s n="singular"/></r>
|
<s> |
A lexical symbol (noun, adj..) | In <r> , <l> and <i> . Ex.: <s n="noun"/>
|
<a> |
Post-generator wake-up mark | In <r> , <l> and <i> . Ex.: <l><a/>a<s ... (for the a/an rule in English)
|
<b> |
Blank space | In <r> , <l> and <i> . Ex.: <l>you're<b/>welcome<s ...
|
TODO: Probably there are more. --Jacob Nordfalk 14:47, 25 August 2008 (UTC)
Other tags:
<j/> (in stream format #) is to mark multiwords <t/> and <v/> are only in crossdix t = template, v = variable t matches any single tag, v is like + in regexes (0 or more) <sa/> and <prm/> are only used in metadixes. 'sa' lets you add n optional extra tag, prm is an extra string for the paradigm
Transfer
<clip> tag
See the documentation (pdf), p.144 for more information.
XML attribute value | Means | Appears in attribute | Notes |
---|---|---|---|
whole |
lemma and grammatical symbols | part | |
lem |
lemma | part | |
lemh |
(inflected) head word of multiword | part | |
lemq |
following queue of multiword | part |