Difference between revisions of "List of symbols"

From Apertium
Jump to navigation Jump to search
(columns and a little update)
Line 1: Line 1:
 
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags.
 
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags.
   
[[Liste des symboles|En français]] · [[Список символов|по-русски]]
+
[[Liste de symboles|En français]] · [[Список символов|по-русски]]
   
 
{{TOCD}}
 
{{TOCD}}
Line 33: Line 33:
 
| <code>preadv</code> || Pre-adverb ||
 
| <code>preadv</code> || Pre-adverb ||
 
|-
 
|-
| <code>postadv</code> || Post-adverb ||
+
| <code>postadv</code> || Post-adverb ||
 
|-
 
|-
| <code>mod</code> || Модальное слово || [http://dic.academic.ru/dic.nsf/lingvistic/749]
+
| <code>mod</code> || Модальное слово || [http://dic.academic.ru/dic.nsf/lingvistic/749]
 
|-
 
|-
 
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia]
 
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia]
Line 55: Line 55:
 
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia]
 
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia]
 
|-
 
|-
| <code>sent</code> || Sentence-ending punctuation || e.g. full stop, question mark
+
| <code>sent</code> || Sentence-ending punctuation || e.g. full stop, question mark
 
|-
 
|-
| <code>cm</code> || Comma punctuation || ,
+
| <code>cm</code> || Comma punctuation || ,
 
|-
 
|-
| <code>lquot</code> || Left quote || «
+
| <code>lquot</code> || Left quote || «
|-
 
| <code>rquot</code> || Right quote || »
 
 
|-
 
|-
 
| <code>rquot</code> || Right quote || »
 
|-
 
|}
 
|}
   
Line 80: Line 80:
 
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages
 
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages
 
|-
 
|-
| <code>mi</code> || Masculine (inanimate) || Mostly in Slavic languages
+
| <code>mi</code> || Masculine (inanimate) || Mostly in Slavic languages
 
|-
 
|-
| <code>mp</code> || Masculine (personal) || in Polish
+
| <code>mp</code> || Masculine (personal) || in Polish
 
|-
 
|-
| <code>mn</code> || Masculine and Neuter ||
+
| <code>mn</code> || Masculine and Neuter ||
 
|-
 
|-
| <code>fn</code> || Feminine and Neuter ||
+
| <code>fn</code> || Feminine and Neuter ||
 
|-
 
|-
 
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages.
 
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages.
Line 92: Line 92:
 
| <code>mf</code> || Masculine , feminine || This is used where the gender can be either masculine or feminine
 
| <code>mf</code> || Masculine , feminine || This is used where the gender can be either masculine or feminine
 
|-
 
|-
| <code>mfn</code> || Masculine , feminine , neuter || This is used where the gender can be either masculine, feminine or neuter
+
| <code>mfn</code> || Masculine , feminine , neuter || This is used where the gender can be either masculine, feminine or neuter
 
|-
 
|-
 
| <code>un</code> || Common, neuter || As above, only common or neuter
 
| <code>un</code> || Common, neuter || As above, only common or neuter
 
|-
 
|-
 
| <code>GD</code> || Gender to be determined ||
 
| <code>GD</code> || Gender to be determined ||
|-
 
|}
 
 
===Count/Mass===
 
 
{|class=wikitable
 
! Symbol !! Gloss !! Notes
 
|-
 
| <code>cnt</code> || Countable ||
 
|-
 
| <code>unc</code> || Uncountable (mass) ||
 
 
|-
 
|-
 
|}
 
|}
Line 122: Line 111:
 
| <code>du</code> || Dual ||
 
| <code>du</code> || Dual ||
 
|-
 
|-
| <code>ct</code> || Count || see mk-bg
+
| <code>ct</code> || Count || see mk-bg
 
|-
 
|-
| <code>coll</code> || Collective ||
+
| <code>coll</code> || Collective ||
 
|-
 
|-
 
| <code>sp</code> || Singular , plural ||
 
| <code>sp</code> || Singular , plural ||
Line 130: Line 119:
 
| <code>ND</code> || Number to be determined ||
 
| <code>ND</code> || Number to be determined ||
 
|-
 
|-
 
|}
  +
 
===Count/Mass===
  +
 
{|class=wikitable
 
! Symbol !! Gloss !! Notes
 
|-
 
| <code>cnt</code> || Countable ||
 
|-
 
| <code>unc</code> || Uncountable (mass) ||
  +
|-
 
|}
 
|}
   
Line 149: Line 149:
 
| <code>voc</code> || Vocative ||
 
| <code>voc</code> || Vocative ||
 
|-
 
|-
| <code>abl</code> || Ablative ||
+
| <code>abl</code> || Ablative || [http://en.wikipedia.org/wiki/Ablative wikipedia]
 
|-
 
|-
 
| <code>ins</code> || Instrumental || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia]
 
| <code>ins</code> || Instrumental || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia]
 
|-
 
|-
 
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia]
 
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia]
|-
 
| <code>abl</code> || Ablative || [http://en.wikipedia.org/wiki/Ablative wikipedia]
 
 
|-
 
|-
 
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia]
 
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia]
 
|-
 
|-
| <code>tra</code> || Translative ||
+
| <code>tra</code> || Translative ||
 
|-
 
|-
| <code>ill</code> || Illative ||
+
| <code>ill</code> || Illative ||
 
|-
 
|-
| <code>ine</code> || Inessive ||
+
| <code>ine</code> || Inessive ||
 
|-
 
|-
| <code>ade</code> || Adessive ||
+
| <code>ade</code> || Adessive ||
 
|-
 
|-
| <code>all</code> || Allative ||
+
| <code>all</code> || Allative ||
 
|-
 
|-
| <code>abe</code> || Abessive ||
+
| <code>abe</code> || Abessive ||
 
|-
 
|-
| <code>ess</code> || Essive ||
+
| <code>ess</code> || Essive ||
 
|-
 
|-
| <code>par</code> || Partitive ||
+
| <code>par</code> || Partitive ||
 
|-
 
|-
| <code>dis</code> || Distributive ||
+
| <code>dis</code> || Distributive ||
 
|-
 
|-
 
| <code>com</code> || Comitative ||
 
| <code>com</code> || Comitative ||
 
|-
 
|-
| <code>soc</code> || Sociative ||
+
| <code>soc</code> || Sociative ||
 
|-
 
|-
| <code>prl</code> || Prolative ||
+
| <code>prl</code> || Prolative ||
 
|}
 
|}
   
Line 191: Line 189:
 
| <code>actv</code> || Active voice ||
 
| <code>actv</code> || Active voice ||
 
|-
 
|-
| <code>pasv</code>,<code>pass</code> || Passive voice || {{tag|pass}} is more used in Turkic, {{tag|pasv}} in Germanic.
+
| <code>pass</code> || Passive voice || is more used in Turkic.
 
|-
 
| <code>pasv</code> || Passive voice || is more used in Germanic.
 
|-
 
|-
 
| <code>midv</code> || Middle voice ||
 
| <code>midv</code> || Middle voice ||
Line 254: Line 254:
 
|}
 
|}
   
===Derivations===
+
===Person===
  +
 
{|class=wikitable
 
{|class=wikitable
 
! Symbol !! Gloss !! Notes
 
! Symbol !! Gloss !! Notes
 
|-
 
|-
| <code>caus</code> || Causative ||
+
| <code>p1</code> || First person ||
 
|-
 
|-
| <code>ingr</code> || Ingressive || https://nn.wikipedia.org/w/index.php?title=Ingressiv
+
| <code>p2</code> || Second person ||
 
|-
 
| <code>p3</code> || Third person ||
 
|-
 
| <code>impers</code> || Impersonal || Sometimes called 'autonomous'
 
|-
 
|-
 
|}
 
|}
  +
  +
===Derivations===
 
{|class=wikitable
 
! Symbol !! Gloss !! Notes
 
|-
 
| <code>caus</code> || Causative ||
 
|-
 
| <code>ingr</code> || Ingressive || https://nn.wikipedia.org/w/index.php?title=Ingressiv
 
|-|}
   
 
===Possession===
 
===Possession===
Line 268: Line 282:
 
! Symbol !! Gloss !! Notes
 
! Symbol !! Gloss !! Notes
 
|-
 
|-
| <code>px1sg</code> || First person singular possessive || e.g. in [[Turkic languages]]
+
| <code>px1sg</code> || First person singular possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px2sg</code> || Second person singular possessive || e.g. in [[Turkic languages]]
+
| <code>px2sg</code> || Second person singular possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px3sg</code> || Third person singular possessive || e.g. in [[Turkic languages]]
+
| <code>px3sg</code> || Third person singular possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px1pl</code> || First person plural possessive || e.g. in [[Turkic languages]]
+
| <code>px1pl</code> || First person plural possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px2pl</code> || Second person plural possessive || e.g. in [[Turkic languages]]
+
| <code>px2pl</code> || Second person plural possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px3pl</code> || Third person plural possessive || e.g. in [[Turkic languages]]
+
| <code>px3pl</code> || Third person plural possessive || e.g. in [[Turkic languages]]
 
|-
 
|-
| <code>px3sp</code> || Third person possessive singular/plural || e.g. in [[Turkic languages]]
+
| <code>px3sp</code> || Third person possessive singular/plural || e.g. in [[Turkic languages]]
 
|-
 
|-
 
|}
 
|}
Line 300: Line 314:
 
|-
 
|-
 
| <code>al</code> || Altres || Other, misc.
 
| <code>al</code> || Altres || Other, misc.
|}
 
 
===Person===
 
 
{|class=wikitable
 
! Symbol !! Gloss !! Notes
 
|-
 
| <code>p1</code> || First person ||
 
|-
 
| <code>p2</code> || Second person ||
 
|-
 
| <code>p3</code> || Third person ||
 
|-
 
| <code>impers</code> || Impersonal || Sometimes called 'autonomous'
 
|-
 
 
|}
 
|}
   
Line 346: Line 345:
 
|-
 
|-
 
| <code>pred</code> || Predicative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia]
 
| <code>pred</code> || Predicative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia]
 
|-
  +
| <code>preadj</code> || Pre-adjective || for languages where most of adjectives are after the noun (ex: French in eo->fr bidix)
 
|-
  +
| <code>preadj_nh</code> || Pre-adjective if not human || according to the noun, the adjective is before or after
 
|-
 
|-
 
|}
 
|}
   
 
{| class="wikitable" border="1"
 
{| class="wikitable" border="1"
! Symbol !! Gloss !! Notes
+
! Symbol !! Gloss !! Notes
 
|-
 
|-
| <code>tn</code> || Tónico
+
| <code>tn</code> || Tónico ||
 
|-
 
|-
| <code>detnt</code> || Neuter determiner
+
| <code>detnt</code> || Neuter determiner ||
 
|-
 
|-
| <code>predet</code> || Pre determiner
+
| <code>predet</code> || Pre determiner ||
 
|-
 
|-
| <code>atn</code> || Atónico
+
| <code>atn</code> || Atónico ||
 
|-
 
|-
| <code>qnt</code> || Quantifier
+
| <code>qnt</code> || Quantifier ||
 
|-
 
|-
| <code>ord</code> || Ordinal
+
| <code>ord</code> || Ordinal ||
 
|-
 
|-
| <code>obj</code> || Object
+
| <code>obj</code> || Object ||
 
|-
 
|-
| <code>subj</code> || Subject
+
| <code>subj</code> || Subject ||
 
|-
 
|-
| <code>pro</code> || Proclitic
+
| <code>pro</code> || Proclitic ||
 
|-
 
|-
| <code>enc</code> || Enclitic
+
| <code>enc</code> || Enclitic ||
 
|-
 
|-
| <code>acr</code> || Acronym
+
| <code>acr</code> || Acronym ||
 
|-
 
|-
| <code>rel</code> || Relative
+
| <code>rel</code> || Relative ||
 
|-
 
|-
| <code>ind</code> || Indefinite
+
| <code>ind</code> || Indefinite ||
 
|-
 
|-
| <code>itg</code> || Interrogative
+
| <code>itg</code> || Interrogative ||
 
|-
 
|-
| <code>dem</code> || Demonstrative
+
| <code>dem</code> || Demonstrative ||
 
|-
 
|-
| <code>def</code> || Definite
+
| <code>def</code> || Definite ||
 
|-
 
|-
| <code>pos</code> || Possesive
+
| <code>pos</code> || Possesive ||
 
|-
 
|-
| <code>ref</code> || Reflexive
+
| <code>ref</code> || Reflexive ||
 
|-
 
|-
| <code>prx</code> || Proximate
+
| <code>prx</code> || Proximate ||
 
|-
 
|-
| <code>dst</code> || Distal
+
| <code>dst</code> || Distal ||
 
|}
 
|}
   
Line 414: Line 417:
   
 
{|class=wikitable
 
{|class=wikitable
! XML tag !! Means !! Appears in XML tags / notes / examples
+
! XML tag !! Means !! Appears in XML tags / notes / examples
 
|-
 
|-
| <code><dictionary></code> || Mono- or bilingual dictionary || In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix
+
| <code><dictionary></code> || Mono- or bilingual dictionary || In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix
 
|-
 
|-
| <code><alphabet></code> || Set of characters in the language|| In <code><dictionary></code>
+
| <code><alphabet></code> || Set of characters in the language|| In <code><dictionary></code>
 
|-
 
|-
| <code><sdefs></code> || Symbol definitions || In <code>&lt;dictionary></code>
+
| <code><sdefs></code> || Symbol definitions || In <code>&lt;dictionary></code>
 
|-
 
|-
 
| <code><sdef></code> || Symbol definition || In <code>&lt;sdefs></code>. Ex: <code>&lt;sdef n="noun"/></code>
 
| <code><sdef></code> || Symbol definition || In <code>&lt;sdefs></code>. Ex: <code>&lt;sdef n="noun"/></code>
 
|-
 
|-
| <code><pardefs></code> || Paradigm definitions || In <code>&lt;dictionary></code>.
+
| <code><pardefs></code> || Paradigm definitions || In <code>&lt;dictionary></code>.
 
|-
 
|-
| <code><pardef></code> || Paradigm definition || In <code>&lt;pardefs></code>.
+
| <code><pardef></code> || Paradigm definition || In <code>&lt;pardefs></code>.
 
|-
 
|-
| <code><section> </code> || A section of the dictionary || In <code>&lt;dictionary></code>. Ex: <code>&lt;section id="main" type="standard"></code>
+
| <code><section> </code> || A section of the dictionary || In <code>&lt;dictionary></code>. Ex: <code>&lt;section id="main" type="standard"></code>
 
|-
 
|-
| <code>&lt;e></code> || A dictionary entry (a word) || In <code><section></code> and in <code>&lt;pardef></code>.
+
| <code>&lt;e></code> || A dictionary entry (a word) || In <code><section></code> and in <code>&lt;pardef></code>.
 
|-
 
|-
| <code>&lt;i></code> || Invariant (left and right side) || In <code>&lt;e></code>. Ex.: <code>&lt;i>beer&lt;/i></code>
+
| <code>&lt;i></code> || Invariant (left and right side) || In <code>&lt;e></code>. Ex.: <code>&lt;i>beer&lt;/i></code>
 
|-
 
|-
 
| <code>&lt;p></code> || A pair || In <code><e></code>.
 
| <code>&lt;p></code> || A pair || In <code><e></code>.
Line 469: Line 472:
   
 
{|class=wikitable
 
{|class=wikitable
! XML attribute value !! Means !! Appears in attribute || Notes
+
! XML attribute value !! Means !! Appears in attribute || Notes
 
|-
 
|-
| <code>whole</code> || lemma and grammatical symbols || part
+
| <code>whole</code> || lemma and grammatical symbols || part
 
|-
 
|-
| <code>lem</code> || lemma || part
+
| <code>lem</code> || lemma || part
 
|-
 
|-
| <code>lemh</code> || (inflected) head word of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part
+
| <code>lemh</code> || (inflected) head word of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part
 
|-
 
|-
| <code>lemq</code> || following queue of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part
+
| <code>lemq</code> || following queue of [[Chunking:_A_full_example#Handling_of_multiwords_with_inner_inflection|multiword]] || part
 
|-
 
|-
 
|}
 
|}

Revision as of 21:46, 11 June 2016

This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags.

En français · по-русски

This is meant to be a glossary of symbol names in alphabetical order with notes. Some of these names are specific to particular packages or language pairs, as not all languages have the same grammatical features (most don't have spatial distinction in articles for example).

If you were wondering what the symbols #, /, @, +, ~ or * mean, read Apertium stream format.

Part-of-speech Categories

Symbol Gloss Notes
n Noun see 'np' for proper noun
vblex Standard verb see also: vbser, vbhaver, vbmod, vaux
vbmod Modal verb
vbser Verb "to be" from ser (to be)
vbhaver Verb "to have" from haver (to have)
vaux Auxilliary verb wikipedia
adj Adjective
post Postposition
adv Adverb
preadv Pre-adverb
postadv Post-adverb
mod Модальное слово [1]
det Determiner wikipedia
prn Pronoun wikipedia
pr Preposition wikipedia
num Numeral
np Proper noun From nom propi wikipedia
ij Interjection wikipedia
cnjcoo Co-ordinating conjunction wikipedia
cnjsub Sub-ordinating conjunction
cnjadv Conjunctive adverb wikipedia
sent Sentence-ending punctuation e.g. full stop, question mark
cm Comma punctuation ,
lquot Left quote «
rquot Right quote »

Part-of-speech Sub-categories

Gender

Symbol Gloss Notes
f Feminine
m Masculine
nt Neuter
ma Masculine (animate) Mostly in Slavic languages
mi Masculine (inanimate) Mostly in Slavic languages
mp Masculine (personal) in Polish
mn Masculine and Neuter
fn Feminine and Neuter
ut Common From utrum, found in Scandinavian languages.
mf Masculine , feminine This is used where the gender can be either masculine or feminine
mfn Masculine , feminine , neuter This is used where the gender can be either masculine, feminine or neuter
un Common, neuter As above, only common or neuter
GD Gender to be determined

Number

Symbol Gloss Notes
sg Singular
pl Plural
du Dual
ct Count see mk-bg
coll Collective
sp Singular , plural
ND Number to be determined

Count/Mass

Symbol Gloss Notes
cnt Countable
unc Uncountable (mass)

Case

Symbol Gloss Notes
nom Nominative
acc Accusative
dat Dative
gen Genitive
dg Dative and Genitive in ro-es, discouraged in new developments
voc Vocative
abl Ablative wikipedia
ins Instrumental wikipedia
loc Locative wikipedia
prp Prepositional wikipedia
tra Translative
ill Illative
ine Inessive
ade Adessive
all Allative
abe Abessive
ess Essive
par Partitive
dis Distributive
com Comitative
soc Sociative
prl Prolative

Voice

Symbol Gloss Notes
actv Active voice
pass Passive voice is more used in Turkic.
pasv Passive voice is more used in Germanic.
midv Middle voice
nactv Non-active voice See Albanian.

Tense and mode

Symbol Gloss Notes
pres Present
pret Preterite
past Past
imp Imperative
inf Infinitive
pp Past participle wikipedia
pp2 Past participle (???) It's at least used in the Esperanto dictionaries for future active participles, ont (seems quite odd)
pp3 Past participle (???) It's at least used in the Esperanto dictionaries for past active participles, int (seems quite odd)
pprs Present participle Also appears as ppres (deprecated)
ger Gerund wikipedia
supn Supine wikipedia
pri Present indicative see also: pres. wikipedia
pii Imperfect from Pretério imperfecto de indicativo
fti Future indicative
fts Future subjunctive
cni Conditional
plu Pluperfect In cy-en
pmp Pluperfect In es-gl (from Pluscamperfecto)
prs Present subjunctive wikipedia
pis Imperfect subjunctive
ifi Past definite from Pretério perfecto o indefinido
aff Affirmative
itg Interrogative
neg Negative

Person

Symbol Gloss Notes
p1 First person
p2 Second person
p3 Third person
impers Impersonal Sometimes called 'autonomous'

Derivations

Possession

Symbol Gloss Notes
caus Causative
ingr Ingressive https://nn.wikipedia.org/w/index.php?title=Ingressiv
Symbol Gloss Notes
px1sg First person singular possessive e.g. in Turkic languages
px2sg Second person singular possessive e.g. in Turkic languages
px3sg Third person singular possessive e.g. in Turkic languages
px1pl First person plural possessive e.g. in Turkic languages
px2pl Second person plural possessive e.g. in Turkic languages
px3pl Third person plural possessive e.g. in Turkic languages
px3sp Third person possessive singular/plural e.g. in Turkic languages

Proper nouns

Symbol Gloss Notes
ant Anthroponym wikipedia
top Toponym In some language pairs without the locative case this may be loc. Although this should be changed. wikipedia
hyd Hydronym wikipedia
cog Cognomen In normal use, surnames
org Organisation
al Altres Other, misc.

Animacy

Symbol Gloss Notes
aa Animate
an Animate / inanimate
nn Inanimate

Adjectives

Symbol Gloss Notes
sint Synthetic "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. wikipedia
pst Positive
comp Comparative wikipedia
sup Superlative wikipedia
attr Attributive wikipedia
pred Predicative wikipedia
preadj Pre-adjective for languages where most of adjectives are after the noun (ex: French in eo->fr bidix)
preadj_nh Pre-adjective if not human according to the noun, the adjective is before or after
Symbol Gloss Notes
tn Tónico
detnt Neuter determiner
predet Pre determiner
atn Atónico
qnt Quantifier
ord Ordinal
obj Object
subj Subject
pro Proclitic
enc Enclitic
acr Acronym
rel Relative
ind Indefinite
itg Interrogative
dem Demonstrative
def Definite
pos Possesive
ref Reflexive
prx Proximate
dst Distal

Chunk tags

Tag Description
<SN> Noun phrase / noun group (sintagma nominal)
<SA> Adjective phrase / adjective group
<SV> Verb phrase / verb group (sintagma verbal)

XML tags

Note: All XML tags are explained in depth in the PDF documentation, see also the dix.dtd/dix.rng files in lttoolbox (svn).

XML tag Means Appears in XML tags / notes / examples
<dictionary> Mono- or bilingual dictionary In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix
<alphabet> Set of characters in the language In <dictionary>
<sdefs> Symbol definitions In <dictionary>
<sdef> Symbol definition In <sdefs>. Ex: <sdef n="noun"/>
<pardefs> Paradigm definitions In <dictionary>.
<pardef> Paradigm definition In <pardefs>.
<section> A section of the dictionary In <dictionary>. Ex: <section id="main" type="standard">
<e> A dictionary entry (a word) In <section> and in <pardef>.
<i> Invariant (left and right side) In <e>. Ex.: <i>beer</i>
<p> A pair In <e>.
<l> Left side (surface form) In <p>. Ex.: <l>beer</l>
<r> Right side (lexical unit) In <p>. Ex.: <r>beer<s n="noun"/><s n="singular"/></r>
<s> A lexical symbol (noun, adj..) In <r>, <l> and <i>. Ex.: <s n="noun"/>
<a> Post-generator wake-up mark In <r>, <l> and <i>. Ex.: <l><a/>a<s ... (for the a/an rule in English)
<b> Blank space In <r>, <l> and <i>. Ex.: <l>you're<b/>welcome<s ...

TODO: Probably there are more. --Jacob Nordfalk 14:47, 25 August 2008 (UTC)

Other tags:

<j/> (in stream format #) is to mark multiwords

<t/> and <v/> are only in crossdix
t = template, v = variable
t matches any single tag, v is like + in regexes (0 or more)

<sa/> and <prm/> are only used in metadixes.
'sa' lets you add n optional extra tag, prm is an extra string for the paradigm

Transfer

<clip> tag

See the documentation (pdf), p.144 for more information.

XML attribute value Means Appears in attribute Notes
whole lemma and grammatical symbols part
lem lemma part
lemh (inflected) head word of multiword part
lemq following queue of multiword part

See also