List of symbols

From Apertium
Revision as of 11:07, 17 November 2009 by Unhammer (talk | contribs)
Jump to navigation Jump to search

Eventually this will be a glossary of symbol names in alphabetical order with notes. Some of these names are specific to particular packages or language pairs, as not all languages have the same grammatical features (most don't have spatial distinction in articles for example).

If you were wondering what the symbols #, /, @, + or * mean, read Apertium stream format.

Categories

Symbol Gloss Notes
n Noun see 'np' for proper noun
vblex Standard verb see also: vbser, vbhaver, vbmod, vaux
vbmod Modal verb
vbser Verb "to be" from ser (to be)
vbhaver Verb "to have" from haver (to have)
vaux Auxilliary verb wikipedia
adj Adjective
adv Adverb
preadv Pre-adverb
det Determiner wikipedia
prn Pronoun wikipedia
pr Preposition wikipedia
num Numeral
np Proper noun From nom propi wikipedia
ij Interjection wikipedia
cnjcoo Co-ordinating conjunction wikipedia
cnjsub Sub-ordinating conjunction
cnjadv Conjunctive adverb wikipedia

Sub-categories

Gender

Symbol Gloss Notes
f Feminine
m Masculine
nt Neuter
mn Masculine and Neuter
fn Feminine and Neuter
ut Common From utrum, found in Scandinavian languages.
mf Masculine , feminine This is used where the gender can be either masculine or feminine
un Common, neuter As above, only common or neuter
GD Gender to be determined

Number

Symbol Gloss Notes
sg Singular
pl Plural
du Dual
sp Singular , plural
ND Number to be determined

Case

Symbol Gloss Notes
nom Nominative
acc Accusative
dat Dative
gen Genitive
dg Dative and Genitive
voc Vocative
ins Instrumental wikipedia
loc Locative wikipedia
abl Ablative wikipedia
cpr Prepositional wikipedia

Tense and mood

Symbol Gloss Notes
pres Present
past Past
imp Imperative
inf Infinitive
pp Past participle wikipedia
pp2 Past participle (???) It's at least used in the Esperanto dictionaries for future active participles, ont (seems quite odd)
pp3 Past participle (???) It's at least used in the Esperanto dictionaries for past active participles, int (seems quite odd)
pprs Present participle
ger Gerund wikipedia
pri Present indicative see also: pres. wikipedia
pii Imperfect from Pretério imperfecto de indicativo
fti Future indicative
fts Future subjunctive
cni Conditional
plu Pluperfect In cy-en
pmp Pluperfect In es-gl (from Pluscamperfecto)
prs Present subjunctive wikipedia
pis Imperfect subjunctive
ifi from Pretério perfecto o indefinido
aff Affirmative
itg Interrogative
neg Negative

Proper nouns

Symbol Gloss Notes
ant Anthroponym wikipedia
top Toponym In some language pairs without the locative case this may be loc. Although this should be changed. wikipedia
hyd Hydronym wikipedia
cog Cognomen In normal use, surnames
org Organisation
al Altres Other, misc.

Person

Symbol Gloss Notes
p1 First person
p2 Second person
p3 Third person
impers Impersonal Sometimes called 'autonomous'

Animacy

Symbol Gloss Notes
aa Animate
an Animate / inanimate
nn Inanimate


Adjectives

Symbol Gloss Notes
sint Synthetic "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. wikipedia
pve Positive
comp Comparative wikipedia
sup Superlative wikipedia
attr Attributive wikipedia
pred Predicative wikipedia
Symbol Gloss Notes
tn Tonico
detnt Neuter determiner
predet Pre determiner
atn
qnt Quantifier
ord Ordinal
obj Object
subj Subject
pro Proclitic
enc Enclitic
acr Acronym
rel Relative
ind Indefinite
itg Interrogative
dem Demonstrative
def Definite
pos Possesive
ref Reflexive
prx Proximate
dst Distal
pprep Post preposition (in Slavic languages, for pronoun changes: 'ich' -> 'nich')

XML tags

Note: All XML tags are explained in depth in the PDF documentation.

XML tag Means Appears in XML tags / notes / examples
<dictionary> Mono- or bilingual dictionary In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix
<alphabet> Set of characters in the language In <dictionary>
<sdefs> Symbol definitions In <dictionary>
<sdef> Symbol definition In <sdefs>. Ex: <sdef n="noun"/>
<pardefs> Paradigm definitions In <dictionary>.
<pardef> Paradigm definition In <pardefs>.
<section> A section of the dictionary In <dictionary>. Ex: <section id="main" type="standard">
<e> A dictionary entry (a word) In <section> and in <pardef>.
<i> Invariant (left and right side) In <e>. Ex.: <i>beer</i>
<p> A pair In <e>.
<l> Left side (surface form) In <p>. Ex.: <l>beer</l>
<r> Right side (lexical unit) In <p>. Ex.: <r>beer<s n="noun"/><s n="singular"/></r>
<s> A lexical symbol (noun, adj..) In <r>, <l> and <i>. Ex.: <s n="noun"/>

TODO: Probably there are more. --Jacob Nordfalk 14:47, 25 August 2008 (UTC)

Other tags:

<b/> is a blank (a space)

<j/> (in stream format #) is to mark multiwords

<t/> and <v/> are only in crossdix
t = template, v = variable
t matches any single tag, v is like + in regexes (0 or more)

<sa/> and <prm/> are only used in metadixes.
'sa' lets you add n optional extra tag, prm is an extra string for the paradigm

Transfer

<clip> tag

See the documentation (pdf), p.144 for more information.

XML attribute value Means Appears in attribute Notes
whole lemma and grammatical symbols part
lem lemma part
lemh (inflected) head word of multiword part
lemq following queue of multiword part