Difference between revisions of "List of symbols"
TommiPirinen (talk | contribs) |
TommiPirinen (talk | contribs) (univerasals) |
||
Line 1: | Line 1: | ||
[Liste de symboles|En français]] · [[Список символов|по-русски]] |
|||
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags. |
This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags. |
||
Line 11: | Line 11: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal POS |
||
|- |
|- |
||
| <code>n</code> || Noun || ''see 'np' for proper noun'' |
| <code>n</code> || Noun || ''see 'np' for proper noun'' || NOUN |
||
|- |
|- |
||
| <code>vblex</code> || Standard ("lexical") verb || ''see also: vbser, vbhaver, vbmod, vaux'' |
| <code>vblex</code> || Standard ("lexical") verb || ''see also: vbser, vbhaver, vbmod, vaux'' || VERB |
||
|- |
|- |
||
| <code>v</code> || Standard verb || shortened form of vblex, often used in agglutinative languages |
| <code>v</code> || Standard verb || shortened form of vblex, often used in agglutinative languages || VERB |
||
|- |
|- |
||
| <code>vbmod</code> || Modal verb || |
| <code>vbmod</code> || Modal verb || || VERB |
||
|- |
|- |
||
| <code>vbser</code> || Verb "to be" || from ''ser'' (to be) |
| <code>vbser</code> || Verb "to be" || from ''ser'' (to be) || VERB (or AUX) |
||
|- |
|- |
||
| <code>vbhaver</code> || Verb "to have" || from ''haver'' (to have) |
| <code>vbhaver</code> || Verb "to have" || from ''haver'' (to have) || VERB |
||
|- |
|- |
||
| <code>vaux</code> || Auxiliary verb || [http://en.wikipedia.org/wiki/Auxilliary_verb wikipedia] |
| <code>vaux</code> || Auxiliary verb || [http://en.wikipedia.org/wiki/Auxilliary_verb wikipedia] || AUX |
||
|- |
|- |
||
| <code>cop</code> || Copula || [http://en.wikipedia.org/wiki/Copula_(linguistics) wikipedia]; sometimes verb-like, sometimes not |
| <code>cop</code> || Copula || [http://en.wikipedia.org/wiki/Copula_(linguistics) wikipedia]; sometimes verb-like, sometimes not || AUX, ... |
||
|- |
|- |
||
| <code>adj</code> || Adjective || |
| <code>adj</code> || Adjective || || ADJ |
||
|- |
|- |
||
| <code>post</code> || Postposition || |
| <code>post</code> || Postposition || || ADP |
||
|- |
|- |
||
| <code>adv</code> || Adverb || |
| <code>adv</code> || Adverb || || ADV |
||
|- |
|- |
||
| <code>preadv</code> || Pre-adverb || |
| <code>preadv</code> || Pre-adverb || || ADV |
||
|- |
|- |
||
| <code>postadv</code> || Post-adverb || |
| <code>postadv</code> || Post-adverb || || ADV |
||
|- |
|- |
||
| <code>mod</code> || Modal word || [http://dic.academic.ru/dic.nsf/lingvistic/749] |
| <code>mod</code> || Modal word || [http://dic.academic.ru/dic.nsf/lingvistic/749] || PART |
||
|- |
|- |
||
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia] |
| <code>det</code> || Determiner || [http://en.wikipedia.org/wiki/Determiner_(class) wikipedia] || DET |
||
|- |
|- |
||
| <code>prn</code> || Pronoun || [http://en.wikipedia.org/wiki/Pronoun wikipedia] |
| <code>prn</code> || Pronoun || [http://en.wikipedia.org/wiki/Pronoun wikipedia] || PRON |
||
|- |
|- |
||
| <code>pr</code> || Preposition || [http://en.wikipedia.org/wiki/Preposition wikipedia] |
| <code>pr</code> || Preposition || [http://en.wikipedia.org/wiki/Preposition wikipedia] || ADP |
||
|- |
|- |
||
| <code>num</code> || Numeral || |
| <code>num</code> || Numeral || || NUM |
||
|- |
|- |
||
| <code>np</code> || Proper noun || From ''nom propi'' [http://en.wikipedia.org/wiki/Proper_noun wikipedia] |
| <code>np</code> || Proper noun || From ''nom propi'' [http://en.wikipedia.org/wiki/Proper_noun wikipedia] || PROPN |
||
|- |
|- |
||
| <code>ij</code> || Interjection || [http://en.wikipedia.org/wiki/Interjection wikipedia] |
| <code>ij</code> || Interjection || [http://en.wikipedia.org/wiki/Interjection wikipedia] || INTJ |
||
|- |
|- |
||
| <code>cnjcoo</code> || Co-ordinating conjunction || [http://en.wikipedia.org/wiki/Co-ordinating_conjunction wikipedia] |
| <code>cnjcoo</code> || Co-ordinating conjunction || [http://en.wikipedia.org/wiki/Co-ordinating_conjunction wikipedia] || CCONJ |
||
|- |
|- |
||
| <code>cnjsub</code> || Sub-ordinating conjunction || |
| <code>cnjsub</code> || Sub-ordinating conjunction || || SCONJ |
||
|- |
|- |
||
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia] |
| <code>cnjadv</code> || Conjunctive adverb || [http://en.wikipedia.org/wiki/Conjunctive_adverb wikipedia] || SCONJ, ADV |
||
|- |
|- |
||
| <code>sent</code> || Sentence-ending punctuation || e.g. full stop, question mark |
| <code>sent</code> || Sentence-ending punctuation || e.g. full stop, question mark || PUNCT |
||
|- |
|- |
||
| <code>cm</code> || Comma punctuation || , |
| <code>cm</code> || Comma punctuation || , || PUNCT |
||
|- |
|- |
||
| <code>lquot</code> || Left quote || « |
| <code>lquot</code> || Left quote || « || PUNCT |
||
|- |
|- |
||
| <code>rquot</code> || Right quote || » |
| <code>rquot</code> || Right quote || » || PUNCT |
||
|- |
|- |
||
| <code>lpar</code> || Left parenthesis || ( |
| <code>lpar</code> || Left parenthesis || ( || PUNCT |
||
|- |
|- |
||
| <code>rpar</code> || Right parenthesis || ) |
| <code>rpar</code> || Right parenthesis || ) || PUNCT |
||
|- |
|- |
||
|} |
|} |
||
Line 80: | Line 80: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal featurs |
||
|- |
|- |
||
| <code>f</code> || Feminine || |
| <code>f</code> || Feminine || || Gender=Fem |
||
|- |
|- |
||
| <code>m</code> || Masculine || |
| <code>m</code> || Masculine || || Gender=Masc |
||
|- |
|- |
||
| <code>nt</code> || Neuter || |
| <code>nt</code> || Neuter || || Gender=Neut |
||
|- |
|- |
||
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages |
| <code>ma</code> || Masculine (animate) || Mostly in Slavic languages || Gender=Masc |
||
|- |
|- |
||
| <code>mi</code> || Masculine (inanimate) || Mostly in Slavic languages |
| <code>mi</code> || Masculine (inanimate) || Mostly in Slavic languages || Gender=Masc |
||
|- |
|- |
||
| <code>mp</code> || Masculine (personal) || in Polish |
| <code>mp</code> || Masculine (personal) || in Polish || Gender=Masc |
||
|- |
|- |
||
| <code>mn</code> || Masculine or neuter || |
| <code>mn</code> || Masculine or neuter || || Gender=Masc,Neut |
||
|- |
|- |
||
| <code>fn</code> || Feminine or neuter || |
| <code>fn</code> || Feminine or neuter || || Gender=Fem,Neut |
||
|- |
|- |
||
| <code>mf</code> || Masculine or feminine || This is used where the gender can be either masculine or feminine |
| <code>mf</code> || Masculine or feminine || This is used where the gender can be either masculine or feminine || Gender=Masc,Fem |
||
|- |
|- |
||
| <code>mfn</code> || Masculine , feminine , neuter || This is used where the gender can be either masculine, feminine or neuter |
| <code>mfn</code> || Masculine , feminine , neuter || This is used where the gender can be either masculine, feminine or neuter || Gender=Masc,Fem,Neut |
||
|- |
|- |
||
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages. |
| <code>ut</code> || Common || From ''utrum'', found in Scandinavian languages. || Gender=Com |
||
|- |
|- |
||
| <code>un</code> || Common or neuter || As above, only common or neuter |
| <code>un</code> || Common or neuter || As above, only common or neuter || Gender=Com,Neut |
||
|- |
|- |
||
| <code>GD</code> || Gender to be determined || |
| <code>GD</code> || Gender to be determined || |
||
Line 115: | Line 115: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>cnt</code> || Countable || |
| <code>cnt</code> || Countable || |
||
Line 128: | Line 128: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>aa</code> || Animate || |
| <code>aa</code> || Animate || |
||
Line 141: | Line 141: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>sint</code> || Synthetic || "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. [http://en.wikipedia.org/wiki/Synthetic_language wikipedia] |
| <code>sint</code> || Synthetic || "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. [http://en.wikipedia.org/wiki/Synthetic_language wikipedia] |
||
Line 154: | Line 154: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>pers</code> || Personal || |
| <code>pers</code> || Personal || || PronType=Prs |
||
|- |
|- |
||
| <code>tn</code> || Tónico || |
| <code>tn</code> || Tónico || |
||
|- |
|- |
||
| <code>detnt</code> || Neuter determiner || POS? |
| <code>detnt</code> || Neuter determiner || POS? || DET |
||
|- |
|- |
||
| <code>predet</code> || Pre determiner || POS? |
| <code>predet</code> || Pre determiner || POS? || DET |
||
|- |
|- |
||
| <code>atn</code> || Atónico || |
| <code>atn</code> || Atónico || |
||
|- |
|- |
||
| <code>qnt</code> || Quantifier || |
| <code>qnt</code> || Quantifier || || PronType=Ind |
||
|- |
|- |
||
| <code>ord</code> || Ordinal || |
| <code>ord</code> || Ordinal || || NumType=Ord |
||
|- |
|- |
||
| <code>obj</code> || Object || |
| <code>obj</code> || Object || |
||
Line 178: | Line 178: | ||
| <code>enc</code> || Enclitic || |
| <code>enc</code> || Enclitic || |
||
|- |
|- |
||
| <code>acr</code> || Acronym || Not Pronuon? |
| <code>acr</code> || Acronym || Not Pronuon? || Abbr=Yes |
||
|- |
|- |
||
| <code>rel</code> || Relative || |
| <code>rel</code> || Relative || || PronType=Rel |
||
|- |
|- |
||
| <code>ind</code> || Indefinite || |
| <code>ind</code> || Indefinite || || PronType=Ind |
||
|- |
|- |
||
| <code>itg</code> || Interrogative || |
| <code>itg</code> || Interrogative || || PronType=Int |
||
|- |
|- |
||
| <code>dem</code> || Demonstrative || |
| <code>dem</code> || Demonstrative || || PronType=Dem |
||
|- |
|- |
||
| <code>def</code> || Definite || |
| <code>def</code> || Definite || |
||
|- |
|- |
||
| <code>pos</code> || Possessive || |
| <code>pos</code> || Possessive || || Poss=Yes |
||
|- |
|- |
||
| <code>ref</code> || Reflexive || |
| <code>ref</code> || Reflexive || || Reflex=Yes |
||
|- |
|- |
||
| <code>prx</code> || Proximate || |
| <code>prx</code> || Proximate || |
||
Line 204: | Line 204: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>tv</code> || Transitive || takes direct object in accusative case (used in Turkic) |
| <code>tv</code> || Transitive || takes direct object in accusative case (used in Turkic) |
||
Line 219: | Line 219: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>sg</code> || Singular || |
| <code>sg</code> || Singular || || Number=Sing |
||
|- |
|- |
||
| <code>pl</code> || Plural || |
| <code>pl</code> || Plural || || Number=Plur |
||
|- |
|- |
||
| <code>sp</code> || Singular or plural || |
| <code>sp</code> || Singular or plural || || Number=Sing,Plur |
||
|- |
|- |
||
| <code>du</code> || Dual || |
| <code>du</code> || Dual || || Number=Dual |
||
|- |
|- |
||
| <code>ct</code> || Count || see mk-bg |
| <code>ct</code> || Count || see mk-bg || Number=Count |
||
|- |
|- |
||
| <code>coll</code> || Collective || |
| <code>coll</code> || Collective || || Number=Coll |
||
|- |
|- |
||
| <code>ND</code> || Number to be determined || |
| <code>ND</code> || Number to be determined || |
||
Line 241: | Line 241: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>nom</code> || Nominative || |
| <code>nom</code> || Nominative || || Case=Nom |
||
|- |
|- |
||
| <code>acc</code> || Accusative || |
| <code>acc</code> || Accusative || || Case=Acc |
||
|- |
|- |
||
| <code>dat</code> || Dative || |
| <code>dat</code> || Dative || || Case=Dat |
||
|- |
|- |
||
| <code>gen</code> || Genitive || |
| <code>gen</code> || Genitive || || Case=Gen |
||
|- |
|- |
||
| <code>dg</code> || Dative and Genitive || in [[ro-es]], discouraged in new developments |
| <code>dg</code> || Dative and Genitive || in [[ro-es]], discouraged in new developments || Case=Dat,Gen |
||
|- |
|- |
||
| <code>voc</code> || Vocative || |
| <code>voc</code> || Vocative || || Case=Voc |
||
|- |
|- |
||
| <code>abl</code> || Ablative || [http://en.wikipedia.org/wiki/Ablative wikipedia] |
| <code>abl</code> || Ablative || [http://en.wikipedia.org/wiki/Ablative wikipedia] || Case=Abl |
||
|- |
|- |
||
| <code>ins</code> || Instrumental or Instructive || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia] |
| <code>ins</code> || Instrumental or Instructive || [http://en.wikipedia.org/wiki/Instrumental_case wikipedia] || Case=Ins |
||
|- |
|- |
||
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia] |
| <code>loc</code> || Locative || [http://en.wikipedia.org/wiki/Locative wikipedia] || Case=Loc |
||
|- |
|- |
||
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia] |
| <code>prp</code> || Prepositional || [http://en.wikipedia.org/wiki/Prepositional wikipedia] |
||
|- |
|- |
||
| <code>tra</code> || Translative || |
| <code>tra</code> || Translative || || Case=Tra |
||
|- |
|- |
||
| <code>ill</code> || Illative || |
| <code>ill</code> || Illative || || Case=Ill |
||
|- |
|- |
||
| <code>ine</code> || Inessive || |
| <code>ine</code> || Inessive || || Case=Ine |
||
|- |
|- |
||
| <code>ade</code> || Adessive || |
| <code>ade</code> || Adessive || || Case=Ade |
||
|- |
|- |
||
| <code>all</code> || Allative || |
| <code>all</code> || Allative || || Case=All |
||
|- |
|- |
||
| <code>abe</code> || Abessive || |
| <code>abe</code> || Abessive || || Case=Abe |
||
|- |
|- |
||
| <code>ess</code> || Essive || |
| <code>ess</code> || Essive || || Case=Ess |
||
|- |
|- |
||
| <code>par</code> || Partitive || |
| <code>par</code> || Partitive || || Case=Par |
||
|- |
|- |
||
| <code>dis</code> || Distributive || |
| <code>dis</code> || Distributive || || Case=Dis |
||
|- |
|- |
||
| <code>com</code> || Comitative || |
| <code>com</code> || Comitative || || Case=Com |
||
|- |
|- |
||
| <code>soc</code> || Sociative || |
| <code>soc</code> || Sociative || || |
||
|- |
|- |
||
| <code>prl</code> || Prolative || |
| <code>prl</code> || Prolative || || Case=Pro |
||
|- |
|- |
||
| <code>ses</code> || Superessive || [[Hungarian]] || |
| <code>ses</code> || Superessive || [[Hungarian]] || Case=Sup |
||
|- |
|- |
||
| <code>sub</code> || Sublative || [[Hungarian]] || |
| <code>sub</code> || Sublative || [[Hungarian]] || Case=Sub |
||
|- |
|- |
||
| <code>dela</code> || Delative || [[Hungarian]] || |
| <code>dela</code> || Delative || [[Hungarian]] || Case=Del |
||
|- |
|- |
||
| <code>term</code> || Terminative || [[Hungarian]], Estonian, ... || |
| <code>term</code> || Terminative || [[Hungarian]], Estonian, ... || |
||
Line 299: | Line 299: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>actv</code> || Active voice || |
| <code>actv</code> || Active voice || || Voice=Act |
||
|- |
|- |
||
| <code>pass</code> || Passive voice || is more used in Turkic. |
| <code>pass</code> || Passive voice || is more used in Turkic. || Voice=Pass |
||
|- |
|- |
||
| <code>pasv</code> || Passive voice || is more used in Germanic. |
| <code>pasv</code> || Passive voice || is more used in Germanic. || Voice=PAss |
||
|- |
|- |
||
| <code>midv</code> || Middle voice || |
| <code>midv</code> || Middle voice || || Voice=Mid |
||
|- |
|- |
||
| <code>nactv</code> || Non-active voice || See Albanian. |
| <code>nactv</code> || Non-active voice || See Albanian. || |
||
|- |
|- |
||
| <code>caus</code> || Causative voice || see also [[#Derivations]] |
| <code>caus</code> || Causative voice || see also [[#Derivations]] || Voice=Cau |
||
|- |
|- |
||
|} |
|} |
||
Line 318: | Line 318: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal features |
||
|- |
|- |
||
| <code>pres</code> || Present || |
| <code>pres</code> || Present || || Tense=Pres |
||
|- |
|- |
||
| <code>pret</code> || Preterite || [https://en.wikipedia.org/wiki/Preterite Preterite] |
| <code>pret</code> || Preterite || [https://en.wikipedia.org/wiki/Preterite Preterite] || Tense=Past |
||
|- |
|- |
||
| <code>past</code> || Past || |
| <code>past</code> || Past || || Tense=Past |
||
|- |
|- |
||
| <code>imp</code> || Imperative || [http://www.englishlanguageguide.com/grammar/imperative.asp englishlanguageguide] |
| <code>imp</code> || Imperative || [http://www.englishlanguageguide.com/grammar/imperative.asp englishlanguageguide] || Mood=Imp |
||
|- |
|- |
||
| <code>inf</code> || Infinitive || [https://en.wikipedia.org/wiki/Infinitive wikipedia] |
| <code>inf</code> || Infinitive || [https://en.wikipedia.org/wiki/Infinitive wikipedia] || VerbForm=Inf |
||
|- |
|- |
||
| <code>aor</code> || Aorist || [https://en.wikipedia.org/wiki/Aorist wikipedia] A tense in Turkic languages. |
| <code>aor</code> || Aorist || [https://en.wikipedia.org/wiki/Aorist wikipedia] A tense in Turkic languages. || Tense=Past |
||
|- |
|- |
||
| <code>pp</code> || Past participle || [http://en.wikipedia.org/wiki/Participle wikipedia] |
| <code>pp</code> || Past participle || [http://en.wikipedia.org/wiki/Participle wikipedia] || VerbForm=Part |
||
|- |
|- |
||
| <code>pp2</code> || Past participle (???) || It's at least used in the Esperanto dictionaries for future active participles, ''ont'' (seems quite odd) |
| <code>pp2</code> || Past participle (???) || It's at least used in the Esperanto dictionaries for future active participles, ''ont'' (seems quite odd) || |
||
|- |
|- |
||
| <code>pp3</code> || Past participle (???) || It's at least used in the Esperanto dictionaries for past active participles, ''int'' (seems quite odd) |
| <code>pp3</code> || Past participle (???) || It's at least used in the Esperanto dictionaries for past active participles, ''int'' (seems quite odd) || |
||
|- |
|- |
||
| <code>pprs</code> || Present participle || Also appears as <code>ppres</code> (deprecated) |
| <code>pprs</code> || Present participle || Also appears as <code>ppres</code> (deprecated) || VerbForm=Part |
||
|- |
|- |
||
| <code>ger</code> || Gerund || [http://en.wikipedia.org/wiki/Gerund wikipedia] |
| <code>ger</code> || Gerund || [http://en.wikipedia.org/wiki/Gerund wikipedia] || VerbForm=Ger |
||
|- |
|- |
||
| <code>supn</code> || Supine || [http://en.wikipedia.org/wiki/Supine wikipedia] |
| <code>supn</code> || Supine || [http://en.wikipedia.org/wiki/Supine wikipedia] || VerbForm=Sup |
||
|- |
|- |
||
| <code>pri</code> || Present indicative || ''see also: pres''. [http://en.wikipedia.org/wiki/Present_indicative wikipedia] |
| <code>pri</code> || Present indicative || ''see also: pres''. [http://en.wikipedia.org/wiki/Present_indicative wikipedia] || Tense=Pres Mood=Ind |
||
|- |
|- |
||
| <code>pii</code> || Imperfect || from ''Pretério imperfecto de indicativo'' [https://en.wikipedia.org/wiki/Imperfect wikipedia] |
| <code>pii</code> || Imperfect || from ''Pretério imperfecto de indicativo'' [https://en.wikipedia.org/wiki/Imperfect wikipedia] || Tense=Past Mood=Ind |
||
|- |
|- |
||
| <code>fti</code> || Future indicative || |
| <code>fti</code> || Future indicative || || Tense=Fut Mood=Ind |
||
|- |
|- |
||
| <code>fts</code> || Future subjunctive || |
| <code>fts</code> || Future subjunctive || || Tense=Fut Mood=Sub |
||
|- |
|- |
||
| <code>cni</code> || Conditional || Lot of pairs will probably use cnd or cond... |
| <code>cni</code> || Conditional || Lot of pairs will probably use cnd or cond... || Mood=Cnd |
||
|- |
|- |
||
| <code>plu</code> || Pluperfect || In <code>cy-en</code> |
| <code>plu</code> || Pluperfect || In <code>cy-en</code> || Tense=Pqp |
||
|- |
|- |
||
| <code>pmp</code> || Pluperfect || In <code>es-gl</code> (from ''Pluscamperfecto'') |
| <code>pmp</code> || Pluperfect || In <code>es-gl</code> (from ''Pluscamperfecto'') || Tense=Pqp |
||
|- |
|- |
||
| <code>prs</code> || Present subjunctive || [http://en.wikipedia.org/wiki/Present_subjunctive wikipedia] |
| <code>prs</code> || Present subjunctive || [http://en.wikipedia.org/wiki/Present_subjunctive wikipedia] || Tense=Pres Mood=Sub |
||
|- |
|- |
||
| <code>pis</code> || Imperfect subjunctive || |
| <code>pis</code> || Imperfect subjunctive || || Tense=Past Mood=Sub |
||
|- |
|- |
||
| <code>ifi</code> || Past definite || from ''Pretério perfecto o indefinido'' |
| <code>ifi</code> || Past definite || from ''Pretério perfecto o indefinido'' || Tense=Past Definite=Def |
||
|- |
|- |
||
| <code>aff</code> || Affirmative || [https://en.wikipedia.org/wiki/Affirmation_and_negation wikipedia] |
| <code>aff</code> || Affirmative || [https://en.wikipedia.org/wiki/Affirmation_and_negation wikipedia] || Polarity=Pos |
||
|- |
|- |
||
| <code>itg</code> || Interrogative || |
| <code>itg</code> || Interrogative || || |
||
|- |
|- |
||
| <code>neg</code> || Negative || |
| <code>neg</code> || Negative || || Polarity=Neg |
||
|- |
|- |
||
| <code>lp</code> || L-participle || |
| <code>lp</code> || L-participle || |
||
Line 378: | Line 378: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>p1</code> || First person || |
| <code>p1</code> || First person || || Person=1 |
||
|- |
|- |
||
| <code>p2</code> || Second person || |
| <code>p2</code> || Second person || || Person=2 |
||
|- |
|- |
||
| <code>p3</code> || Third person || |
| <code>p3</code> || Third person || || Person=3 |
||
|- |
|- |
||
| <code>impers</code> || Impersonal || Sometimes called 'autonomous' |
| <code>impers</code> || Impersonal || Sometimes called 'autonomous' || Person=0 |
||
|- |
|- |
||
|} |
|} |
||
Line 401: | Line 401: | ||
===Possession=== |
===Possession=== |
||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal feature |
||
|- |
|- |
||
| <code>px1sg</code> || First person singular possessive || e.g. in [[Turkic languages]] |
| <code>px1sg</code> || First person singular possessive || e.g. in [[Turkic languages]] || Person[psor]=1 Number[psor]=Sing |
||
|- |
|- |
||
| <code>px2sg</code> || Second person singular possessive || e.g. in [[Turkic languages]] |
| <code>px2sg</code> || Second person singular possessive || e.g. in [[Turkic languages]] || Person[psor]=2 Number[psor]=Sing |
||
|- |
|- |
||
| <code>px3sg</code> || Third person singular possessive || e.g. in [[Turkic languages]] |
| <code>px3sg</code> || Third person singular possessive || e.g. in [[Turkic languages]] || Person[psor]=3 Number[psor]=Sing |
||
|- |
|- |
||
| <code>px1pl</code> || First person plural possessive || e.g. in [[Turkic languages]] |
| <code>px1pl</code> || First person plural possessive || e.g. in [[Turkic languages]] || Person[psor]=1 Number[psor]=Plur |
||
|- |
|- |
||
| <code>px2pl</code> || Second person plural possessive || e.g. in [[Turkic languages]] |
| <code>px2pl</code> || Second person plural possessive || e.g. in [[Turkic languages]] || Person[psor]=2 Number[psor]=Plur |
||
|- |
|- |
||
| <code>px3pl</code> || Third person plural possessive || e.g. in [[Turkic languages]] |
| <code>px3pl</code> || Third person plural possessive || e.g. in [[Turkic languages]] || Person[psor]=3 Number[psor]=Plur |
||
|- |
|- |
||
| <code>px3sp</code> || Third person possessive singular or plural || e.g. in [[Turkic languages]] |
| <code>px3sp</code> || Third person possessive singular or plural || e.g. in [[Turkic languages]] || Person[psor]=3 |
||
|- |
|- |
||
|} |
|} |
||
Line 424: | Line 424: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal features |
||
|- |
|- |
||
| <code>o_sg1</code> || First person singular object || |
| <code>o_sg1</code> || First person singular object || |
||
Line 443: | Line 443: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal features |
||
|- |
|- |
||
| <code>ant</code> || Anthroponym || [http://en.wikipedia.org/wiki/Anthroponym wikipedia] |
| <code>ant</code> || Anthroponym || [http://en.wikipedia.org/wiki/Anthroponym wikipedia] |
||
Line 462: | Line 462: | ||
{|class=wikitable |
{|class=wikitable |
||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes !! Universal features |
||
|- |
|- |
||
| <code>pst</code> || Positive || |
| <code>pst</code> || Positive || || Degree=Pos |
||
|- |
|- |
||
| <code>comp</code> || Comparative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
| <code>comp</code> || Comparative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] || Degree=Comp |
||
|- |
|- |
||
| <code>sup</code> || Superlative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
| <code>sup</code> || Superlative || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] || Degree=Sup |
||
|- |
|- |
||
| <code>attr</code> || Attributive || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
| <code>attr</code> || Attributive || [http://en.wikipedia.org/wiki/Adjective#Attributive.2C_predicative.2C_absolute.2C_and_substantive_adjectives wikipedia] |
Revision as of 11:44, 21 February 2019
[Liste de symboles|En français]] · по-русски This page lists the symbols in Apertium used to denote part-of-speech and further morphological features, as well as chunk tags used for more syntactic functions, as well as XML tags.
This is meant to be a glossary of symbol names in alphabetical order with notes. Some of these names are specific to particular packages or language pairs, as not all languages have the same grammatical features (most don't have spatial distinction in articles for example).
If you were wondering what the symbols #, /, @, +, ~ or * mean, read Apertium stream format.
Part-of-speech Categories
Symbol | Gloss | Notes | Universal POS |
---|---|---|---|
n |
Noun | see 'np' for proper noun | NOUN |
vblex |
Standard ("lexical") verb | see also: vbser, vbhaver, vbmod, vaux | VERB |
v |
Standard verb | shortened form of vblex, often used in agglutinative languages | VERB |
vbmod |
Modal verb | VERB | |
vbser |
Verb "to be" | from ser (to be) | VERB (or AUX) |
vbhaver |
Verb "to have" | from haver (to have) | VERB |
vaux |
Auxiliary verb | wikipedia | AUX |
cop |
Copula | wikipedia; sometimes verb-like, sometimes not | AUX, ... |
adj |
Adjective | ADJ | |
post |
Postposition | ADP | |
adv |
Adverb | ADV | |
preadv |
Pre-adverb | ADV | |
postadv |
Post-adverb | ADV | |
mod |
Modal word | [1] | PART |
det |
Determiner | wikipedia | DET |
prn |
Pronoun | wikipedia | PRON |
pr |
Preposition | wikipedia | ADP |
num |
Numeral | NUM | |
np |
Proper noun | From nom propi wikipedia | PROPN |
ij |
Interjection | wikipedia | INTJ |
cnjcoo |
Co-ordinating conjunction | wikipedia | CCONJ |
cnjsub |
Sub-ordinating conjunction | SCONJ | |
cnjadv |
Conjunctive adverb | wikipedia | SCONJ, ADV |
sent |
Sentence-ending punctuation | e.g. full stop, question mark | PUNCT |
cm |
Comma punctuation | , | PUNCT |
lquot |
Left quote | « | PUNCT |
rquot |
Right quote | » | PUNCT |
lpar |
Left parenthesis | ( | PUNCT |
rpar |
Right parenthesis | ) | PUNCT |
Part-of-speech Sub-categories
Gender
These tags are usually used with nouns, and things that agree/concord with nouns (like adjectives and verbs).
Symbol | Gloss | Notes | Universal featurs |
---|---|---|---|
f |
Feminine | Gender=Fem | |
m |
Masculine | Gender=Masc | |
nt |
Neuter | Gender=Neut | |
ma |
Masculine (animate) | Mostly in Slavic languages | Gender=Masc |
mi |
Masculine (inanimate) | Mostly in Slavic languages | Gender=Masc |
mp |
Masculine (personal) | in Polish | Gender=Masc |
mn |
Masculine or neuter | Gender=Masc,Neut | |
fn |
Feminine or neuter | Gender=Fem,Neut | |
mf |
Masculine or feminine | This is used where the gender can be either masculine or feminine | Gender=Masc,Fem |
mfn |
Masculine , feminine , neuter | This is used where the gender can be either masculine, feminine or neuter | Gender=Masc,Fem,Neut |
ut |
Common | From utrum, found in Scandinavian languages. | Gender=Com |
un |
Common or neuter | As above, only common or neuter | Gender=Com,Neut |
GD |
Gender to be determined |
Count/Mass
These tags are usually used with nouns, and things that agree/concord with nouns (like adjectives and verbs).
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
cnt |
Countable | ||
unc |
Uncountable (mass) |
Animacy
These tags are usually used with nouns, and things that agree/concord with nouns (like adjectives and verbs).
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
aa |
Animate | ||
an |
Animate or inanimate | ||
nn |
Inanimate |
Adjectives
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
sint |
Synthetic | "nice, nicer, nicest" is synthetic. "handsome, more handsome, the most handsome" is not. wikipedia | |
preadj |
Pre-adjective | for languages where most of adjectives are after the noun (ex: French in eo->fr bidix) | |
preadj_nh |
Pre-adjective if not human | according to the noun, the adjective is before or after |
Pronoun types
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
pers |
Personal | PronType=Prs | |
tn |
Tónico | ||
detnt |
Neuter determiner | POS? | DET |
predet |
Pre determiner | POS? | DET |
atn |
Atónico | ||
qnt |
Quantifier | PronType=Ind | |
ord |
Ordinal | NumType=Ord | |
obj |
Object | ||
subj |
Subject | ||
pro |
Proclitic | ||
enc |
Enclitic | ||
acr |
Acronym | Not Pronuon? | Abbr=Yes |
rel |
Relative | PronType=Rel | |
ind |
Indefinite | PronType=Ind | |
itg |
Interrogative | PronType=Int | |
dem |
Demonstrative | PronType=Dem | |
def |
Definite | ||
pos |
Possessive | Poss=Yes | |
ref |
Reflexive | Reflex=Yes | |
prx |
Proximate | ||
dst |
Distal |
Transitivity
Used for verbs.
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
tv |
Transitive | takes direct object in accusative case (used in Turkic) | |
iv |
Intransitive | does not take direct object in accusative case (used in Turkic) | |
TD |
Transitivity to be determined | if the sub-category is [currently] unknown |
Inflectional morphology
Number
Note: number can be a sub-category tag too, e.g. with pronouns.
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
sg |
Singular | Number=Sing | |
pl |
Plural | Number=Plur | |
sp |
Singular or plural | Number=Sing,Plur | |
du |
Dual | Number=Dual | |
ct |
Count | see mk-bg | Number=Count |
coll |
Collective | Number=Coll | |
ND |
Number to be determined |
Case
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
nom |
Nominative | Case=Nom | |
acc |
Accusative | Case=Acc | |
dat |
Dative | Case=Dat | |
gen |
Genitive | Case=Gen | |
dg |
Dative and Genitive | in ro-es, discouraged in new developments | Case=Dat,Gen |
voc |
Vocative | Case=Voc | |
abl |
Ablative | wikipedia | Case=Abl |
ins |
Instrumental or Instructive | wikipedia | Case=Ins |
loc |
Locative | wikipedia | Case=Loc |
prp |
Prepositional | wikipedia | |
tra |
Translative | Case=Tra | |
ill |
Illative | Case=Ill | |
ine |
Inessive | Case=Ine | |
ade |
Adessive | Case=Ade | |
all |
Allative | Case=All | |
abe |
Abessive | Case=Abe | |
ess |
Essive | Case=Ess | |
par |
Partitive | Case=Par | |
dis |
Distributive | Case=Dis | |
com |
Comitative | Case=Com | |
soc |
Sociative | ||
prl |
Prolative | Case=Pro | |
ses |
Superessive | Hungarian | Case=Sup |
sub |
Sublative | Hungarian | Case=Sub |
dela |
Delative | Hungarian | Case=Del |
term |
Terminative | Hungarian, Estonian, ... |
Voice
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
actv |
Active voice | Voice=Act | |
pass |
Passive voice | is more used in Turkic. | Voice=Pass |
pasv |
Passive voice | is more used in Germanic. | Voice=PAss |
midv |
Middle voice | Voice=Mid | |
nactv |
Non-active voice | See Albanian. | |
caus |
Causative voice | see also #Derivations | Voice=Cau |
Tense and mode
Symbol | Gloss | Notes | Universal features |
---|---|---|---|
pres |
Present | Tense=Pres | |
pret |
Preterite | Preterite | Tense=Past |
past |
Past | Tense=Past | |
imp |
Imperative | englishlanguageguide | Mood=Imp |
inf |
Infinitive | wikipedia | VerbForm=Inf |
aor |
Aorist | wikipedia A tense in Turkic languages. | Tense=Past |
pp |
Past participle | wikipedia | VerbForm=Part |
pp2 |
Past participle (???) | It's at least used in the Esperanto dictionaries for future active participles, ont (seems quite odd) | |
pp3 |
Past participle (???) | It's at least used in the Esperanto dictionaries for past active participles, int (seems quite odd) | |
pprs |
Present participle | Also appears as ppres (deprecated) |
VerbForm=Part |
ger |
Gerund | wikipedia | VerbForm=Ger |
supn |
Supine | wikipedia | VerbForm=Sup |
pri |
Present indicative | see also: pres. wikipedia | Tense=Pres Mood=Ind |
pii |
Imperfect | from Pretério imperfecto de indicativo wikipedia | Tense=Past Mood=Ind |
fti |
Future indicative | Tense=Fut Mood=Ind | |
fts |
Future subjunctive | Tense=Fut Mood=Sub | |
cni |
Conditional | Lot of pairs will probably use cnd or cond... | Mood=Cnd |
plu |
Pluperfect | In cy-en |
Tense=Pqp |
pmp |
Pluperfect | In es-gl (from Pluscamperfecto) |
Tense=Pqp |
prs |
Present subjunctive | wikipedia | Tense=Pres Mood=Sub |
pis |
Imperfect subjunctive | Tense=Past Mood=Sub | |
ifi |
Past definite | from Pretério perfecto o indefinido | Tense=Past Definite=Def |
aff |
Affirmative | wikipedia | Polarity=Pos |
itg |
Interrogative | ||
neg |
Negative | Polarity=Neg | |
lp |
L-participle |
Person
Note: person can be a sub-category tag, e.g. with pronouns.
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
p1 |
First person | Person=1 | |
p2 |
Second person | Person=2 | |
p3 |
Third person | Person=3 | |
impers |
Impersonal | Sometimes called 'autonomous' | Person=0 |
Derivations
Symbol | Gloss | Notes |
---|---|---|
caus |
Causative | |
ingr |
Ingressive | https://nn.wikipedia.org/w/index.php?title=Ingressiv |
Possession
Symbol | Gloss | Notes | Universal feature |
---|---|---|---|
px1sg |
First person singular possessive | e.g. in Turkic languages | Person[psor]=1 Number[psor]=Sing |
px2sg |
Second person singular possessive | e.g. in Turkic languages | Person[psor]=2 Number[psor]=Sing |
px3sg |
Third person singular possessive | e.g. in Turkic languages | Person[psor]=3 Number[psor]=Sing |
px1pl |
First person plural possessive | e.g. in Turkic languages | Person[psor]=1 Number[psor]=Plur |
px2pl |
Second person plural possessive | e.g. in Turkic languages | Person[psor]=2 Number[psor]=Plur |
px3pl |
Third person plural possessive | e.g. in Turkic languages | Person[psor]=3 Number[psor]=Plur |
px3sp |
Third person possessive singular or plural | e.g. in Turkic languages | Person[psor]=3 |
Object marking
e.g. in verbs with both
Symbol | Gloss | Notes | Universal features |
---|---|---|---|
o_sg1 |
First person singular object | ||
o_sg2 |
Second person singular object | ||
o_sg3 |
Third person singular object | ||
o_pl1 |
First person plural object | ||
o_pl2 |
Second person plural object | ||
o_pl3 |
Third person plural object |
Proper nouns
Symbol | Gloss | Notes | Universal features |
---|---|---|---|
ant |
Anthroponym | wikipedia | |
top |
Toponym | In some language pairs without the locative case this may be loc. Although this should be changed. wikipedia | |
hyd |
Hydronym | wikipedia | |
cog |
Cognomen | In normal use, surnames | |
org |
Organisation | ||
al |
Altres | Other, misc. |
Adjectives
Symbol | Gloss | Notes | Universal features |
---|---|---|---|
pst |
Positive | Degree=Pos | |
comp |
Comparative | wikipedia | Degree=Comp |
sup |
Superlative | wikipedia | Degree=Sup |
attr |
Attributive | wikipedia | |
pred |
Predicative | wikipedia |
Others
Symbol | Gloss | Notes |
---|---|---|
web |
Links and Emails |
See also
Chunk tags
Tag | Description |
---|---|
<SN> |
Noun phrase / noun group (sintagma nominal) |
<SA> |
Adjective phrase / adjective group |
<SV> |
Verb phrase / verb group (sintagma verbal) |
XML tags
Note: All XML tags are explained in depth in the PDF documentation, see also the dix.dtd and dix.rng files in the GitHub repository.
XML tag | Means | Appears in XML tags / notes / examples |
---|---|---|
<dictionary> |
Mono- or bilingual dictionary | In files apertium-eo-en.en.dix, apertium-eo-en.eo-en.dix, apertium-eo-en.post-en.dix, apertium-eo-en.post-eo.dix |
<alphabet> |
Set of characters in the language | In <dictionary>
|
<sdefs> |
Symbol definitions | In <dictionary>
|
<sdef> |
Symbol definition | In <sdefs> . Ex: <sdef n="noun"/>
|
<pardefs> |
Paradigm definitions | In <dictionary> .
|
<pardef> |
Paradigm definition | In <pardefs> .
|
<section> |
A section of the dictionary | In <dictionary> . Ex: <section id="main" type="standard">
|
<e> |
A dictionary entry (a word) | In <section> and in <pardef> .
|
<i> |
Invariant (left and right side) | In <e> . Ex.: <i>beer</i>
|
<p> |
A pair | In <e> .
|
<l> |
Left side (surface form) | In <p> . Ex.: <l>beer</l>
|
<r> |
Right side (lexical unit) | In <p> . Ex.: <r>beer<s n="noun"/><s n="singular"/></r>
|
<s> |
A lexical symbol (noun, adj..) | In <r> , <l> and <i> . Ex.: <s n="noun"/>
|
<a> |
Post-generator wake-up mark | In <r> , <l> and <i> . Ex.: <l><a/>a<s ... (for the a/an rule in English)
|
<b> |
Blank space | In <r> , <l> and <i> . Ex.: <l>you're<b/>welcome<s ...
|
TODO: Probably there are more. --Jacob Nordfalk 14:47, 25 August 2008 (UTC)
Other tags:
<j/> (in stream format #) is to mark multiwords <t/> and <v/> are only in crossdix t = template, v = variable t matches any single tag, v is like + in regexes (0 or more) <sa/> and <prm/> are only used in metadixes. 'sa' lets you add n optional extra tag, prm is an extra string for the paradigm
Transfer
<clip> tag
See the documentation (pdf), p.144 for more information.
XML attribute value | Means | Appears in attribute | Notes |
---|---|---|---|
whole |
lemma and grammatical symbols | part | |
lem |
lemma | part | |
lemh |
(inflected) head word of multiword | part | |
lemq |
following queue of multiword | part |