Irish FST tags

From Apertium
Revision as of 16:09, 24 March 2012 by Francis Tyers (talk | contribs)
Jump to navigation Jump to search
Irish FST Apertium Description
+Noun <n> noun
+Num <num> Numeral
+Abr <abbr> Abbreviation
+Adj <adj> adjective
+Adv <adv> Adverb
+VD <> ditransitive - not at present
+VF <> - form used before a word starting with a vowel or f+vowel
+Verb <v> verb
+VI <v><iv> intransitive
+VT <v><tv> transitive
+VTI <v><tv>, <v><iv> transitive, intrans., both trans & intrans
+Art <det><def> article
+Base <pst> positive / base form (changed from +Pos to +Base 10/09/03)
+Card <card> Cardinal (one two three ...)
+Cmpd Compound
+Comp <comp> comparative
+Cond <>
+Conj conjunction
+Coord <cnjcoo> co-ordinate
+Cop <cop> Copula
+Def <def> Definite
+DefArt <> noun preceeded by definite article (an)
+Deg <> with Degree Particle
+Dem <dem> Demonstrative
+Dep <> dependant forms
+Det <det> Determiner
+Dir <> Directional
+Ecl <> (+Urú) e.g. after compound prep eg ar an gcat
+Emph <emph> Emphatic (Contrastive) form of personal pronoun
+English English
+Foreign <> Foreign
+FutInd <> Future Indicative
+Gn <> General
+Idf <ind> Indefinite
+Imper <imp> - Imperative Mood - Modh Ordaitheach
+Int <> sentence internal
+Itj <ij> Interjection
+Its <> intensifiers e.g. sách, ró- etc.
+Len <> e.g. ab fhearr, ba mhó a masc noun must be either lenitedor eclipsed according to preference/dialect. If it is lenited then the adj is likewise lenited. If it is eclipsed then the adj has no initial mutation.
+NStem <> de-nominal verbal (action) noun
+Neg <neg> Negative
+NegQ <>
+NotSlen <> qualifies a plural noun ending in a broad consonantor a vowel
+Obj <> á = "do a" when obj of VN
+Op <>
+Ord <ord> Ordinal (first, second, third..) i.e. mo dhá lámh, an chéad dhá theach
+Part <part> see irregular nouns
+Past <past> copula past & conditional
+PastImp <> Gháthchaite Past Habitual (Imperfect Indicative)
+PastInd <> Past Indicative
+PastSubj <> Past Subjunctive
+Pers <pers> Personal
+Poss <pos> Possessive
+Prep <pr> Preposition
+Pres <pres> copula present & future
+PresImp <> Gháthláithreach Pres Habitual (Verb bí only)
+PresInd <> Present Indicative
+PresSubj <> Present Subjunctive
+Pro <> Pronoun with Copula
+Pron <prn> Pronoun
+Prop <np> proper
+Q <itg> Interrogative
+Qty <qnt> Quantifier
+Ref <ref> Reflexive
+Rel <rel> relative forms - direct
+RelInd <> rel. indirect
+Sbj <> sí, sé and siad are used only when pron follows predicate verb in subject position eg Chuaigh SÍ amach (She went out but Téigh gan Í (Go without her)
+Simp <> Simple
+Slender <> qualifies a plural noun ending in a slender consonant
+Strong <> same for all cases) the adj will also have the same form in all cases BUT when the adj. is qualifying a weak plural noun,in the gen.case, it is not inflected i.e. the base form of the adjective is used. Nom and Voc plurals are inflecte as for strong plural nouns
+Strong <> strong plural
+Subord <cnjsub> subordinate
+Subst <> - copula+pron+art+noun - séard (is é an rud)
+Subst <> substantive - functions like a noun, but lack noun inflections
+Temp <> Temporal e.g. inniu, amárach etc.
+Verbal <>
+Vow <> vowel-initial : used to allow past-tense Len e.g. d´ith
+Weak <> when an adj is qualifying a strong plural noun(i.e. noun plural is the
+hPref <> prefix e.g. (h)iontach
+Adj <adj> adj used in noun compound e.g. domhain+Adj+comhrá+Noun+...
+Guess <>
Case
+Voc <voc> vocative case
+Dat <dat> dative (e.g. teach)
+Com <nom> nominative case
+Gen <gen> genitive case
+Loc <loc> Locative
Person
+1P <p1> First person
+2P <p2> Second person
+3P <p3> Third person
+Auto <impers> Autonomous
Number
+Sg <sg> singular
+Pl <pl> plural
Gender
+Fem <f> feminine gender
+Masc <m> masculine gender
Punctuation
+Punct <> Abbreviation
+Fin <> sentence final
+End end bracket, quote etc
+Brack <lpar>, <rpar> round, square and curly brackets
+Bar <guio> hyphen, underscore, dash etc.
+St start bracket, quote etc
+Quo <lquot><rquot> all quotation marks double, single etc.
Speech
+Xxx <> Indecipherable speech
+Filler Filled Pause (eh, em,
+Event Simple Event (laugh, sneeze etc.)
+Cmc Communicator (yeah, y'know)
Dialect
+CC Canúint Chonnachta
+CM Canúint na Mumhan
+CM canúint na Mumhan, Munster dialect
+CU Canúint Uladh

See also