Difference between revisions of "Irish"

From Apertium
Jump to navigation Jump to search
(moved Irish to Irish FST tags: "Irish" to become a more general page.)
 
(Outline of how the sub-transducers are organized in the Irish FST)
Line 1: Line 1:
  +
== Included files ==
#REDIRECT [[Irish FST tags]]
 
  +
  +
{| class="wikitable"
  +
| '''File'''
  +
| '''Contents'''
  +
|-
  +
| abr-lex.txt
  +
| Abbreviations
  +
|-
  +
| adj-lex-cc.txt
  +
| Adjective continuation classes
  +
|-
  +
| adj-lex-fgb1.txt
  +
| Adjectives from the Foclóir Gaeilge-Béarla
  +
|-
  +
| adj-lex-fgb2.txt
  +
| Adjectives from the Foclóir Gaeilge-Béarla
  +
|-
  +
| adj-lex-fgb-eqvar.txt
  +
| Variant adjectives in Ó Dónaill
  +
|-
  +
| adj-lex-fgb-var.txt
  +
| More variant adjectives in Ó Dónaill
  +
|-
  +
| adj-lex-multi.txt
  +
| Multichar symbols for adjectives
  +
|-
  +
| adj-lex-stems2.txt
  +
| Descriptive adjective stems
  +
|-
  +
| adj-lex-stems3.txt
  +
| Descriptive adjective stems
  +
|-
  +
| adj-lex-stems4.txt
  +
| Color adjective stems
  +
|-
  +
| adj-lex-stems5.txt
  +
| manually classified adjective stems
  +
|-
  +
| adj-lex-stems-irreg.txt
  +
| irregular adjective stems
  +
|-
  +
| adj-lex-stems.txt
  +
| Descriptive adjectives
  +
|-
  +
| adv-lex.txt
  +
| Adverbs
  +
|-
  +
| art-lex.txt
  +
| Articles
  +
|-
  +
| conj-lex.txt
  +
| Conjunctions
  +
|-
  +
| det-lex.txt
  +
| Determiners
  +
|-
  +
| f-reg-tidy.txt
  +
| Replace rules
  +
|-
  +
| itj-lex.txt
  +
| Interjections
  +
|-
  +
| n-lex-cc.txt
  +
| noun continuation classes
  +
|-
  +
| n-lex-fgb1.txt
  +
| Masculine nouns
  +
|-
  +
| n-lex-fgb2.txt
  +
| Masculine nouns
  +
|-
  +
| n-lex-fgb-eqvar.txt
  +
| Variant nouns from Ó Dónaill
  +
|-
  +
| n-lex-fgb-variants.txt
  +
| Variant nouns from Ó Dónaill
  +
|-
  +
| n-lex-irreg.txt
  +
| Irregular nouns
  +
|-
  +
| n-lex-multi.txt
  +
| Multichar symbols for nouns
  +
|-
  +
| n-lex-stems2.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems3.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems4.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems5.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems6.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems.txt
  +
| Noun stems
  +
|-
  +
| np-lex-cc.txt
  +
| Continuation classes for proper nouns
  +
|-
  +
| np-lex-eire-en.txt
  +
| Placenames of Ireland, in English
  +
|-
  +
| np-lex-eire.txt
  +
| Placenames of Ireland, in Irish
  +
|-
  +
| np-lex-fam-en.txt
  +
| Family names of Ireland, in English
  +
|-
  +
| np-lex-fam.txt
  +
| Family names of Ireland, in Irish
  +
|-
  +
| np-lex-irreg.txt
  +
| Irregular proper nouns
  +
|-
  +
| np-lex-multi.txt
  +
| Multichar symbols for proper nouns
  +
|-
  +
| np-lex-org.txt
  +
| Organisations (political parties, etc.) in Ireland
  +
|-
  +
| np-lex-pers-en-new.txt
  +
| Hiberno-English anthroponyms
  +
|-
  +
| np-lex-pers.txt
  +
| Irish anthroponyms
  +
|-
  +
| np-lex-tir-en.txt
  +
| Names of countries, in English
  +
|-
  +
| np-lex-tir.txt
  +
| Names of countries
  +
|-
  +
| np-logainm-sampla-ga.txt
  +
| Some placenames in Ireland
  +
|-
  +
| n-reg-br.txt
  +
| Replace rules: broadening of slender words
  +
|-
  +
| n-reg-chg.txt
  +
| Various replace rules for Nouns and adjectives
  +
|-
  +
| n-reg-chk.txt
  +
| Removal of extraneous ^Caol and ^Lea tags for
  +
|-
  +
| n-reg-cmpd.txt
  +
| Insertion of h- prefix after certain initial consonants
  +
|-
  +
| n-reg-diph.txt
  +
| Rewriting of diphthongs
  +
|-
  +
| n-reg-init.txt
  +
| Initial mutations
  +
|-
  +
| n-reg-slen.txt
  +
| Slenderisation of broad words
  +
|-
  +
| n-reg-sync.txt
  +
| Syncopation of last unstressed syllable before endings
  +
|-
  +
| n-reg-tidy.txt
  +
| Miscellaneous tidy up
  +
|-
  +
| n-reg-vh.txt
  +
| Vowel harmony
  +
|-
  +
| n-reg-x.txt
  +
| -
  +
|-
  +
| num-lex.txt
  +
| Cardinal numbers
  +
|-
  +
| part-lex.txt
  +
| Particles
  +
|-
  +
| prep-lex.txt
  +
| Prepositions (simple, compound, emphatic)
  +
|-
  +
| pron-lex.txt
  +
| Pronouns (Indefinite, personal), pronominals
  +
|-
  +
| punct-lex.txt
  +
| Abbreviations
  +
|-
  +
| tags-lex.txt
  +
| XML/HTML-like tags
  +
|-
  +
| tobar-lex.txt
  +
| -
  +
|-
  +
| v12-lex-cc.txt
  +
| Continuation classes common to 1st and 2nd verb conjugations
  +
|-
  +
| v1-lex-cc.txt
  +
| Continuation classes for 1st verb conjugation
  +
|-
  +
| v1-lex-stems.txt
  +
| 1st conjugation verb stems
  +
|-
  +
| v2-lex-cc.txt
  +
| Continuation classes for 2nd verb conjugation
  +
|-
  +
| v2-lex-stems.txt
  +
| 2nd conjugation verb stems
  +
|-
  +
| v3-lex-stems.txt
  +
| Irregular verbs
  +
|-
  +
| v4-lex-stems.txt
  +
| Mixed verb stesm
  +
|-
  +
| v5-lex-stems.txt
  +
| Verb stems
  +
|-
  +
| va-lex-stems.txt
  +
| Verb stems from Foclóir Póca
  +
|-
  +
| v-lex-fgb1.txt
  +
| Verb stems from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb2.txt
  +
| Verb stems from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb-eqvar.txt
  +
| Verbal variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb-var.txt
  +
| Verb variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-multi.txt
  +
| Multichar symbols for verbs
  +
|-
  +
| vng-lex-stems.txt
  +
| Verbal nouns from Foclóir Póca
  +
|-
  +
| vn-lex-fgb-eqvar.txt
  +
| Verbal noun variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| vn-lex-fgb-var.txt
  +
| Verbal noun variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| vn-lex-stems.txt
  +
| Verbal noun stems
  +
|-
  +
| vnva-lex-cc.txt
  +
| Continuation classes for verbal nouns and verbal adjectives
  +
|-
  +
| vnva-lex-fgb1.txt
  +
| Verbal nouns and verbal adjectives found in noun entries
  +
|-
  +
| vnva-lex-multi.txt
  +
| Multichar symbols for verbal nouns and verbal adjectives
  +
|-
  +
| v-reg.txt
  +
| Replace rules for verbs
  +
|-
  +
| adj-lex-stems-irreg.txt
  +
| irregular adjective stems
  +
|-
  +
| adj-lex-stems.txt
  +
| Descriptive adjectives
  +
|-
  +
| adv-lex.txt
  +
| Adverbs
  +
|-
  +
| art-lex.txt
  +
| Articles
  +
|-
  +
| conj-lex.txt
  +
| Conjunctions
  +
|-
  +
| det-lex.txt
  +
| Determiners
  +
|-
  +
| f-reg-tidy.txt
  +
| Replace rules
  +
|-
  +
| itj-lex.txt
  +
| Interjections
  +
|-
  +
| n-lex-cc.txt
  +
| noun continuation classes
  +
|-
  +
| n-lex-fgb1.txt
  +
| Masculine nouns
  +
|-
  +
| n-lex-fgb2.txt
  +
| Masculine nouns
  +
|-
  +
| n-lex-fgb-eqvar.txt
  +
| Variant nouns from Ó Dónaill
  +
|-
  +
| n-lex-fgb-variants.txt
  +
| Variant nouns from Ó Dónaill
  +
|-
  +
| n-lex-irreg.txt
  +
| Irregular nouns
  +
|-
  +
| n-lex-multi.txt
  +
| Multichar symbols for nouns
  +
|-
  +
| n-lex-stems2.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems3.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems4.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems5.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems6.txt
  +
| Noun stems
  +
|-
  +
| n-lex-stems.txt
  +
| Noun stems
  +
|-
  +
| np-lex-cc.txt
  +
| Continuation classes for proper nouns
  +
|-
  +
| np-lex-eire-en.txt
  +
| Placenames of Ireland, in English
  +
|-
  +
| np-lex-eire.txt
  +
| Placenames of Ireland, in Irish
  +
|-
  +
| np-lex-fam-en.txt
  +
| Family names of Ireland, in English
  +
|-
  +
| np-lex-fam.txt
  +
| Family names of Ireland, in Irish
  +
|-
  +
| np-lex-irreg.txt
  +
| Irregular proper nouns
  +
|-
  +
| np-lex-multi.txt
  +
| Multichar symbols for proper nouns
  +
|-
  +
| np-lex-org.txt
  +
| Organisations (political parties, etc.) in Ireland
  +
|-
  +
| np-lex-pers-en-new.txt
  +
| Hiberno-English anthroponyms
  +
|-
  +
| np-lex-pers.txt
  +
| Irish anthroponyms
  +
|-
  +
| np-lex-tir-en.txt
  +
| Names of countries, in English
  +
|-
  +
| np-lex-tir.txt
  +
| Names of countries
  +
|-
  +
| np-logainm-sampla-ga.txt
  +
| Some placenames in Ireland
  +
|-
  +
| n-reg-br.txt
  +
| Replace rules: broadening of slender words
  +
|-
  +
| n-reg-chg.txt
  +
| Various replace rules for Nouns and adjectives
  +
|-
  +
| n-reg-chk.txt
  +
| Removal of extraneous ^Caol and ^Lea tags for
  +
|-
  +
| n-reg-cmpd.txt
  +
| Insertion of h- prefix after certain initial consonants
  +
|-
  +
| n-reg-diph.txt
  +
| Rewriting of diphthongs
  +
|-
  +
| n-reg-init.txt
  +
| Initial mutations
  +
|-
  +
| n-reg-slen.txt
  +
| Slenderisation of broad words
  +
|-
  +
| n-reg-sync.txt
  +
| Syncopation of last unstressed syllable before endings
  +
|-
  +
| n-reg-tidy.txt
  +
| Miscellaneous tidy up
  +
|-
  +
| n-reg-vh.txt
  +
| Vowel harmony
  +
|-
  +
| n-reg-x.txt
  +
| -
  +
|-
  +
| num-lex.txt
  +
| Cardinal numbers
  +
|-
  +
| part-lex.txt
  +
| Particles
  +
|-
  +
| prep-lex.txt
  +
| Prepositions (simple, compound, emphatic)
  +
|-
  +
| pron-lex.txt
  +
| Pronouns (Indefinite, personal), pronominals
  +
|-
  +
| punct-lex.txt
  +
| Abbreviations
  +
|-
  +
| tags-lex.txt
  +
| XML/HTML-like tags
  +
|-
  +
| tobar-lex.txt
  +
| -
  +
|-
  +
| v12-lex-cc.txt
  +
| Continuation classes common to 1st and 2nd verb conjugations
  +
|-
  +
| v1-lex-cc.txt
  +
| Continuation classes for 1st verb conjugation
  +
|-
  +
| v1-lex-stems.txt
  +
| 1st conjugation verb stems
  +
|-
  +
| v2-lex-cc.txt
  +
| Continuation classes for 2nd verb conjugation
  +
|-
  +
| v2-lex-stems.txt
  +
| 2nd conjugation verb stems
  +
|-
  +
| v3-lex-stems.txt
  +
| Irregular verbs
  +
|-
  +
| v4-lex-stems.txt
  +
| Mixed verb stesm
  +
|-
  +
| v5-lex-stems.txt
  +
| Verb stems
  +
|-
  +
| va-lex-stems.txt
  +
| Verb stems from Foclóir Póca
  +
|-
  +
| v-lex-fgb1.txt
  +
| Verb stems from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb2.txt
  +
| Verb stems from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb-eqvar.txt
  +
| Verbal variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-fgb-var.txt
  +
| Verb variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| v-lex-multi.txt
  +
| Multichar symbols for verbs
  +
|-
  +
| vng-lex-stems.txt
  +
| Verbal nouns from Foclóir Póca
  +
|-
  +
| vn-lex-fgb-eqvar.txt
  +
| Verbal noun variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| vn-lex-fgb-var.txt
  +
| Verbal noun variants from Foclóir Gaeilge-Béarla
  +
|-
  +
| vn-lex-stems.txt
  +
| Verbal noun stems
  +
|-
  +
| vnva-lex-cc.txt
  +
| Continuation classes for verbal nouns and verbal adjectives
  +
|-
  +
| vnva-lex-fgb1.txt
  +
| Verbal nouns and verbal adjectives found in noun entries
  +
|-
  +
| vnva-lex-multi.txt
  +
| Multichar symbols for verbal nouns and verbal adjectives
  +
|-
  +
| v-reg.txt
  +
| Replace rules for verbs
  +
|}
  +
  +
  +
== Tags ==
  +
A more-or-less comprehensive list of the tags used in the Irish transducer(along with their apertium equivalents) can be found [[Irish FST tags|here]].

Revision as of 22:37, 13 November 2012

Included files

File Contents
abr-lex.txt Abbreviations
adj-lex-cc.txt Adjective continuation classes
adj-lex-fgb1.txt Adjectives from the Foclóir Gaeilge-Béarla
adj-lex-fgb2.txt Adjectives from the Foclóir Gaeilge-Béarla
adj-lex-fgb-eqvar.txt Variant adjectives in Ó Dónaill
adj-lex-fgb-var.txt More variant adjectives in Ó Dónaill
adj-lex-multi.txt Multichar symbols for adjectives
adj-lex-stems2.txt Descriptive adjective stems
adj-lex-stems3.txt Descriptive adjective stems
adj-lex-stems4.txt Color adjective stems
adj-lex-stems5.txt manually classified adjective stems
adj-lex-stems-irreg.txt irregular adjective stems
adj-lex-stems.txt Descriptive adjectives
adv-lex.txt Adverbs
art-lex.txt Articles
conj-lex.txt Conjunctions
det-lex.txt Determiners
f-reg-tidy.txt Replace rules
itj-lex.txt Interjections
n-lex-cc.txt noun continuation classes
n-lex-fgb1.txt Masculine nouns
n-lex-fgb2.txt Masculine nouns
n-lex-fgb-eqvar.txt Variant nouns from Ó Dónaill
n-lex-fgb-variants.txt Variant nouns from Ó Dónaill
n-lex-irreg.txt Irregular nouns
n-lex-multi.txt Multichar symbols for nouns
n-lex-stems2.txt Noun stems
n-lex-stems3.txt Noun stems
n-lex-stems4.txt Noun stems
n-lex-stems5.txt Noun stems
n-lex-stems6.txt Noun stems
n-lex-stems.txt Noun stems
np-lex-cc.txt Continuation classes for proper nouns
np-lex-eire-en.txt Placenames of Ireland, in English
np-lex-eire.txt Placenames of Ireland, in Irish
np-lex-fam-en.txt Family names of Ireland, in English
np-lex-fam.txt Family names of Ireland, in Irish
np-lex-irreg.txt Irregular proper nouns
np-lex-multi.txt Multichar symbols for proper nouns
np-lex-org.txt Organisations (political parties, etc.) in Ireland
np-lex-pers-en-new.txt Hiberno-English anthroponyms
np-lex-pers.txt Irish anthroponyms
np-lex-tir-en.txt Names of countries, in English
np-lex-tir.txt Names of countries
np-logainm-sampla-ga.txt Some placenames in Ireland
n-reg-br.txt Replace rules: broadening of slender words
n-reg-chg.txt Various replace rules for Nouns and adjectives
n-reg-chk.txt Removal of extraneous ^Caol and ^Lea tags for
n-reg-cmpd.txt Insertion of h- prefix after certain initial consonants
n-reg-diph.txt Rewriting of diphthongs
n-reg-init.txt Initial mutations
n-reg-slen.txt Slenderisation of broad words
n-reg-sync.txt Syncopation of last unstressed syllable before endings
n-reg-tidy.txt Miscellaneous tidy up
n-reg-vh.txt Vowel harmony
n-reg-x.txt -
num-lex.txt Cardinal numbers
part-lex.txt Particles
prep-lex.txt Prepositions (simple, compound, emphatic)
pron-lex.txt Pronouns (Indefinite, personal), pronominals
punct-lex.txt Abbreviations
tags-lex.txt XML/HTML-like tags
tobar-lex.txt -
v12-lex-cc.txt Continuation classes common to 1st and 2nd verb conjugations
v1-lex-cc.txt Continuation classes for 1st verb conjugation
v1-lex-stems.txt 1st conjugation verb stems
v2-lex-cc.txt Continuation classes for 2nd verb conjugation
v2-lex-stems.txt 2nd conjugation verb stems
v3-lex-stems.txt Irregular verbs
v4-lex-stems.txt Mixed verb stesm
v5-lex-stems.txt Verb stems
va-lex-stems.txt Verb stems from Foclóir Póca
v-lex-fgb1.txt Verb stems from Foclóir Gaeilge-Béarla
v-lex-fgb2.txt Verb stems from Foclóir Gaeilge-Béarla
v-lex-fgb-eqvar.txt Verbal variants from Foclóir Gaeilge-Béarla
v-lex-fgb-var.txt Verb variants from Foclóir Gaeilge-Béarla
v-lex-multi.txt Multichar symbols for verbs
vng-lex-stems.txt Verbal nouns from Foclóir Póca
vn-lex-fgb-eqvar.txt Verbal noun variants from Foclóir Gaeilge-Béarla
vn-lex-fgb-var.txt Verbal noun variants from Foclóir Gaeilge-Béarla
vn-lex-stems.txt Verbal noun stems
vnva-lex-cc.txt Continuation classes for verbal nouns and verbal adjectives
vnva-lex-fgb1.txt Verbal nouns and verbal adjectives found in noun entries
vnva-lex-multi.txt Multichar symbols for verbal nouns and verbal adjectives
v-reg.txt Replace rules for verbs
adj-lex-stems-irreg.txt irregular adjective stems
adj-lex-stems.txt Descriptive adjectives
adv-lex.txt Adverbs
art-lex.txt Articles
conj-lex.txt Conjunctions
det-lex.txt Determiners
f-reg-tidy.txt Replace rules
itj-lex.txt Interjections
n-lex-cc.txt noun continuation classes
n-lex-fgb1.txt Masculine nouns
n-lex-fgb2.txt Masculine nouns
n-lex-fgb-eqvar.txt Variant nouns from Ó Dónaill
n-lex-fgb-variants.txt Variant nouns from Ó Dónaill
n-lex-irreg.txt Irregular nouns
n-lex-multi.txt Multichar symbols for nouns
n-lex-stems2.txt Noun stems
n-lex-stems3.txt Noun stems
n-lex-stems4.txt Noun stems
n-lex-stems5.txt Noun stems
n-lex-stems6.txt Noun stems
n-lex-stems.txt Noun stems
np-lex-cc.txt Continuation classes for proper nouns
np-lex-eire-en.txt Placenames of Ireland, in English
np-lex-eire.txt Placenames of Ireland, in Irish
np-lex-fam-en.txt Family names of Ireland, in English
np-lex-fam.txt Family names of Ireland, in Irish
np-lex-irreg.txt Irregular proper nouns
np-lex-multi.txt Multichar symbols for proper nouns
np-lex-org.txt Organisations (political parties, etc.) in Ireland
np-lex-pers-en-new.txt Hiberno-English anthroponyms
np-lex-pers.txt Irish anthroponyms
np-lex-tir-en.txt Names of countries, in English
np-lex-tir.txt Names of countries
np-logainm-sampla-ga.txt Some placenames in Ireland
n-reg-br.txt Replace rules: broadening of slender words
n-reg-chg.txt Various replace rules for Nouns and adjectives
n-reg-chk.txt Removal of extraneous ^Caol and ^Lea tags for
n-reg-cmpd.txt Insertion of h- prefix after certain initial consonants
n-reg-diph.txt Rewriting of diphthongs
n-reg-init.txt Initial mutations
n-reg-slen.txt Slenderisation of broad words
n-reg-sync.txt Syncopation of last unstressed syllable before endings
n-reg-tidy.txt Miscellaneous tidy up
n-reg-vh.txt Vowel harmony
n-reg-x.txt -
num-lex.txt Cardinal numbers
part-lex.txt Particles
prep-lex.txt Prepositions (simple, compound, emphatic)
pron-lex.txt Pronouns (Indefinite, personal), pronominals
punct-lex.txt Abbreviations
tags-lex.txt XML/HTML-like tags
tobar-lex.txt -
v12-lex-cc.txt Continuation classes common to 1st and 2nd verb conjugations
v1-lex-cc.txt Continuation classes for 1st verb conjugation
v1-lex-stems.txt 1st conjugation verb stems
v2-lex-cc.txt Continuation classes for 2nd verb conjugation
v2-lex-stems.txt 2nd conjugation verb stems
v3-lex-stems.txt Irregular verbs
v4-lex-stems.txt Mixed verb stesm
v5-lex-stems.txt Verb stems
va-lex-stems.txt Verb stems from Foclóir Póca
v-lex-fgb1.txt Verb stems from Foclóir Gaeilge-Béarla
v-lex-fgb2.txt Verb stems from Foclóir Gaeilge-Béarla
v-lex-fgb-eqvar.txt Verbal variants from Foclóir Gaeilge-Béarla
v-lex-fgb-var.txt Verb variants from Foclóir Gaeilge-Béarla
v-lex-multi.txt Multichar symbols for verbs
vng-lex-stems.txt Verbal nouns from Foclóir Póca
vn-lex-fgb-eqvar.txt Verbal noun variants from Foclóir Gaeilge-Béarla
vn-lex-fgb-var.txt Verbal noun variants from Foclóir Gaeilge-Béarla
vn-lex-stems.txt Verbal noun stems
vnva-lex-cc.txt Continuation classes for verbal nouns and verbal adjectives
vnva-lex-fgb1.txt Verbal nouns and verbal adjectives found in noun entries
vnva-lex-multi.txt Multichar symbols for verbal nouns and verbal adjectives
v-reg.txt Replace rules for verbs


Tags

A more-or-less comprehensive list of the tags used in the Irish transducer(along with their apertium equivalents) can be found here.