User:TommiPirinen/Conversion tables

From Apertium
< User:TommiPirinen
Revision as of 13:33, 13 July 2020 by TommiPirinen (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Scripts to convert apes

apertium2ud.sed

s/<GD>//g
s/<ND>//g
s/<TD>//g
s/<aa>/ Animacy=Anim/g
s/<abbr>//g
s/<abe>/ Case=Abe/g
s/<abl>/ Case=Abl/g
s/<acc>/ Case=Acc/g
s/<acr>/ Abbr=Yes/g
s/<actv>/ Voice=Act/g
s/<ade>/ Case=Ade/g
s/<adj>/ ADJ/g
s/<adv>/ ADV/g
s/<aff>/ Polarity=Pos/g
s/<agnt>/ /g
s/<al>//g
s/<all>/ Case=All/g
s/<an>/ Animacy=Anim,Inan/g
s/<ant>//g
s/<aor>/ Tense=Past/g
s/<apos>/ PUNCT/g
s/<atn>//g
s/<atp>//g
s/<attr>//g
s/<caus>/ Voice=Cau/g
s/<caus>//g
s/<clb>/ PUNCT/g
s/<cm>/ PUNCT PunctType=Comm/g
s/<cmp>//g
s/<cni>/ Mood=Cnd/g
s/<cnjadv>/ SCONJ, ADV/g
s/<cnjcoo>/ CCONJ/g
s/<cnjsub>/ SCONJ/g
s/<cnt>//g
s/<cog>//g
s/<coll>/ Number=Coll/g
s/<com>/ Case=Com/g
s/<comp>/ Degree=Comp/g
s/<cop>/ AUX/g
s/<ct>/ Number=Count/g
s/<dat>/ Case=Dat/g
s/<date>//g
s/<deb>//g
s/<def>//g
s/<dela>/ Case=Del/g
s/<dem>/ PronType=Dem/g
s/<det>/ DET/g
s/<detnt>/ DET/g
s/<dg>/ Case=Dat,Gen/g
s/<dis>/ Case=Dis/g
s/<dst>//g
s/<du>/ Number=Dual/g
s/<email>//g
s/<enc>//g
s/<erg>/ Case=Erg/g
s/<ess>/ Case=Ess/g
s/<expl>//g
s/<f>/ Gender=Fem/g
s/<file>//g
s/<fm>//g
s/<fn>/ Gender=Fem,Neut/g
s/<fs>//g
s/<fti>/ Tense=Fut Mood=Ind/g
s/<fts>/ Tense=Fut Mood=Sub/g
s/<fut>/ Tense=Fut/g
s/<gen>/ Case=Gen/g
s/<ger>/ VerbForm=Ger/g
s/<guio>/ PUNCT PunctType=Dash/g
s/<hab>/ Aspect=Hab/g
s/<hu>/ Animacy=Hum/g
s/<hyd>//g
s/<ifi>/ Tense=Past Definite=Def/g
s/<ij>/ INTJ/g
s/<ill>/ Case=Ill/g
s/<imp>/ Mood=Imp/g
s/<imperf>/ Aspect=Imp/g
s/<impers>/ Person=0/g
s/<impf>/ Aspect=Imp/g
s/<ind>/ PronType=Ind/g
s/<ine>/ Case=Ine/g
s/<inf>/ VerbForm=Inf/g
s/<infps>//g
s/<ingr>/ /g
s/<ins>/ Case=Ins/g
s/<itg>//g
s/<itg>/ PronType=Int/g
s/<ito>/ VerbForm=Inf/g
s/<iv>/ Subcat=Intr/g
s/<loc>/ Case=Loc/g
s/<lp>/ /g
s/<lpar>/ PUNCT PunctType=Brck PunctSide=Ini/g
s/<lquest>/ PUNCT PunctSide=Ini/g
s/<lquot>/ PUNCT PunctType=Quot PunctSide=Ini/g
s/<m>/ Gender=Masc/g
s/<ma>/ Gender=Masc/g
s/<maj>//g
s/<mf>/ Gender=Masc,Fem/g
s/<mfn>/ Gender=Masc,Fem,Neut/g
s/<mi>/ Gender=Masc/g
s/<midv>/ Voice=Mid/g
s/<min>//g
s/<mn>/ Gender=Masc,Neut/g
s/<mod>/ PART/g
s/<mon>//g
s/<mp>/ Gender=Masc/g
s/<n>/ NOUN/g
s/<nactv>/ /g
s/<neg>/ Polarity=Neg/g
s/<nn>/ Animacy=Inan/g
s/<nom>/ Case=Nom/g
s/<nonpast>/ Tense=Pres,Fut/g
s/<np>/ PROPN/g
s/<nt>/ Gender=Neut/g
s/<num>/ NUM/g
s/<o_pl1>/ /g
s/<o_pl2>/ /g
s/<o_pl3>/ /g
s/<o_sg1>/ /g
s/<o_sg2>/ /g
s/<o_sg3>/ /g
s/<obj>//g
s/<obl>/ Case=Obl/g
s/<ord>/ NumType=Ord/g
s/<org>//g
s/<p1>/ Person=1/g
s/<p2>/ Person=2/g
s/<p3>/ Person=3/g
s/<par>/ Case=Par/g
s/<pass>/ Voice=Pass/g
s/<past3p>/ /g
s/<past>/ Tense=Past/g
s/<pasv>/ Voice=PAss/g
s/<pat>//g
s/<percent>//g
s/<percent>/ PUNCT/g
s/<perf>/ Aspect=Perf/g
s/<pers>/ PronType=Prs/g
s/<pii>/ Tense=Past Mood=Ind/g
s/<pis>/ Tense=Past Mood=Sub/g
s/<pl>/ Number=Plur/g
s/<plu>/ Tense=Pqp/g
s/<pos>/ Poss=Yes/g
s/<post>/ ADP/g
s/<postadv>/ ADV/g
s/<pp2>//g
s/<pp3>//g
s/<pp>/ VerbForm=Part/g
s/<pprs>/ Tense=Pres Mood=Part/g
s/<pprs>/ VerbForm=Part/g
s/<pr>/ ADP/g
s/<preadj>//g
s/<preadj_nh>//g
s/<preadv>/ ADV/g
s/<pred>//g
s/<predet>/ DET/g
s/<pres>/ Tense=Pres/g
s/<pret>/ Tense=Past/g
s/<pri>/ Tense=Pres Mood=Ind/g
s/<prl>/ Case=Pro/g
s/<prn>/ PRON/g
s/<pro>//g
s/<prp>//g
s/<prs>/ Tense=Pres Mood=Sub/g
s/<prx>//g
s/<pst>/ Degree=Pos/g
s/<punct>/ PUNCT/g
s/<px1pl>/ Person[psor]=1 Number[psor]=Plur/g
s/<px1sg>/ Person[psor]=1 Number[psor]=Sing/g
s/<px2pl>/ Person[psor]=2 Number[psor]=Plur/g
s/<px2sg>/ Person[psor]=2 Number[psor]=Sing/g
s/<px3pl>/ Person[psor]=3 Number[psor]=Plur/g
s/<px3sg>/ Person[psor]=3 Number[psor]=Sing/g
s/<px3sp>/ e.g. in [[Turkic languages]] || Person[psor]=3/g
s/<qnt>/ PronType=Ind/g
s/<quot>/ PUNCT PunctType=Quot/g
s/<rec>//g
s/<ref>/ Reflex=Yes/g
s/<rel>/ PronType=Rel/g
s/<res>//g
s/<rpar>/ PUNCT PunctType=Brck PunctSide=Fin/g
s/<rquot>/ PUNCT PunctType=Quot PunctSide=Fin/g
s/<s_pl1>/ Number=Plur Person=1/g
s/<s_pl2>/ Number=Plur Person=2/g
s/<s_pl3>/ Number=Plur Person=3/g
s/<s_sg1>/ Number=Sing Person=1/g
s/<s_sg2>/ Number=Sing Person=2/g
s/<s_sg3>/ Number=Sing Person=3/g
s/<sent>/ PUNCT/g
s/<sep>//g
s/<ses>/ Case=Sup/g
s/<sg>/ Number=Sing/g
s/<sint>//g
s/<soc>/ /g
s/<sp>/ Number=Sing,Plur/g
s/<sub>/ Case=Sub/g
s/<subj>//g
s/<subs>/ /g
s/<sup>/ Degree=Sup/g
s/<supn>/ VerbForm=Sup/g
s/<temp>/ Case=Tem/g
s/<term>/ Case=Ter/g
s/<time>//g
s/<tn>//g
s/<top>//g
s/<tra>/ Case=Tra/g
s/<tv>/ Subcat=Tran/g
s/<un>/ Gender=Com,Neut/g
s/<unc>//g
s/<ut>/ Gender=Com/g
s/<v>/ VERB/g
s/<vaux>/ AUX/g
s/<vbdo>/ VERB/g
s/<vbhaver>/ VERB/g
s/<vblex>/ VERB/g
s/<vbmod>/ VERB/g
s/<vbser>/ VERB/g
s/<voc>/ Case=Voc/g
s/<web>//g


Apertium2unimorph.sed

From ^^ and https://github.com/unimorph/ud-compatibility/blob/master/UD_UM/UD-UniMorph.tsv

s/<GD>//g
s/<ND>//g
s/<TD>//g
s/<aa>/;ANIM/g
s/<abbr>//g
s/<abe>/;PRIV/g
s/<abl>/;AT+ABL/g
s/<acc>/;ACC/g
s/<acr>//g
s/<actv>/;ACT/g
s/<ade>/;AT+ESS/g
s/<adj>/;ADJ/g
s/<adv>/;ADV/g
s/<aff>/;POS/g
s/<agnt>/;/g
s/<al>//g
s/<all>/;AT+ALL/g
s/<an>//g
s/<ant>//g
s/<aor>/;PST/g
s/<apos>/;PUNCT/g
s/<atn>//g
s/<atp>//g
s/<attr>//g
s/<caus>/CAUS/g
s/<caus>//g
s/<clb>/;PUNCT/g
s/<cm>/;PUNCT/g
s/<cmp>//g
s/<cni>/;COND/g
s/<cnjadv>/;CONJ/g
s/<cnjcoo>/;CONJ/g
s/<cnjsub>/;CONJ/g
s/<cnt>//g
s/<cog>//g
s/<coll>/;XXX/g
s/<com>/;COM/g
s/<comp>/;CMPR/g
s/<cop>/;V/g
s/<ct>/;XXX/g
s/<dat>/;DAT/g
s/<date>//g
s/<deb>//g
s/<def>//g
s/<dela>/;XXX/g
s/<dem>/;XXX/g
s/<det>/;DET/g
s/<detnt>/;DET/g
s/<dg>//g
s/<dis>/;DIST/g
s/<dst>//g
s/<du>/;DU/g
s/<email>//g
s/<enc>//g
s/<erg>/;ERG/g
s/<ess>/;FRML/g
s/<expl>//g
s/<f>/;FEM/g
s/<file>//g
s/<fm>//g
s/<fn>//g
s/<fs>//g
s/<fti>/;FUT;IND/g
s/<fts>/;FUT;SBJV/g
s/<fut>/;FUT/g
s/<gen>/;GEN/g
s/<ger>/;V.MSDR/g
s/<guio>/;PUNCT/g
s/<hab>/;HAB/g
s/<hu>/;HUM/g
s/<hyd>//g
s/<ifi>/;PST;DEF/g
s/<ij>/;INTJ/g
s/<ill>/;IN+ALL/g
s/<imp>/;IMP/g
s/<imperf>/;IPFV/g
s/<impers>/;0/g
s/<impf>/;IPVF/g
s/<ind>//g
s/<ine>/;IN+ESS/g
s/<inf>/;NFIN/g
s/<infps>//g
s/<ingr>//g
s/<ins>/;INS/g
s/<itg>//g
s/<itg>//g
s/<ito>/;NFIN/g
s/<iv>//g
s/<loc>/;ESS/g
s/<lp>//g
s/<lpar>/;PUNCT/g
s/<lquest>/;PUNCT/g
s/<lquot>/;PUNCT/g
s/<m>/;MASC/g
s/<ma>/;MASC/g
s/<maj>//g
s/<mf>/;MASC+FEM/g
s/<mfn>/;MASC+FEM/g
s/<mi>/;MASC/g
s/<midv>/;MID/g
s/<min>//g
s/<mn>//g
s/<mod>/;PART/g
s/<mon>//g
s/<mp>/;MASC/g
s/<n>/;N/g
s/<nactv>/;/g
s/<neg>/;NEG/g
s/<nn>/;INAN/g
s/<nom>/;NOM/g
s/<nonpast>/;PRS/g
s/<np>/;PROPN/g
s/<nt>/;NEUT/g
s/<num>/;NUM/g
s/<o_pl1>//g
s/<o_pl2>//g
s/<o_pl3>//g
s/<o_sg1>//g
s/<o_sg2>//g
s/<o_sg3>//g
s/<obj>//g
s/<obl>/;Case=Obl/g
s/<ord>/;NumType=Ord/g
s/<org>//g
s/<p1>/;1/g
s/<p2>/;2/g
s/<p3>/;3/g
s/<par>/;PRT/g
s/<pass>/;PASS/g
s/<past3p>/;/g
s/<past>/;PST/g
s/<pasv>/;PASS/g
s/<pat>//g
s/<percent>//g
s/<percent>/;PUNCT/g
s/<perf>/;PFV/g
s/<pers>//g
s/<pii>/;PST;IND/g
s/<pis>/;PST;SBJV/g
s/<pl>/;PL/g
s/<plu>/;PST+PRF/g
s/<pos>//g
s/<post>/;ADP/g
s/<postadv>/;ADV/g
s/<pp2>//g
s/<pp3>//g
s/<pp>/;V.PTCP/g
s/<pprs>/;V.PTCP/g
s/<pr>/;ADP/g
s/<preadj>//g
s/<preadj_nh>//g
s/<preadv>/;ADV/g
s/<pred>//g
s/<predet>/;DET/g
s/<pres>/;PRS/g
s/<pret>/;PST/g
s/<pri>/;PRS;IND/g
s/<prl>//g
s/<prn>/;PRO/g
s/<pro>//g
s/<prp>//g
s/<prs>/;PRS;SBJV/g
s/<prx>//g
s/<pst>//g
s/<punct>/;PUNCT/g
s/<px1pl>/;PSS1P/g
s/<px1sg>/;PSS1S/g
s/<px2pl>/;PSS2P/g
s/<px2sg>/;PSS2S/g
s/<px3pl>/;PSS3P/g
s/<px3sg>/;PSS3S/g
s/<px3sp>/;PSS3PM/g
s/<qnt>//g
s/<quot>/;PUNCT/g
s/<rec>//g
s/<ref>/;REF1/g
s/<rel>//g
s/<res>//g
s/<rpar>/;PUNCT/g
s/<rquot>/;PUNCT/g
s/<s_pl1>/;PL;1/g
s/<s_pl2>/;PL;2/g
s/<s_pl3>/;PL;3/g
s/<s_sg1>/;SG;1/g
s/<s_sg2>/;SG;2/g
s/<s_sg3>/;SG;3/g
s/<sent>/;PUNCT/g
s/<sep>//g
s/<ses>/;ON+ESS/g
s/<sg>/;SG/g
s/<sint>//g
s/<soc>/;/g
s/<sp>/;SG+PL/g
s/<sub>/;ON+ALL/g
s/<subj>//g
s/<subs>//g
s/<sup>/;RL/g
s/<supn>/;V.CVB/g
s/<temp>/;XXX/g
s/<term>/;XXX/g
s/<time>//g
s/<tn>//g
s/<top>//g
s/<tra>/;TRANS/g
s/<tv>//g
s/<un>/;MASC+FEM/g
s/<unc>//g
s/<ut>/;MASC+FEM/g
s/<v>/;V/g
s/<vaux>/;V/g
s/<vbdo>/;V/g
s/<vbhaver>/;V/g
s/<vblex>/;V/g
s/<vbmod>/;V/g
s/<vbser>/;V/g
s/<voc>/;VOC/g
s/<web>//g