ESSLLI CG course
Jump to navigation
Jump to search
Contents
Using Apertium to analyse esslli-cg-relevant languages[edit]
You may either use the Vislcg3 cg lab, or you may download the Apertium resources.
Use the Vislcg3 lab and download the Apertium cg file[edit]
Here is the file:
- Download the file, and upload it to the visl lab page.
- Write in a sentence and test the result
- Change the rlx file to the better
Download the whole catalogue from Apertium[edit]
Serbo-Croatian[edit]
SVN: https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-sh-mk
$ echo "Radovanović je ušla u povijest kao prva žena-pilot u Srbiji." | lt-proc sh-mk.automorf.bin | cg-conv -a "<Radovanović>" "Radovanović" np cog f sg nom "Radovanović" np cog f sg gen "Radovanović" np cog f sg dat "Radovanović" np cog f sg acc "Radovanović" np cog f sg loc "Radovanović" np cog f sg ins "Radovanović" np cog f sg voc "Radovanović" np cog f pl nom "Radovanović" np cog f pl gen "Radovanović" np cog f pl dat "Radovanović" np cog f pl acc "Radovanović" np cog f pl loc "Radovanović" np cog f pl ins "Radovanović" np cog f pl voc "Radovanović" np cog ma sg nom "Radovanović" np cog ma sg voc "<je>" "biti" vbser clt pres p3 sg "clitic" prn pers clt p3 f sg gen "clitic" prn pers clt p3 f sg acc "<ušla>" "ući" vblex perf iv lp nt pl "ući" vblex perf iv lp f sg "ući" vblex perf iv lp m pl "ući" vblex perf iv opt nt pl "ući" vblex perf iv opt f sg "ući" vblex perf iv opt m pl "<u>" "u" pr "<povijest>" "povijest" n f sg nom "povijest" n f sg acc "<kao>" "kao" adv "<prva>" "prvi" num ord nt pl nom "prvi" num ord nt pl acc "prvi" num ord nt pl voc "prvi" num ord f sg nom "prvi" num ord f sg voc "<žena>" "žena" n f sg nom "žena" n f pl gen "<->" "-" guio "<pilot>" "pilot" n ma sg nom "<u>" "u" pr "<Srbiji>" "Srbija" np top f sg dat "Srbija" np top f sg loc "<.>" "." sent
$ echo "Radovanović je ušla u povijest kao prva žena-pilot u Srbiji." | lt-proc sh-mk.automorf.bin | cg-conv -a | vislcg3 --trace --grammar apertium-sh-mk.sh-mk.rlx "<Radovanović>" "Radovanović" np cog f sg nom SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl nom REMOVE:219 ; "Radovanović" np cog f sg gen SELECT:222:Nom+Perfect ; "Radovanović" np cog f sg dat SELECT:222:Nom+Perfect ; "Radovanović" np cog f sg acc SELECT:222:Nom+Perfect ; "Radovanović" np cog f sg loc SELECT:222:Nom+Perfect ; "Radovanović" np cog f sg ins SELECT:222:Nom+Perfect ; "Radovanović" np cog f sg voc SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl gen SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl dat SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl acc SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl loc SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl ins SELECT:222:Nom+Perfect ; "Radovanović" np cog f pl voc SELECT:222:Nom+Perfect ; "Radovanović" np cog ma sg nom SELECT:222:Nom+Perfect ; "Radovanović" np cog ma sg voc SELECT:222:Nom+Perfect "<je>" "biti" vbser clt pres p3 sg ; "clitic" prn pers clt p3 f sg gen REMOVE:182:jeIsNotGenitive ; "clitic" prn pers clt p3 f sg acc REMOVE:185:jeIsNotAccusative "<ušla>" "ući" vblex perf iv lp f sg SELECT:200:PerfectNotOptative SELECT:206:l-participleNumber ; "ući" vblex perf iv opt nt pl SELECT:200:PerfectNotOptative ; "ući" vblex perf iv opt f sg SELECT:200:PerfectNotOptative ; "ući" vblex perf iv opt m pl SELECT:200:PerfectNotOptative ; "ući" vblex perf iv lp nt pl SELECT:200:PerfectNotOptative SELECT:206:l-participleNumber ; "ući" vblex perf iv lp m pl SELECT:200:PerfectNotOptative SELECT:206:l-participleNumber "<u>" "u" pr "<povijest>" "povijest" n f sg acc SELECT:308:Pr-Acc-chain ; "povijest" n f sg nom SELECT:308:Pr-Acc-chain "<kao>" "kao" adv "<prva>" "prvi" num ord f sg nom SELECT:237:GenNumNext SELECT:245 ; "prvi" num ord nt pl nom SELECT:237:GenNumNext ; "prvi" num ord nt pl acc SELECT:237:GenNumNext ; "prvi" num ord nt pl voc SELECT:237:GenNumNext ; "prvi" num ord f sg voc SELECT:237:GenNumNext SELECT:245 "<žena>" "žena" n f sg nom SELECT:243:AtrrNum->NNum ; "žena" n f pl gen SELECT:243:AtrrNum->NNum "<->" "-" guio "<pilot>" "pilot" n ma sg nom "<u>" "u" pr "<Srbiji>" "Srbija" np top f sg loc SELECT:317:Pr-Loc-chain ; "Srbija" np top f sg dat SELECT:317:Pr-Loc-chain "<.>" "." sent
Turkish[edit]
SVN: https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tr-az
Trmorph: http://www.let.rug.nl/~coltekin/trmorph/ (you need the Apertium branch, see Trmorph)
$ echo "Kovayla bira içerim, ama sen bilmezsin." | hfst-proc tr-az.automorf.hfst | cg-conv -a "<Kovayla>" "Kova" n ins "Kova" n ins 3s "<bira>" "bira" n "bira" n 3s "<içerim>" "iç" v t_aor 1s "içeri" n 1s "içeri" n p1s "içeri" n p1s 3s "<,>" "," pnct "<ama>" "ama" cnjadv "ama" cnjcoo "ama" n "ama" n 3s "<sen>" "sen" prn pers "sen" prn pers 3s "<bilmezsin>" "bil" v neg t_aor 2s "<.>" "." pnct
$ echo "Kovayla bira içerim, ama sen bilmezsin." | hfst-proc tr-az.automorf.hfst | cg-conv -a | vislcg3 --trace --grammar apertium-tr-az.tr-az.rlx "<Kovayla>" "Kova" n ins ; "Kova" n ins 3s REMOVE:521 "<bira>" "bira" n ; "bira" n 3s REMOVE:521 "<içerim>" "iç" v t_aor 1s "içeri" n 1s ; "içeri" n p1s 3s REMOVE:521 ; "içeri" n p1s REMOVE:531 "<,>" "," pnct "<ama>" "ama" cnjadv "ama" cnjcoo "ama" n ; "ama" n 3s REMOVE:521 "<sen>" "sen" prn pers ; "sen" prn pers 3s REMOVE:521 "<bilmezsin>" "bil" v neg t_aor 2s "<.>" "." pnct
Bulgarian[edit]
SVN: https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg
$ echo "Айтоска река е река в България." | lt-proc bg-mk.automorf.bin | cg-conv -a "<Айтоска>" "*Айтоска" "<река>" "река" n f sg nom ind "рече" vblex perf tv pres p1 sg "<е>" "е" vbser pres p3 sg "<река>" "река" n f sg nom ind "рече" vblex perf tv pres p1 sg "<в>" "в" pr "<България>" "България" np top f sg nom "<.>" "." sent
$ echo "Айтоска река е река в България." | lt-proc bg-mk.automorf.bin | cg-conv -a | vislcg3 --trace --grammar apertium-mk-bg.bg-mk.rlx "<Айтоска>" "*Айтоска" "<река>" "река" n f sg nom ind ; "рече" vblex perf tv pres p1 sg REMOVE:63:r2 "<е>" "е" vbser pres p3 sg "<река>" "река" n f sg nom ind "рече" vblex perf tv pres p1 sg "<в>" "в" pr "<България>" "България" np top f sg nom "<.>" "." sent
Slovenian[edit]
SVN: https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-sl-es
$ echo "Od tukaj se vidi vse mesto." | lt-proc sl-es.automorf.bin | cg-conv -a Codepage UTF-8, locale en_US_POSIX Converting Apertium to VISL... Warning: No soft or hard delimiters defined in grammar. Hard limit of 500 cohorts may break windows in unintended places. "<Od>" "Od" pr "Oda" n f du gen "Oda" n f pl gen "<tukaj>" "tukaj" adv "<se>" "se" prn ref mf sp loc "se" prn ref mf sp ins "se" prn ref mf sp gen "se" prn ref mf sp acc "<vidi>" "*vidi" "<vse>" "vse" adv "ves" prn ind m pl acc "ves" prn ind f sg gen "ves" prn ind f pl nom "ves" prn ind f pl acc "ves" prn ind nt sg nom "ves" prn ind nt sg acc "ves" det ind m pl acc "ves" det ind f sg gen "ves" det ind f pl nom "ves" det ind f pl acc "ves" det ind nt sg nom "ves" det ind nt sg acc "<mesto>" "mesto" n nt sg nom "mesto" n nt sg acc "<.>" "." sent