Udmurt and Komi

This page covers apertium-udm-kpv, the Udmurt (udm) and Komi (kpv) language pair.

Monolingual transducers

Installation

Monolingual dependencies

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

Then run source ~/.bashrc so that the current shell picks up the new variables.
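Before building the language packages, it can help to confirm that both variables are set and point at real directories. The following is a minimal sketch, assuming a POSIX shell; check_giella_env is a hypothetical helper name introduced here, not part of the Giella tooling.

```shell
# Sketch: verify that GIELLA_CORE and GIELLA_SHARED are set and point at directories.
# check_giella_env is a hypothetical helper, not part of the Giella tooling.
check_giella_env() {
    rc=0
    for name in GIELLA_CORE GIELLA_SHARED; do
        # Look up the value of the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set" >&2
            rc=1
        elif [ ! -d "$value" ]; then
            echo "$name points at a missing directory: $value" >&2
            rc=1
        else
            echo "$name=$value"
        fi
    done
    return $rc
}
```

Calling check_giella_env in a fresh terminal prints both paths when the setup is correct and returns a non-zero status otherwise.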

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
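The four checkouts above repeat the same sequence: check out, autogen, configure, make. The following is a minimal sketch of a wrapper for that sequence, assuming a POSIX shell with svn available; run, build_giella and DRY_RUN are hypothetical names introduced for this example and are not part of the Giella infrastructure.

```shell
# Sketch: wrap the repeated checkout/build steps in one function.
# run, build_giella and DRY_RUN are hypothetical names introduced for this example.
run() {
    # With DRY_RUN=1, print the command instead of executing it.
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

build_giella() {
    url=$1
    dir=$2
    shift 2    # any remaining arguments are passed on to ./configure
    run svn co "$url" "$dir" || return 1
    (
        # Subshell, so the caller's working directory is left unchanged.
        run cd "$dir" &&
        run ./autogen.sh &&
        run ./configure "$@" &&
        run make
    )
}
```

For example, `build_giella https://victorio.uit.no/langtech/trunk/langs/udm giella-udm --with-hfst --enable-reversed-intersect --enable-apertium` would perform the Udmurt steps listed above. Setting DRY_RUN=1 first prints the commands without running them, which is a cheap way to review what would happen.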

Language pair

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make
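Once make has finished, the pair can be tried from its build directory. The following is a minimal sketch, assuming the apertium command is installed and that the pair exposes a mode named udm-kpv (inferred from the pair name, not confirmed on this page); translate_udm_kpv is a hypothetical helper name.

```shell
# Sketch: send one line of text through the freshly built pair.
# translate_udm_kpv is a hypothetical helper; the mode name udm-kpv is an assumption.
translate_udm_kpv() {
    pair_dir=${2:-.}    # directory holding the compiled pair, default: current directory
    if ! command -v apertium >/dev/null 2>&1; then
        echo "apertium not found in PATH" >&2
        return 127
    fi
    # -d points apertium at the directory that contains the pair's modes.
    printf '%s\n' "$1" | apertium -d "$pair_dir" udm-kpv
}
```

From inside apertium-udm-kpv, calling translate_udm_kpv with an Udmurt sentence as its first argument prints whatever the translator produces for it. Words missing from the dictionaries normally appear marked in the output, which makes this a quick way to spot coverage gaps.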

Google Summer of Code 2018

Work plan

Week | Dates         | Goals                                     | Achieved translator coverage | Achieved mono. coverage
1    | 14.05 - 20.05 | filling in Udmurt dictionary              |                              |
2    | 21.05 - 27.05 | working on disambiguation                 |                              |
3    | 28.05 - 03.06 | filling in Komi dictionary                |                              |
4    | 04.06 - 10.06 | working on disambiguation                 |                              |
     |               | midterm evaluation                        |                              |
5    | 11.06 - 17.06 | aligning parallel texts                   |                              |
6-7  | 18.06 - 25.06 | writing scripts for getting translations  |                              |
8    | 26.06 - 08.07 | filling in bilingual dictionary           |                              |
     |               | midterm evaluation                        |                              |
9-10 | 09.07 - 22.07 | writing transfer rules                    |                              |
11   | 23.07 - 29.07 | testing the translator                    |                              |
12   | 30.07 - 05.08 | cleaning up, writing documentation, releasing |                          |
     |               | Project completed                         |                              |