Difference between revisions of "Udmurt and Komi"


Latest revision as of 17:08, 14 June 2018

This page documents apertium-udm-kpv, the Apertium language pair for Udmurt (udm) and Komi (kpv).

Monolingual transducers

Installation

Monolingual dependencies

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

Then run source ~/.bashrc so that both variables are available in the current shell.
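Before configuring the language directories, it can help to confirm that both variables are set and point at real directories. The following is a minimal bash sketch; check_giella_env is a hypothetical helper name introduced here for illustration and is not part of the Giella infrastructure.

```shell
# Hypothetical helper (not shipped with giella-core): report whether
# GIELLA_CORE and GIELLA_SHARED are set and point at existing directories.
check_giella_env() {
    local var value status=0
    for var in GIELLA_CORE GIELLA_SHARED; do
        # ${!var} is bash indirect expansion: the value of the variable named by $var
        value="${!var:-}"
        if [ -z "$value" ]; then
            echo "$var is not set" >&2
            status=1
        elif [ ! -d "$value" ]; then
            echo "$var points at a missing directory: $value" >&2
            status=1
        fi
    done
    return "$status"
}
```

Calling check_giella_env after sourcing ~/.bashrc prints nothing and returns 0 when both paths are usable; otherwise it names the offending variable on standard error.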

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure  --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..

Language pair

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make
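Once make has finished, a quick smoke test is to run some text through the compiled pair. The sketch below wraps the standard apertium -d DIRECTORY MODE invocation in a small function. The mode name udm-kpv is an assumption based on the usual Apertium naming convention (check the modes directory produced by the build), and translate_udm_kpv is a hypothetical helper name.

```shell
# Assumption: the build installs a translation mode named "udm-kpv".
# translate_udm_kpv is a hypothetical wrapper, not part of the pair itself.
translate_udm_kpv() {
    # $1: path to the compiled apertium-udm-kpv directory (defaults to the current one)
    # Text to translate is read from standard input.
    apertium -d "${1:-.}" udm-kpv
}
```

From inside apertium-udm-kpv, translate_udm_kpv < input.txt would translate a (hypothetical) file of Udmurt text. By default apertium prefixes words it could not analyse with an asterisk, which makes gaps in the dictionaries easy to spot.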

Google Summer of Code 2018

Workplan

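{| class="wikitable"
! Week !! Dates !! Goals !! Bidix coverage !! Udm coverage !! Komi coverage
|-
! 1
| style="text-align:center" | 14.05 - 20.05
|
* Aligning parallel texts
* Add function words
* Write transfer rules for function words
| style="text-align:center" | 32.7%
| style="text-align:center" | 77.4%
| style="text-align:center" | 96.3%
|-
! 2
| style="text-align:center" | 21.05 - 27.05
|
* Add nominals to bilingual dictionary
| style="text-align:center" | 51.4%
| style="text-align:center" |
| style="text-align:center" |
|-
! 3
| style="text-align:center" | 28.05 - 03.06
|
* Write transfer rules for nominals
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 4
| style="text-align:center" | 04.06 - 10.06
|
* Add verbs to bilingual dictionary
| style="text-align:center" | 62.4%
| style="text-align:center" | 86%
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" | '''Midterm evaluation'''
|-
! 5
| style="text-align:center" | 11.06 - 17.06
|
* Write transfer rules for verbs
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 6
| style="text-align:center" | 18.06 - 24.06
|
* Add stems to Udmurt transducer
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 7
| style="text-align:center" | 25.06 - 01.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 8
| style="text-align:center" | 02.07 - 08.07
|
* Add stems to Komi transducer
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" | '''Midterm evaluation'''
|-
! 9
| style="text-align:center" | 09.07 - 15.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 10-11
| style="text-align:center" | 16.07 - 29.07
|
* Testing, debugging
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
! 12
| style="text-align:center" | 30.07 - 05.08
|
* Cleaning up
* Writing documentation
* Releasing
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" | '''Project completed'''
|}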
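The workplan reports coverage as percentages but does not say how they were measured. One common naive measure is the share of tokens that receive at least one analysis. The sketch below is an assumption about method, not a description of how the figures above were obtained: it expects analyser output in the Apertium stream format, where each token is written as ^form/analysis$ and an unknown token as ^form/*form$. The function name naive_coverage is hypothetical, and escaped $ characters inside tokens are ignored for simplicity.

```shell
# Hypothetical helper: naive coverage of morphological analyser output.
# Reads Apertium stream format on standard input and prints the percentage
# of tokens that have at least one analysis (that is, are not marked "/*").
naive_coverage() {
    # Put every ^...$ lexical unit on its own line, then count them in awk.
    grep -oE '\^[^$]*\$' |
        awk '{ total++; if ($0 ~ /\/\*/) unknown++ }
             END { if (total) printf "%.1f\n", 100 * (total - unknown) / total }'
}
```

For instance, analyser output could be piped straight into naive_coverage to track progress from week to week. If the input contains no lexical units, the function prints nothing.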