Udmurt and Komi

Latest revision as of 17:08, 14 June 2018

This page covers udm-kpv, the Apertium language pair for Udmurt (udm) and Komi (kpv).

Monolingual transducers

Installation

Monolingual dependencies

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

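Then reload the file so the variables take effect in the current shell:

$ source ~/.bashrc

Next, check out and build the Udmurt (udm) and Komi (kpv) transducers: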

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
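
Before moving on to the language pair, it can help to confirm that the environment variables and build directories from the steps above are in place. The following is a minimal sketch, not part of Giella or Apertium: the function name check_giella_setup is hypothetical, and it assumes the giella-udm and giella-kpv checkouts sit side by side in the directory passed as the first argument (the current directory by default).

```shell
# Hypothetical helper, not shipped with Giella or Apertium.
# Prints one line per problem found and returns non-zero if anything is missing.
check_giella_setup() {
    status=0
    # The two variables exported in ~/.bashrc must name existing directories.
    for var in GIELLA_CORE GIELLA_SHARED; do
        eval "value=\${$var:-}"
        if [ -z "$value" ]; then
            echo "$var is not set"
            status=1
        elif [ ! -d "$value" ]; then
            echo "$var points to a missing directory: $value"
            status=1
        fi
    done
    # The language pair is configured against these two directories.
    for lang in udm kpv; do
        dir="${1:-.}/giella-$lang/tools/mt/apertium"
        if [ ! -d "$dir" ]; then
            echo "missing $dir"
            status=1
        fi
    done
    return "$status"
}
```

Running check_giella_setup from the directory that holds both checkouts prints nothing and returns 0 when everything is in place. The check only looks for directories; it does not verify that make finished without errors.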

Language pair

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make

Google Summer of Code 2018

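Work plan

{| class="wikitable" style="text-align: left;"
! Week !! Dates !! Goals !! Bidix coverage !! Udm coverage !! Komi coverage
|-
! 1
| 14.05 - 20.05
|
* Align parallel texts
* Add function words
* Write transfer rules for function words
| 32.7%
| 77.4%
| 96.3%
|-
! 2
| 21.05 - 27.05
|
* Add nominals to bilingual dictionary
| 51.4% || ||
|-
! 3
| 28.05 - 03.06
|
* Write transfer rules for nominals
| || ||
|-
! 4
| 04.06 - 10.06
|
* Add verbs to bilingual dictionary
| 62.4% || 86% ||
|-
| colspan="6" style="text-align:center" | '''Midterm evaluation'''
|-
! 5
| 11.06 - 17.06
|
* Write transfer rules for verbs
| || ||
|-
! 6
| 18.06 - 24.06
|
* Add stems to Udmurt transducer
| || ||
|-
! 7
| 25.06 - 01.07
|
* Work on disambiguation
| || ||
|-
! 8
| 02.07 - 08.07
|
* Add stems to Komi transducer
| || ||
|-
| colspan="6" style="text-align:center" | '''Midterm evaluation'''
|-
! 9
| 09.07 - 15.07
|
* Work on disambiguation
| || ||
|-
! 10-11
| 16.07 - 29.07
|
* Test and debug
| || ||
|-
! 12
| 30.07 - 05.08
|
* Clean up
* Write documentation
* Release
| || ||
|-
| colspan="6" style="text-align:center" | '''Project completed'''
|}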
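
The page does not say how the coverage figures in the table were measured. One common approach, sketched here purely as an assumption, is naive coverage: the share of tokens for which the analyser returns at least one analysis. In the Apertium stream format an unknown token is written as ^form/*form$, so counting tokens that contain "/*" gives the number of unknowns. The function name naive_coverage is hypothetical, and the sketch ignores escaped characters inside tokens.

```shell
# Hypothetical helper: reads analyser output in Apertium stream format on stdin
# and prints the percentage of tokens that received at least one analysis.
naive_coverage() {
    # Put every ^...$ lexical unit on its own line, then count with awk.
    grep -o '\^[^$]*\$' |
    awk '
        { total++ }
        # An unknown token looks like ^form/*form$
        /\/\*/ { unknown++ }
        END {
            if (total == 0) { print "no tokens"; exit 1 }
            printf "%.1f%%\n", 100 * (total - unknown) / total
        }'
}
```

With real data, the input would be the output of the monolingual analyser run over a corpus. The bidix column presumably measures something different (how many analysed words have an entry in the bilingual dictionary), which this sketch does not cover.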