Difference between revisions of "Udmurt and Komi"

From Apertium
Jump to navigation Jump to search
 
(16 intermediate revisions by the same user not shown)
Line 61: Line 61:


==Google Summer of Code 2018==
==Google Summer of Code 2018==
Workplan
Work plan

{|class="wikitable"
{|class="wikitable"
| '''Week''' || '''Dates''' || '''Goals''' || '''Bidix coverage''' || '''Udm coverage''' || '''Komi coverage'''
! style="width: 10%" | Week
! style="width: 15%" | Dates
! style="width: 35%" | Goals
! style="width: 10%" | Bidix coverage
! style="width: 10%" | Udm coverage
! style="width: 10%" | Komi coverage
|-
! 1
| style="text-align:center" | 14.05 - 20.05
|
* Aligning parallel texts
* Add function words
* Write transfer rules for function words
| style="text-align:center" |
32.7%
| style="text-align:center" |
77.4%
| style="text-align:center" |
96.3%
|-
! 2
| style="text-align:center" | 21.05 - 27.05
|
* Add nominals to bilingual dictionary
| style="text-align:center" |
51.4%
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!3
|1|| 14.05 - 20.05 || aligning parallel texts;\n add nominals to bilingual dictionary|| || ||
| style="text-align:center" | 28.05 - 03.06
|
* Write transfer rules for nominals
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!4
|2|| 21.05 - 27.05 || write transfer rules for nominals || || ||
| style="text-align:center" | 04.06 - 10.06
|
* Add verbs to bilingual dictionary
| style="text-align:center" |
62.4%
| style="text-align:center" |
86%
| style="text-align:center" |
|-
|-
| style="text-align:center" colspan="6" |'''Midterm evaluation'''
|3 || 28.05 - 03.06 || add verbs to bilingual dictionary || || ||
|-
|-
!5
|4 || 04.06 - 10.06 || write transfer rules for verbs || || ||
| style="text-align:center" | 11.06 - 17.06
|
* Write transfer rules for verbs
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!6
| colspan="3" |'''midterm evaluation'''
| style="text-align:center" | 18.06 - 24.06
|-
|
|5 || 11.06 - 17.06 || add stems to Udmurt transducer|| || ||
* Add stems to Udmurt transducer
|-
| style="text-align:center" |
|6 || 18.06 - 24.06 || working on disambiguation || || ||
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!7
|7 || 25.06 - 01.07 || add stems to Komi transducer || || ||
| style="text-align:center" | 25.06 - 01.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!8
|8 || 02.07 - 08.07 || working on disambiguation || || ||
| style="text-align:center" | 02.07 - 08.07
|
* Add stems to Komi transducer
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
| colspan="3" | '''midterm evaluation'''
| style="text-align:center" colspan="6" | '''Midterm evaluation'''
|-
|-
!9
|9-10 || 09.07 - 22.07 || writing transfer rules|| || ||
| style="text-align:center" | 09.07 - 15.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
!10-11
|11 || 23.07 - 29.07 || Testvoc || || ||
| style="text-align:center" | 16.07 - 29.07
|-
|
|12 || 30.07 - 05.08 || cleaning up, writing documentation, releasing|| || ||
* Testing, debugging
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!12
| style="text-align:center" | 30.07 - 05.08
|
* Cleaning up
* Writing documentation
* Releasing
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
|-
| colspan="3" | '''Project completed'''
| style="text-align:center" colspan="6" | '''Project completed'''
|}
|}
[[Category:Language pairs]]
[[Category:Language pairs]]

Latest revision as of 17:08, 14 June 2018

The udm-kpv language pair.

Monolingual transducers[edit]

Installation[edit]

Monolingual dependencies[edit]

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

And do source ~/.bashrc

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure  --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..

Language pair[edit]

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make

Google Summer of Code 2018[edit]

Workplan

Week Dates Goals Bidix coverage Udm coverage Komi coverage
1 14.05 - 20.05
  • Aligning parallel texts
  • Add function words
  • Write transfer rules for function words

32.7%

77.4%

96.3%

2 21.05 - 27.05
  • Add nominals to bilingual dictionary

51.4%

3 28.05 - 03.06
  • Write transfer rules for nominals
4 04.06 - 10.06
  • Add verbs to bilingual dictionary

62.4%

86%

Midterm evaluation
5 11.06 - 17.06
  • Write transfer rules for verbs
6 18.06 - 24.06
  • Add stems to Udmurt transducer
7 25.06 - 01.07
  • Working on disambiguation
8 02.07 - 08.07
  • Add stems to Komi transducer
Midterm evaluation
9 09.07 - 15.07
  • Working on disambiguation
10-11 16.07 - 29.07
  • Testing, debugging
12 30.07 - 05.08
  • Cleaning up
  • Writing documentation
  • Releasing
Project completed