Difference between revisions of "Udmurt and Komi"

From Apertium
Jump to navigation Jump to search
 
(24 intermediate revisions by 2 users not shown)
Line 1: Line 1:


The udm-kpv language pair.
==Code==

==Monolingual transducers==

==Installation==
===Monolingual dependencies===
===Monolingual dependencies===
<pre>
<pre>
Line 33: Line 37:
$ cd giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./autogen.sh
$ ./configure --enable-apertium
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ make
$ cd ..
$ cd ..
Line 40: Line 44:
<pre>
<pre>
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv$ ./autogen.sh
$ cd giella-kpv
$ ./autogen.sh
$ ./configure --enable-apertium
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ make
$ cd ..
$ cd ..
Line 55: Line 60:
</pre>
</pre>


==Google Summer of Code 2018==
Workplan


{|class="wikitable"
! style="width: 10%" | Week
! style="width: 15%" | Dates
! style="width: 35%" | Goals
! style="width: 10%" | Bidix coverage
! style="width: 10%" | Udm coverage
! style="width: 10%" | Komi coverage
|-
! 1
| style="text-align:center" | 14.05 - 20.05
|
* Aligning parallel texts
* Add function words
* Write transfer rules for function words
| style="text-align:center" |
32.7%
| style="text-align:center" |
77.4%
| style="text-align:center" |
96.3%
|-
! 2
| style="text-align:center" | 21.05 - 27.05
|
* Add nominals to bilingual dictionary
| style="text-align:center" |
51.4%
| style="text-align:center" |
| style="text-align:center" |
|-
!3
| style="text-align:center" | 28.05 - 03.06
|
* Write transfer rules for nominals
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!4
| style="text-align:center" | 04.06 - 10.06
|
* Add verbs to bilingual dictionary
| style="text-align:center" |
62.4%
| style="text-align:center" |
86%
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" |'''Midterm evaluation'''
|-
!5
| style="text-align:center" | 11.06 - 17.06
|
* Write transfer rules for verbs
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!6
| style="text-align:center" | 18.06 - 24.06
|
* Add stems to Udmurt transducer
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!7
| style="text-align:center" | 25.06 - 01.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!8
| style="text-align:center" | 02.07 - 08.07
|
* Add stems to Komi transducer
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" | '''Midterm evaluation'''
|-
!9
| style="text-align:center" | 09.07 - 15.07
|
* Working on disambiguation
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!10-11
| style="text-align:center" | 16.07 - 29.07
|
* Testing, debugging
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
!12
| style="text-align:center" | 30.07 - 05.08
|
* Cleaning up
* Writing documentation
* Releasing
| style="text-align:center" |
| style="text-align:center" |
| style="text-align:center" |
|-
| style="text-align:center" colspan="6" | '''Project completed'''
|}
[[Category:Language pairs]]
[[Category:Language pairs]]

Latest revision as of 17:08, 14 June 2018

The udm-kpv language pair.

Monolingual transducers[edit]

Installation[edit]

Monolingual dependencies[edit]

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

And do source ~/.bashrc

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure  --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..

Language pair[edit]

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make

Google Summer of Code 2018[edit]

Workplan

Week Dates Goals Bidix coverage Udm coverage Komi coverage
1 14.05 - 20.05
  • Aligning parallel texts
  • Add function words
  • Write transfer rules for function words

32.7%

77.4%

96.3%

2 21.05 - 27.05
  • Add nominals to bilingual dictionary

51.4%

3 28.05 - 03.06
  • Write transfer rules for nominals
4 04.06 - 10.06
  • Add verbs to bilingual dictionary

62.4%

86%

Midterm evaluation
5 11.06 - 17.06
  • Write transfer rules for verbs
6 18.06 - 24.06
  • Add stems to Udmurt transducer
7 25.06 - 01.07
  • Working on disambiguation
8 02.07 - 08.07
  • Add stems to Komi transducer
Midterm evaluation
9 09.07 - 15.07
  • Working on disambiguation
10-11 16.07 - 29.07
  • Testing, debugging
12 30.07 - 05.08
  • Cleaning up
  • Writing documentation
  • Releasing
Project completed