Udmurt and Komi


Latest revision as of 17:08, 14 June 2018

This page covers the udm-kpv (Udmurt and Komi-Zyrian) language pair.

Monolingual transducers

Installation

Monolingual dependencies

$ svn co https://victorio.uit.no/langtech/trunk/giella-core
$ cd giella-core
$ ./autogen.sh
$ ./configure
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/giella-shared
$ cd giella-shared
$ ./autogen.sh
$ ./configure
$ make
$ cd ..

Now edit your $HOME/.bashrc and add two lines, replacing /PATH/TO with the full path to those directories:

export GIELLA_CORE=/PATH/TO/giella-core
export GIELLA_SHARED=/PATH/TO/giella-shared

Then run source ~/.bashrc to load the variables into the current shell.
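
Before building the language modules, it can help to confirm that both variables are visible in the current shell and point to real directories. The function below is a minimal sketch of such a check; check_giella_env is a hypothetical helper name, not part of the Giella tooling.

```shell
# Hypothetical helper (not part of Giella): verify that GIELLA_CORE and
# GIELLA_SHARED are set and point to existing directories.
check_giella_env() {
    rc=0
    for name in GIELLA_CORE GIELLA_SHARED; do
        # Look up the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set"
            rc=1
        elif [ ! -d "$value" ]; then
            echo "$name points to a missing directory: $value"
            rc=1
        else
            echo "$name=$value"
        fi
    done
    return $rc
}
```

Running check_giella_env after source ~/.bashrc prints one line per variable and returns a non-zero status if either one is missing or wrong.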

$ svn co https://victorio.uit.no/langtech/trunk/langs/udm giella-udm
$ cd giella-udm
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..
$ svn co https://victorio.uit.no/langtech/trunk/langs/kpv giella-kpv
$ cd giella-kpv
$ ./autogen.sh
$ ./configure --with-hfst --enable-reversed-intersect --enable-apertium
$ make
$ cd ..

Language pair

$ git clone git@github.com:/apertium/apertium-udm-kpv.git
$ cd apertium-udm-kpv
$ ./autogen.sh --with-lang1=../giella-udm/tools/mt/apertium --with-lang2=../giella-kpv/tools/mt/apertium
$ make
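
Once make has finished, one way to see what was built is to list the mode files in the pair directory. This is a sketch that assumes the build writes its modes to a modes/ subdirectory as .mode files, as Apertium pairs usually do; list_modes is a hypothetical helper name.

```shell
# Hypothetical helper: print the names of the modes found in a pair
# directory (default: the current directory). Assumes the build placed
# them in modes/*.mode.
list_modes() {
    pair_dir=${1:-.}
    for mode_file in "$pair_dir"/modes/*.mode; do
        # Skip the unexpanded glob when no mode files exist.
        [ -e "$mode_file" ] || continue
        basename "$mode_file" .mode
    done
}

# Example use from inside apertium-udm-kpv (the mode name udm-kpv is an
# assumption; use whichever name list_modes reports):
#   list_modes .
#   echo "some Udmurt text" | apertium -d . udm-kpv
```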

Google Summer of Code 2018

Workplan

| Week | Dates | Goals | Bidix coverage | Udm coverage | Komi coverage |
|---|---|---|---|---|---|
| 1 | 14.05 - 20.05 | Aligning parallel texts; add function words; write transfer rules for function words | 32.7% | 77.4% | 96.3% |
| 2 | 21.05 - 27.05 | Add nominals to bilingual dictionary | 51.4% | | |
| 3 | 28.05 - 03.06 | Write transfer rules for nominals | | | |
| 4 | 04.06 - 10.06 | Add verbs to bilingual dictionary | 62.4% | 86% | |
| Midterm evaluation | | | | | |
| 5 | 11.06 - 17.06 | Write transfer rules for verbs | | | |
| 6 | 18.06 - 24.06 | Add stems to Udmurt transducer | | | |
| 7 | 25.06 - 01.07 | Working on disambiguation | | | |
| 8 | 02.07 - 08.07 | Add stems to Komi transducer | | | |
| Midterm evaluation | | | | | |
| 9 | 09.07 - 15.07 | Working on disambiguation | | | |
| 10-11 | 16.07 - 29.07 | Testing, debugging | | | |
| 12 | 30.07 - 05.08 | Cleaning up; writing documentation; releasing | | | |
| Project completed | | | | | |
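
The page does not say how the coverage figures in the workplan were measured. A common approach, assumed here, is naive coverage: the share of tokens that receive at least one analysis. The sketch below computes it from analyser output in the Apertium stream format, where each token appears as ^surface/analysis$ and an unknown token as ^surface/*surface$. The name naive_coverage is hypothetical, and the script ignores escaped characters inside tokens.

```shell
# Hypothetical helper: naive coverage of analyser output read from stdin.
# Assumes Apertium stream format, with unknown tokens marked by "/*".
naive_coverage() {
    # Put each lexical unit (^...$) on its own line, then count them.
    grep -o '\^[^$]*\$' | awk '
        { total++ }
        /\/\*/ { unknown++ }   # "/*" marks a token with no analysis
        END {
            if (total == 0) { print "no tokens"; exit 1 }
            printf "%.1f%%\n", 100 * (total - unknown) / total
        }'
}

# Example with two tokens, one of them unknown:
#   echo '^a/a<n>$ ^b/*b$' | naive_coverage    # prints 50.0%
```

In practice the input would come from running a corpus through a morphological analysis mode of the pair; the exact mode name depends on the build and is not given on this page.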