Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"

Line 147 (week 7 row), values changed in this revision:
ben monodix: was 7032, now 7075
Bidix: was 1651, now 1670
hin-ben: was ~67.9%, now ~67.6%
ben-hin: was ~49.5%, now ~49.6%
Unchanged: ben: ~72.0%

Revision as of 10:30, 26 July 2021

Workplan

Columns:
Week
Dates
Goals: Bidix (excluding proper names), Coverage, WER
Fulfilled: Monolingual dictionaries, Bilingual dictionary / repository, ben monodix (excl. proper names), Bidix (excl. proper names), Non-WP coverage (%), WP coverage (%), WER (%), Testvoc (clean %), Manual disamb. (words)

Week 1, 06/13/2021
Goals: bidix 500
Monolingual dictionaries (apertium-ben):
Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
Bilingual dictionary: pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn
ben monodix: 6603
Bidix: 756
Coverage: hin-ben ~33.3%, ben-hin ~20.4%, ben ~67.9%

Week 2, 06/20/2021
Goals: bidix 500
Fulfilled: Preparing scripts for adding words from the available free data to the dictionaries
ben monodix: 6637
Bidix: 895
Coverage: hin-ben ~40.1%, ben-hin ~29.7%, ben ~67.7%

Week 3, 06/27/2021
Goals: bidix 500
Fulfilled:
Key transfer rules hin > ben to avoid #
Eventually: the same for ben > hin
Manual disambiguation of Hindi texts
ben monodix: 6640
Bidix: 931
Coverage: hin-ben ~39.5%, ben-hin ~34.0%, ben ~69.9%

Week 4, 07/04/2021
Goals: bidix 500
Fulfilled: Manual disambiguation of Hindi texts
ben monodix: 6687
Bidix: 917
Coverage: hin-ben ~44.5%, ben-hin ~39.3%, ben ~70.0%

Week 5, 07/11/2021
Goals: bidix 800
Monolingual dictionaries (apertium-ben):
Ordinals
Manually adding the most frequent names (150), adjectives (100), verbs (50)
Bilingual dictionary:
Ordinals
Most frequent names (150), adjectives (100), verbs (50)
Word selection rules
ben monodix: 6764
Bidix: 1136
Coverage: hin-ben ~63.2%, ben-hin ~43.4%, ben ~71.0%

Week 6, 07/18/2021
Goals: bidix 5000
Monolingual dictionaries: Adding words from available data
Bilingual dictionary:
Adding words from available data
Word selection rules
ben monodix: 6984
Bidix: 1328
Coverage: hin-ben ~65.5%, ben-hin ~47.6%, ben ~71.8%

Week 7, 07/25/2021
Goals: bidix 10000; ~80% hin - ben ~50%
Monolingual dictionaries: Adding words from available data
Bilingual dictionary:
Adding words from available data
Word selection rules
ben monodix: 7075
Bidix: 1670
Coverage: hin-ben ~67.6%, ben-hin ~49.6%, ben ~72.0%

Week 8, 08/01/2021
Goals: bidix 10100
Monolingual dictionaries: Morphological disambiguation rules for Hindi
Bilingual dictionary:
Transfer rules
Testvoc: closed categories, adv

Week 9, 08/08/2021
Goals: bidix 10200
Monolingual dictionaries: Morphological disambiguation rules for Hindi
Bilingual dictionary:
Transfer rules
Testvoc: adj

Week 10, 08/15/2021
Goals: bidix 10300
Monolingual dictionaries: Morphological disambiguation rules for Hindi
Bilingual dictionary:
Transfer rules
Testvoc: n

Week 11, 08/22/2021
Goals: bidix 10400; ~80% hin - ben ~65%
Monolingual dictionaries: Morphological disambiguation rules for Hindi
Bilingual dictionary:
Transfer rules
Testvoc: vblex
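
The coverage and WER percentages tracked in this workplan can be reproduced roughly as follows. This is only a sketch under stated assumptions, not the script actually used for these figures: it assumes coverage is the share of tokens the analyser recognises, with unknown tokens marked by a leading `*`, and that WER is the word-level edit distance divided by the reference length. The function names `coverage` and `wer` are hypothetical.

```python
def coverage(analysed_tokens):
    """Percentage of tokens that were recognised.

    Assumption: unknown tokens carry a leading '*' mark.
    """
    if not analysed_tokens:
        return 0.0
    known = sum(1 for tok in analysed_tokens if not tok.startswith("*"))
    return 100.0 * known / len(analysed_tokens)


def wer(reference, hypothesis):
    """Word error rate in percent: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Dynamic-programming Levenshtein distance over words, one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return 100.0 * prev[-1] / len(ref)
```

For example, `coverage(["a", "*b", "c", "d"])` counts three known tokens out of four, and `wer("a b c d", "a x c")` counts one substitution and one deletion against a four-word reference.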