Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"

From Apertium
Line 204:
  |
  |-
- |11
+ |12
  |07/25/2021
  |10000

Revision as of 17:06, 20 June 2021

Workplan
Week | Dates | Goals | Fulfilled
Subcolumns: Bidix (excluding proper names) | Coverage | WER | Monolingual dictionaries | Bilingual dictionary / repository | ben monodix (excl. proper names) | Bidix (excl. proper names) | Non-WP coverage (%) | WP coverage (%) | WER (%) | Testvoc (clean %) | --- | Manual disamb. (words)

1 06/13/2021 500 apertium-ben:

Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn

pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn 6603 756 hin-ben: ~33.3%

ben-hin: ~20.4% ben: ~67.9%

2 06/20/2021 500 preparing scripts for adding words from the available free data into the dictionaries
3 06/27/2021 500 Key transfer rules hin > ben to avoid #

Eventually: the same for ben > hin Manual disambiguation of Hindi texts

4 07/04/2021 500 Manual disambiguation of Hindi texts
5 07/11/2021 800 apertium-ben:

ordinals Manually adding the most frequent names (150), adjectives (100), verbs (50)

ordinals

Most frequent names (150), adjectives (100), verbs (50) Word selection rules

6 07/18/2021 5000 Adding words from available data Adding words from available data

Word selection rules

7 07/25/2021 10000 ~80% hin - ben ~50% Adding words from available data Adding words from available data

Word selection rules

8 08/01/2021 10100 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: closed categories, adv

9 08/08/2021 10200 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: adj

10 08/15/2021 10300 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: n

11 08/22/2021 10400 ~80% hin - ben ~65% Morphological disambiguation rules for Hindi Transfer rules

Testvoc: vblex

12 07/25/2021 10000 ~80% hin - ben ~50% Adding words from available data Adding words from available data

Word selection rules
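
The coverage and WER targets in the table above can be illustrated with a minimal sketch. It assumes the usual definitions: coverage as the share of corpus tokens the analyser recognises, and WER as the word-level edit distance between a machine translation and a reference, divided by the reference length. The function names `naive_coverage` and `word_error_rate` are hypothetical and are not part of the project's own evaluation tooling; whitespace tokenisation is also a simplifying assumption.

```python
def naive_coverage(tokens, known_words):
    """Percentage of tokens found in a set of known word forms (assumed definition)."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    known = sum(1 for token in tokens if token in known_words)
    return 100.0 * known / len(tokens)


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the reference length, in percent."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # word missing from the hypothesis
                            curr[j - 1] + 1,      # extra word in the hypothesis
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    print(naive_coverage(["a", "b", "c", "d"], {"a", "c", "d"}))
    print(word_error_rate("a b c d", "a x c"))
```

Under these definitions, a goal such as "~80%" coverage means that roughly four out of five corpus tokens receive an analysis, and a lower WER means fewer word edits are needed to turn the output into the reference translation.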