Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"

From Apertium
Jump to navigation Jump to search
Line 39: Line 39:
|
|
|
|
|apertium-ben:
|apertium-ben:<br>
Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn<br>
Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
|pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn
|pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn
Line 46: Line 46:
|756
|756
|
|
|hin-ben: ~33,3%
|hin-ben: ~33,3%<br>
ben-hin: ~20.4%
ben-hin: ~20.4%<br>
ben: ~67.9%
ben: ~67.9%
|
|
Line 62: Line 62:
|895
|895
|
|
|hin-ben: ~40,1%
|hin-ben: ~40,1%<br>
ben-hin: ~29.7%
ben-hin: ~29.7%<br>
ben: ~67.7%
ben: ~67.7%
|
|
Line 74: Line 74:
|
|
|
|
|Key transfer rules hin > ben to avoid #
|Key transfer rules hin > ben to avoid #<br>
Eventualy: the same for ben > hin
Eventualy: the same for ben > hin<br>
Manual disambiguation of Hindi texts
Manual disambiguation of Hindi texts
|
|
Line 103: Line 103:
|
|
|
|
|apertium-ben:
|apertium-ben:<br>
ordinals
ordinals<br>
Manual adding of most often names (150), adjectives (100), verbs (50)"
Manual adding of most often names (150), adjectives (100), verbs (50)
|"ordinals
|ordinals<br>
Most often names (150), adjectives (100), verbs (50)
Most often names (150), adjectives (100), verbs (50)<br>
Word selection rules
Word selection rules
|
|
Line 122: Line 122:
|
|
|Adding words from available data
|Adding words from available data
|Adding words from available data
|Adding words from available data<br>
Word selection rules
Word selection rules
|
|
Line 137: Line 137:
|hin - ben ~50%
|hin - ben ~50%
|Adding words from available data
|Adding words from available data
|Adding words from available data
|Adding words from available data<br>
Word selection rules
Word selection rules
|
|
Line 152: Line 152:
|
|
|Morphological disambiguation rules for Hindi
|Morphological disambiguation rules for Hindi
|Transfer rules
|Transfer rules<br>
Testvoc: closed categories, adv
Testvoc: closed categories, adv
|
|
Line 167: Line 167:
|
|
|Morphological disambiguation rules for Hindi
|Morphological disambiguation rules for Hindi
|Transfer rules
|Transfer rules<br>
Testvoc: adj
Testvoc: adj
|
|
Line 182: Line 182:
|
|
|Morphological disambiguation rules for Hindi
|Morphological disambiguation rules for Hindi
|Transfer rules
|Transfer rules<br>
Testvoc: n
Testvoc: n
|
|
Line 197: Line 197:
|hin - ben ~65%
|hin - ben ~65%
|Morphological disambiguation rules for Hindi
|Morphological disambiguation rules for Hindi
|"Transfer rules
|Transfer rules<br>
Testvoc: vblex"
Testvoc: vblex
|
|
|
|
|
|
|-
|12
|07/25/2021
|10000
|~80%
|hin - ben ~50%
|Adding words from available data
|Adding words from available data
Word selection rules
|
|
|
|

Revision as of 19:39, 20 June 2021

Workplan
Week Dates Goals Fulfilled
Bidix

(excluding proper names)

Coverage WER Monlingual dictionaries Bilingual dictionary / repository ben monodix

(excl. proper names)

Bidix

(excl. proper names)

Non-WP

coverage (%)

WP

coverage (%)

WER

(%)

Testvoc

(clean %) --- Manual disamb. (words)

1 06/13/2021 500 apertium-ben:

Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn

pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn 6603 756 hin-ben: ~33,3%

ben-hin: ~20.4%
ben: ~67.9%

2 06/20/2021 500 preparing scripts for adding words from the available free data into the dictionaries 6637 895 hin-ben: ~40,1%

ben-hin: ~29.7%
ben: ~67.7%

3 06/27/2021 500 Key transfer rules hin > ben to avoid #

Eventualy: the same for ben > hin
Manual disambiguation of Hindi texts

4 07/04/2021 500 Manual disambiguation of Hindi texts
5 07/11/2021 800 apertium-ben:

ordinals
Manual adding of most often names (150), adjectives (100), verbs (50)

ordinals

Most often names (150), adjectives (100), verbs (50)
Word selection rules

6 07/18/2021 5000 Adding words from available data Adding words from available data

Word selection rules

7 07/25/2021 10000 ~80% hin - ben ~50% Adding words from available data Adding words from available data

Word selection rules

8 08/01/2021 10100 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: closed categories, adv

9 08/08/2021 10200 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: adj

10 08/15/2021 10300 Morphological disambiguation rules for Hindi Transfer rules

Testvoc: n

11 08/22/2021 10400 ~80% hin - ben ~65% Morphological disambiguation rules for Hindi Transfer rules

Testvoc: vblex