Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"

From Apertium
(2 intermediate revisions by 2 users not shown)
Line 39: Line 39:
  |
  |
- |apertium-ben:
+ |apertium-ben:<br>
- Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
+ Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn<br>
  Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn
  |pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn
Line 46: Line 46:
  |756
  |
- |hin-ben: ~33,3%
+ |hin-ben: ~33,3%<br>
- ben-hin: ~20.4%
+ ben-hin: ~20.4%<br>
  ben: ~67.9%
  |
Line 59: Line 59:
  |
  |preparing scripts for adding words from the available free data into the dictionaries
+ |6637
+ |895
  |
- |
- |
- |
+ |hin-ben: ~40,1%<br>
+ ben-hin: ~29.7%<br>
+ ben: ~67.7%
  |
  |
Line 72: Line 74:
  |
  |
- |Key transfer rules hin > ben to avoid #
+ |Key transfer rules hin > ben to avoid #<br>
- Eventualy: the same for ben > hin
+ Eventualy: the same for ben > hin<br>
  Manual disambiguation of Hindi texts
  |
Line 101: Line 103:
  |
  |
- |apertium-ben:
+ |apertium-ben:<br>
- ordinals
+ ordinals<br>
- Manual adding of most often names (150), adjectives (100), verbs (50)"
+ Manual adding of most often names (150), adjectives (100), verbs (50)
- |"ordinals
+ |ordinals<br>
- Most often names (150), adjectives (100), verbs (50)
+ Most often names (150), adjectives (100), verbs (50)<br>
  Word selection rules
  |
Line 120: Line 122:
  |
  |Adding words from available data
- |Adding words from available data
+ |Adding words from available data<br>
  Word selection rules
  |
Line 135: Line 137:
  |hin - ben ~50%
  |Adding words from available data
- |Adding words from available data
+ |Adding words from available data<br>
  Word selection rules
  |
Line 150: Line 152:
  |
  |Morphological disambiguation rules for Hindi
- |Transfer rules
+ |Transfer rules<br>
  Testvoc: closed categories, adv
  |
Line 165: Line 167:
  |
  |Morphological disambiguation rules for Hindi
- |Transfer rules
+ |Transfer rules<br>
  Testvoc: adj
  |
Line 180: Line 182:
  |
  |Morphological disambiguation rules for Hindi
- |Transfer rules
+ |Transfer rules<br>
  Testvoc: n
  |
Line 195: Line 197:
  |hin - ben ~65%
  |Morphological disambiguation rules for Hindi
- |"Transfer rules
+ |Transfer rules<br>
- Testvoc: vblex"
+ Testvoc: vblex
- |
- |
- |
- |
- |
- |
- |-
- |11
- |07/25/2021
- |10000
- |~80%
- |hin - ben ~50%
- |Adding words from available data
- |Adding words from available data
- Word selection rules
  |
  |

Revision as of 19:39, 20 June 2021

Workplan

Columns: Week, Dates, Goals, Fulfilled.
Goals: Bidix (excluding proper names); Coverage; WER; Monolingual dictionaries; Bilingual dictionary / repository.
Fulfilled: ben monodix (excl. proper names); Bidix (excl. proper names); Non-WP coverage (%); WP coverage (%); WER (%); Testvoc (clean %); ---; Manual disamb. (words).

Week 1, 06/13/2021
Goals: Bidix: 500. Monolingual dictionaries: apertium-ben: main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn; add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn. Bilingual dictionary: pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn.
Fulfilled: ben monodix: 6603. Bidix: 756. Coverage: hin-ben ~33.3%; ben-hin ~20.4%; ben ~67.9%.

Week 2, 06/20/2021
Goals: Bidix: 500. Preparing scripts for adding words from the available free data into the dictionaries.
Fulfilled: ben monodix: 6637. Bidix: 895. Coverage: hin-ben ~40.1%; ben-hin ~29.7%; ben ~67.7%.

Week 3, 06/27/2021
Goals: Bidix: 500. Key transfer rules hin > ben to avoid #; eventually, the same for ben > hin; manual disambiguation of Hindi texts.

Week 4, 07/04/2021
Goals: Bidix: 500. Manual disambiguation of Hindi texts.

Week 5, 07/11/2021
Goals: Bidix: 800. Monolingual dictionaries: apertium-ben: ordinals; manually adding the most frequent names (150), adjectives (100), verbs (50). Bilingual dictionary: ordinals; most frequent names (150), adjectives (100), verbs (50); word selection rules.

Week 6, 07/18/2021
Goals: Bidix: 5000. Monolingual dictionaries: adding words from available data. Bilingual dictionary: adding words from available data; word selection rules.

Week 7, 07/25/2021
Goals: Bidix: 10000. Coverage: ~80%. WER: hin - ben ~50%. Monolingual dictionaries: adding words from available data. Bilingual dictionary: adding words from available data; word selection rules.

Week 8, 08/01/2021
Goals: Bidix: 10100. Monolingual dictionaries: morphological disambiguation rules for Hindi. Bilingual dictionary: transfer rules; testvoc: closed categories, adv.

Week 9, 08/08/2021
Goals: Bidix: 10200. Monolingual dictionaries: morphological disambiguation rules for Hindi. Bilingual dictionary: transfer rules; testvoc: adj.

Week 10, 08/15/2021
Goals: Bidix: 10300. Monolingual dictionaries: morphological disambiguation rules for Hindi. Bilingual dictionary: transfer rules; testvoc: n.

Week 11, 08/22/2021
Goals: Bidix: 10400. Coverage: ~80%. WER: hin - ben ~65%. Monolingual dictionaries: morphological disambiguation rules for Hindi. Bilingual dictionary: transfer rules; testvoc: vblex.
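
The workplan tracks coverage and WER as percentages. As a rough illustration only, the sketch below shows one common way such figures can be computed. It assumes coverage is the share of tokens the analyser recognises in Apertium stream format (where an unknown word appears as `^word/*word$`), and that WER is the word-level edit distance divided by the reference length. The function names `naive_coverage` and `word_error_rate` are hypothetical and are not taken from the project's own scripts.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Percentage of lexical units whose analysis is not marked unknown (*)."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    known = sum(1 for _surface, analyses in units if not analyses.startswith("*"))
    return 100.0 * known / len(units)


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance as a percentage of the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # previous[j] holds the distance between the ref prefix seen so far and hyp[:j]
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return 100.0 * previous[-1] / len(ref)
```

For example, `naive_coverage("^a/a<n>$ ^b/*b$")` counts one known unit out of two, and `word_error_rate("a b c d", "a x c")` counts one substitution and one deletion against a four-word reference.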