Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"
Hectoralos (talk | contribs) |
Revision as of 19:39, 20 June 2021
Workplan

Week | Dates | Goal: Bidix (excl. proper names) | Goal: Coverage | Goal: WER | Goal: Monolingual dictionaries | Goal: Bilingual dictionary / repository | Fulfilled: ben monodix (excl. proper names) | Fulfilled: Bidix (excl. proper names) | Fulfilled: Non-WP coverage (%) | Fulfilled: WP coverage (%) | Fulfilled: WER (%) | Fulfilled: Testvoc (clean %) / Manual disamb. (words) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 06/13/2021 | 500 | | | apertium-ben: Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn; Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn | pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn | 6603 | 756 | | hin-ben: ~33.3%; ben-hin: ~20.4%; ben: ~67.9% | | |
2 | 06/20/2021 | 500 | | | Preparing scripts for adding words from the available free data to the dictionaries | | 6637 | 895 | | hin-ben: ~40.1%; ben-hin: ~29.7%; ben: ~67.7% | | |
3 | 06/27/2021 | 500 | | | | Key transfer rules hin > ben to avoid #; Eventually: the same for ben > hin; Manual disambiguation of Hindi texts | | | | | | |
4 | 07/04/2021 | 500 | | | | Manual disambiguation of Hindi texts | | | | | | |
5 | 07/11/2021 | 800 | | | apertium-ben: ordinals; Manual addition of the most frequent names (150), adjectives (100), verbs (50) | ordinals; Most frequent names (150), adjectives (100), verbs (50); Word selection rules | | | | | | |
6 | 07/18/2021 | 5000 | | | Adding words from available data | Adding words from available data; Word selection rules | | | | | | |
7 | 07/25/2021 | 10000 | ~80% | hin-ben ~50% | Adding words from available data | Adding words from available data; Word selection rules | | | | | | |
8 | 08/01/2021 | 10100 | | | Morphological disambiguation rules for Hindi | Transfer rules; Testvoc: closed categories, adv | | | | | | |
9 | 08/08/2021 | 10200 | | | Morphological disambiguation rules for Hindi | Transfer rules; Testvoc: adj | | | | | | |
10 | 08/15/2021 | 10300 | | | Morphological disambiguation rules for Hindi | Transfer rules; Testvoc: n | | | | | | |
11 | 08/22/2021 | 10400 | ~80% | hin-ben ~65% | Morphological disambiguation rules for Hindi | Transfer rules; Testvoc: vblex | | | | | | |
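
The coverage columns in the workplan are percentages. A minimal sketch of how such a figure could be computed is given below, assuming it is naive coverage: the share of tokens for which the analyser returns at least one analysis. It assumes the corpus has already been analysed and the output is in Apertium stream format, where an unknown token appears as `^word/*word$`. The function name `naive_coverage` and the sample string are hypothetical and only illustrative; they are not real analyser output and not necessarily how the figures in the table were obtained.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
LEXICAL_UNIT = re.compile(r"\^([^/$^]*)/([^$]*)\$")


def naive_coverage(analysed_text: str) -> float:
    """Return the percentage of lexical units that have a known analysis.

    Assumption: an unknown word carries a leading '*' in its analysis,
    e.g. ^foo/*foo$.
    """
    total = 0
    known = 0
    for match in LEXICAL_UNIT.finditer(analysed_text):
        total += 1
        analyses = match.group(2)
        if not analyses.startswith("*"):
            known += 1
    if total == 0:
        return 0.0
    return 100.0 * known / total


# Made-up sample: two known tokens and two unknown tokens.
sample = "^ghar/ghar<n><m><sg>$ ^foo/*foo$ ^hai/ho<vbser><pres>$ ^bar/*bar$"
print(f"{naive_coverage(sample):.1f}%")
```

Counting tokens rather than distinct word forms means frequent words weigh more, which is why adding closed-class words early in the plan can move the percentage quickly.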