Difference between revisions of "User:Gourab337/GSoC2021-Workplan-Control"
Jump to navigation
Jump to search
Hectoralos (talk | contribs) |
|||
Line 59: | Line 59: | ||
| |
| |
||
|preparing scripts for adding words from the available free data into the dictionaries |
|preparing scripts for adding words from the available free data into the dictionaries |
||
|6637 |
|||
|895 |
|||
| |
| |
||
|hin-ben: ~40,1% |
|||
| |
|||
ben-hin: ~29.7% |
|||
| |
|||
ben: ~67.7% |
|||
| |
|||
| |
| |
||
| |
| |
Revision as of 19:22, 20 June 2021
Workplan | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Week | Dates | Goals | Fulfilled | |||||||||
Bidix
(excluding proper names) |
Coverage | WER | Monlingual dictionaries | Bilingual dictionary / repository | ben monodix
(excl. proper names) |
Bidix
(excl. proper names) |
Non-WP
coverage (%) |
WP
coverage (%) |
WER
(%) |
Testvoc
(clean %) --- Manual disamb. (words) | ||
1 | 06/13/2021 | 500 | apertium-ben:
Main paradigms: n, adj, vblex, vbser, adv, pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn Add/check words: pr, post, cnjcoo, cnjsub, cnjadv, det, num, prn |
pr, post, cnjcoo, cnjsub, cnjsub, num, det, prn | 6603 | 756 | hin-ben: ~33,3%
ben-hin: ~20.4% ben: ~67.9% |
|||||
2 | 06/20/2021 | 500 | preparing scripts for adding words from the available free data into the dictionaries | 6637 | 895 | hin-ben: ~40,1%
ben-hin: ~29.7% ben: ~67.7% |
||||||
3 | 06/27/2021 | 500 | Key transfer rules hin > ben to avoid #
Eventualy: the same for ben > hin Manual disambiguation of Hindi texts |
|||||||||
4 | 07/04/2021 | 500 | Manual disambiguation of Hindi texts | |||||||||
5 | 07/11/2021 | 800 | apertium-ben:
ordinals Manual adding of most often names (150), adjectives (100), verbs (50)" |
"ordinals
Most often names (150), adjectives (100), verbs (50) Word selection rules |
||||||||
6 | 07/18/2021 | 5000 | Adding words from available data | Adding words from available data
Word selection rules |
||||||||
7 | 07/25/2021 | 10000 | ~80% | hin - ben ~50% | Adding words from available data | Adding words from available data
Word selection rules |
||||||
8 | 08/01/2021 | 10100 | Morphological disambiguation rules for Hindi | Transfer rules
Testvoc: closed categories, adv |
||||||||
9 | 08/08/2021 | 10200 | Morphological disambiguation rules for Hindi | Transfer rules
Testvoc: adj |
||||||||
10 | 08/15/2021 | 10300 | Morphological disambiguation rules for Hindi | Transfer rules
Testvoc: n |
||||||||
11 | 08/22/2021 | 10400 | ~80% | hin - ben ~65% | Morphological disambiguation rules for Hindi | "Transfer rules
Testvoc: vblex" |
||||||
12 | 07/25/2021 | 10000 | ~80% | hin - ben ~50% | Adding words from available data | Adding words from available data
Word selection rules |