Hindi and Bengali
Jump to navigation
Jump to search
Contents
Hindi and Bengali for GSoC
This is a language pair translating between Hindi and Bengali.
Goals
Currently the translator is very basic. We need to increase it's coverage to cover more words of the languages. We also need to add more transfer rules to cover all the Pending Tests to get more accurate translations.
Done
Closed Categories (n, adj, vblex, vbser, adv, prn, post, cnjcoo, cnjsub, cnjadv, det, num, prn, ord).Most frequently used nouns, post, adj, adv, det added.Hin > Ben transfer rules on nouns, verbs tenses and adj added.Testing scripts and test corpus.
Todo list
- Increase coverage of translator by adding more nouns, adjectives and verbs from the list of most frequently used words in corpus. Reference
- Add transfer rules to fix pronoun #s (obj -> obl , nom -> nom, erg conversion).
- Write transfer rules for Pending Tests (Ben > Hin and Hin > Ben).
- Remove prox and dist tag in the bidix and replace it by making suitable paradigms for det.prox & det.dist (ইটা / ওটা).
- Do disambiguation.
- Reduce Word Error Rate.
Apertium Git Repositories
External Resources
General
- A Useful Collection of Resources (Important)
- Bengali Grammar
- Introduction to Hindi
- Learning of Hindi Phonology as a Foreigner
- Hindi by Yamuna Kachru
Dictionaries
- http://hindi-english.org/
- http://e-mahashabdkosh.rb-aai.in/
- https://shabdkosh.raftaar.in/Hindi-English-Dictionary
- http://www.aamboli.com/
Corpora