Bengali and English/Anubadok
Anubadok is an open-source English-to-Bengali machine translation (MT) system developed by G M Hossain. It is currently at an experimental stage.
The program is licensed under the GPL. It is accessible from here.
Tag Set
Anubadok uses the Penn Treebank tag set, which is as follows:
- CC Coordinating conjunction
- CD Cardinal number
- DT Determiner
- EX Existential there
- FW Foreign word
- IN Preposition or subordinating conjunction
- JJ Adjective
- JJR Adjective, comparative
- JJS Adjective, superlative
- LS List item marker
- MD Modal
- NN Noun, singular or mass
- NNS Noun, plural
- NP Proper noun, singular
- NPS Proper noun, plural
- PDT Predeterminer
- POS Possessive ending
- PP Personal pronoun
- PP$ Possessive pronoun
- RB Adverb
- RBR Adverb, comparative
- RBS Adverb, superlative
- RP Particle
- SYM Symbol
- TO to
- UH Interjection
- VB Verb, base form
- VBD Verb, past tense
- VBG Verb, gerund or present participle
- VBN Verb, past participle
- VBP Verb, non-3rd person singular present
- VBZ Verb, 3rd person singular present
- WDT Wh-determiner
- WP Wh-pronoun
- WP$ Possessive wh-pronoun
- WRB Wh-adverb
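To illustrate how these tags attach to words, the following minimal Python sketch pairs each word of a hand-tagged English sentence with its tag and looks up the description from the list above. The sentence, the `TAG_DESCRIPTIONS` table, and the `describe` helper are assumptions made for illustration; the tags were assigned by hand and are not output produced by Anubadok.

```python
# Subset of the tag set listed above (tag -> description).
TAG_DESCRIPTIONS = {
    "DT": "Determiner",
    "NN": "Noun, singular or mass",
    "VBD": "Verb, past tense",
    "IN": "Preposition or subordinating conjunction",
}

# Hand-tagged example sentence; NOT output produced by Anubadok.
tagged_sentence = [
    ("The", "DT"), ("cat", "NN"), ("sat", "VBD"),
    ("on", "IN"), ("the", "DT"), ("mat", "NN"),
]


def describe(tagged):
    """Return (word, tag, description) triples for a tagged sentence."""
    return [
        (word, tag, TAG_DESCRIPTIONS.get(tag, "unknown tag"))
        for word, tag in tagged
    ]


for word, tag, description in describe(tagged_sentence):
    print(f"{word}/{tag}: {description}")
```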
Inflection Rules (BnSondhi.pm)
Legend
- C Consonant
- V Vowel
- _ Any Letter
Rules
- __খোল + তে__ = __খুলতে__
- __পাঠা + তে__ = __পাঠাতে__
- __নি + ের__ = __নেওয়ার__
- __নে + ার__ = __নেওয়ার__
- __পা + তে__ = __পেতে__
- __দে + তে__ = __দিতে__
- __কর + ের__ = __করার__
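The rules above can be read as a mapping from a (stem, suffix) pair to an inflected form. The following Python sketch stores only the examples listed on this page in a lookup table. It is not the implementation in BnSondhi.pm, which is a Perl module and presumably generalizes over the C/V patterns in the legend. The names `EXAMPLES` and `join` are hypothetical, and the fallback to plain concatenation is an assumption.

```python
# (stem, suffix) -> inflected form, copied from the rules listed above.
# Illustrative only: this is NOT the data structure used by BnSondhi.pm.
EXAMPLES = {
    ("খোল", "তে"): "খুলতে",
    ("পাঠা", "তে"): "পাঠাতে",
    ("নি", "ের"): "নেওয়ার",
    ("নে", "ার"): "নেওয়ার",
    ("পা", "তে"): "পেতে",
    ("দে", "তে"): "দিতে",
    ("কর", "ের"): "করার",
}


def join(stem: str, suffix: str) -> str:
    """Join a stem and a suffix, using a listed form when one exists.

    Assumption: pairs not in the table fall back to plain concatenation,
    which will be wrong wherever a real sandhi rule should apply.
    """
    return EXAMPLES.get((stem, suffix), stem + suffix)


for stem, suffix in EXAMPLES:
    print(f"{stem} + {suffix} = {join(stem, suffix)}")
```

A table like this covers only the listed pairs. A rule-based version would instead match patterns over consonants and vowels, as the legend suggests.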