Difference between revisions of "Bengali and English/BugsAndIssues"

From Apertium
Jump to navigation Jump to search
Line 38: Line 38:
: সে আমার কাছ থেকে বইটা নিল - He took the book from me.
: সে আমার কাছ থেকে বইটা নিল - He took the book from me.


* Another word 'অল্প':
Apparently, from the inflection pattern, the word কাছ can be classified as noun, which it isnt ...
: সে অল্পে খুশি (রয়েছে) - He is satisfied with less.
: আমি অল্প খাই - I eat less
: আমি অল্পের জন্য কাজটা করতে পারলাম না - I could not do the work for (less, or something) [its tough doing word for word translation :(]

Revision as of 10:21, 2 July 2009

Nouns

  1. Only 800 tagged pure nouns from anubadok dictionary matched against CRBLP's 20K most freq used word list
  • need to tag more manually (en-es package has 5K approx. need to reach there)
  • Anubadok has about 2000 Nouns in its own list
  • Anubadok has about 2300 Proper Nouns in its own list
  1. Some nouns are always pl or sg, need to tag those
  2. We are excluding Proper nouns now
  3. We are excluding adjectives that can be used as nouns, right now
  4. We are keeping track the plural form generation through animacy, this is good, but in the long run need to come up with something more sophisticated
  5. Some nouns can have hybrid animacy, need to tag those later
  6. Should we tag the subtype of Noun?
  7. মা - মারা , জনক - জনকরা - These are wrong, need to add rule to fix that, either mark them as irregular and entry in a separate table or just find the adequate rule for them

Pronouns

Adjective

  • Adjectives can have genitive forms, eg. অল্পের জন্য বেঁচে গেছি। But this is only when the adjective is used as nouns, so we need to add these adjectives as nouns too

Verb

  • The gerund form of the verb can be used as nouns, so we need to add these gerunds into noun table, and mark them as inanimate.

Adverb

  • We are marking all the adverbs as <adv> and have not marked <cnjadv> properly, this needs to be changed ASAP

Determiner

Misc

  • The word 'কাছ':
সে আমার কাছে আসল - He came to me/ He came near me (Anubadok translates 'He came to me' - সে আমাকেতে আসেছিল, which is wrong ...)
সে আমার কাছের লোক - He is a close person of mine (The translation is still incorrect, I don't know the exact translation ...)
সে আমার কাছ থেকে বইটা নিল - He took the book from me.
  • Another word 'অল্প':
সে অল্পে খুশি (রয়েছে) - He is satisfied with less.
আমি অল্প খাই - I eat less
আমি অল্পের জন্য কাজটা করতে পারলাম না - I could not do the work for (less, or something) [its tough doing word for word translation :(]