Difference between revisions of "Bengali and English/TagSets"
Jump to navigation
Jump to search
Darthxaher (talk | contribs) |
(→Determiner: Its dem not demo) |
||
(41 intermediate revisions by 2 users not shown) | |||
Line 2: | Line 2: | ||
These symbols override anything from [[List_of_symbols]] |
These symbols override anything from [[List_of_symbols]] |
||
+ | |||
+ | == Categories == |
||
+ | |||
+ | {|class=wikitable |
||
+ | ! Symbol !! Gloss !! Numeric value !! Notes |
||
+ | |- |
||
+ | | <code>n</code> || Noun || 0 || |
||
+ | |- |
||
+ | | <code>np</code> || Proper noun || 1 || |
||
+ | |- |
||
+ | | <code>vblex</code> || Standard verb || 2 || |
||
+ | |- |
||
+ | | <code>adj</code> || Adjective || 3 || |
||
+ | |- |
||
+ | | <code>adv</code> || Adverb || 4 || |
||
+ | |- |
||
+ | | <code>det</code> || Determiner || 5 || |
||
+ | |- |
||
+ | | <code>prn</code> || Pronoun || 6 || |
||
+ | |- |
||
+ | | <code>pst</code> || Postposition || 7 || |
||
+ | |- |
||
+ | | <code>ij</code> || Interjection || 8 || |
||
+ | |- |
||
+ | | <code>cnjcoo</code> || Co-ordinating Conjunction || 9 || |
||
+ | |- |
||
+ | | <code>cnjsum</code> || Sub-ordinating Conjunction || 10, a || |
||
+ | |- |
||
+ | | <code>num</code> || Numeral || 11,b || |
||
+ | |- |
||
+ | |} |
||
== Pronouns == |
== Pronouns == |
||
Line 53: | Line 84: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! Symbol !! Gloss !! Notes |
+ | ! Symbol !! Gloss !! Numeric value !!Notes |
|- |
|- |
||
− | | <code>nom</code> || |
+ | | <code>nom</code> || Nominative Case || 0 |
|- |
|- |
||
− | | <code>obj</code> || Objective Case |
+ | | <code>obj</code> || Objective Case || 1 |
|- |
|- |
||
− | | <code>gen</code> || Genitive(possessive) Case |
+ | | <code>gen</code> || Genitive(possessive) Case || 2 |
|- |
|- |
||
− | | <code>loc</code> || Locative Case |
+ | | <code>loc</code> || Locative Case || 3 |
|} |
|} |
||
Line 67: | Line 98: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! Symbol !! Gloss !! Notes |
+ | ! Symbol !! Gloss !! Numeric value || Notes |
|- |
|- |
||
− | | <code> |
+ | | <code>nn</code> || Inanimate || 0 |
+ | |- |
||
+ | | <code>aa</code> || Animate || 1 |
||
|- |
|- |
||
− | | <code> |
+ | | <code>hu</code> || Human || 2 |
|- |
|- |
||
− | | <code> |
+ | | <code>el</code> || Elite || 3 |
|- |
|- |
||
− | | <code> |
+ | | <code>an</code> || Animate/Inanimate || 4 |
|- |
|- |
||
− | | <code> |
+ | | <code>ah</code> || Animate/Human || 5 |
+ | |- |
||
+ | | <code>eh</code> || Elite/Human || 6 |
||
|} |
|} |
||
− | '''NOTE:''' '''<code>hu</code>''' |
+ | '''NOTE:''' '''<code>hu</code>''' ,'''<code>el</code>''' and their hybrid are only used in case of Nouns |
=== Gender === |
=== Gender === |
||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! Symbol !! Gloss !! Notes |
+ | ! Symbol !! Gloss !! Numeric value || Notes |
|- |
|- |
||
− | | mf || Male,Female |
+ | | <code>mf</code> || Male,Female || 0 |
|- |
|- |
||
− | | m || Male |
+ | | <code>m</code> || Male || 1 |
|- |
|- |
||
− | | f || Female |
+ | | <code>f</code> || Female || 2 |
|- |
|- |
||
− | | nt || Neuter/Inanimate |
+ | | <code>nt</code> || Neuter/Inanimate || 3 |
|- |
|- |
||
− | | un || *Common and Neuter (Both animate and inanimate) |
+ | | <code>un</code> || *Common and Neuter (Both animate and inanimate) || 4 |
|} |
|} |
||
Line 101: | Line 136: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! Symbol !! Gloss !! Notes |
+ | ! Symbol !! Gloss !! Numeric Value !! Notes |
|- |
|- |
||
− | | pol || Polite |
+ | | <code>pol</code> || Polite || 2 |
|- |
|- |
||
− | | fam || Familiar |
+ | | <code>fam</code> || Familiar || 1 |
|- |
|- |
||
− | | infml || Informal |
+ | | <code>infml</code> || Informal || 0 |
|} |
|} |
||
Line 117: | Line 152: | ||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes |
||
|- |
|- |
||
+ | | <code>sint</code> || Synthetic |
||
− | | pve || Positive |
||
|- |
|- |
||
+ | | <code>psint</code> || Partially Synthetic |
||
⚫ | |||
|- |
|- |
||
⚫ | |||
⚫ | |||
+ | |- |
||
⚫ | |||
+ | |} |
||
+ | |||
+ | === Note: === |
||
+ | * We are not tagging positive degree as its the default. |
||
+ | |||
+ | == Determiner == |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Notes |
||
+ | |- |
||
+ | | <code>det</code> || Determiner |
||
+ | |- |
||
+ | | <code>ind</code> || Indefinite || eg. একটি, আরেকটি, অন্য |
||
+ | |- |
||
+ | | <code>pos</code> || Possessive || eg. তার, আমার |
||
+ | |- |
||
+ | | <code>dem</code> || Demonstrative || eg. এই, এইটা |
||
+ | |- |
||
+ | | <code>ord</code> || Ordinal || eg. প্রথম, দ্বিতীয়, তৃতীয় |
||
+ | |- |
||
+ | | <code>qnt</code> || Quantifier || eg. অনেক, বেশি |
||
+ | |- |
||
+ | | <code>itg</code> || Interrogative || eg. কী, কত |
||
+ | |- |
||
+ | | <code>pers</code> || Personal || eg. আমরা মানুষেরা - We people |
||
+ | |- |
||
+ | | <code>def</code> || Definite || Bengali does not have separate definite article, normally 'টা' is added to lemma |
||
|} |
|} |
||
Line 131: | Line 195: | ||
! Symbol !! Gloss !! Notes |
! Symbol !! Gloss !! Notes |
||
|- |
|- |
||
− | | ger || Gerund |
+ | | <code>ger</code> || Gerund || going -> যাওয়া |
|- |
|- |
||
− | | inf || Infinitive |
+ | | <code>inf</code> || Infinitive || to go -> যেতে |
|- |
|- |
||
− | | inf2 || Infinitive (Alternate)/Genitive Form || eg সেখানে যাবার জন্য - For going there |
+ | | <code>inf2</code> || Infinitive (Alternate)/Genitive Form || eg সেখানে যাবার জন্য - For going there |
|- |
|- |
||
− | | pressmpl || Present Simple |
+ | | <code>pressmpl</code> || Present Simple |
|- |
|- |
||
− | | prescnt || Present Continuous |
+ | | <code>prescnt</code> || Present Continuous |
|- |
|- |
||
− | | pastsmpl || Past Simple |
+ | | <code>pastsmpl</code> || Past Simple |
|- |
|- |
||
− | | pastcont || Past Continuous |
+ | | <code>pastcont</code> || Past Continuous |
|- |
|- |
||
− | | pasthbtl || Past Habitual |
+ | | <code>pasthbtl</code> || Past Habitual |
|- |
|- |
||
− | | pastcnd || Past Conditional |
+ | | <code>pastcnd</code> || Past Conditional |
|- |
|- |
||
− | | prft || Perfect |
+ | | <code>prft</code> || Perfect |
|- |
|- |
||
− | | plprft || Pluperfect |
+ | | <code>plprft</code> || Pluperfect |
|- |
|- |
||
− | | ftsmpl || Future Simple |
+ | | <code>ftsmpl</code> || Future Simple |
|- |
|- |
||
− | | ftcnt || Future Continuous |
+ | | <code>ftcnt</code> || Future Continuous |
|- |
|- |
||
− | | ppst || Past Participle |
+ | | <code>ppst</code> || Past Participle |
|- |
|- |
||
− | | pcnd || Conditional Participle |
+ | | <code>pcnd</code> || Conditional Participle |
|- |
|- |
||
− | | presimp || Present Imperative |
+ | | <code>presimp</code> || Present Imperative |
|- |
|- |
||
− | | ftimp || Future Imperative |
+ | | <code>ftimp</code> || Future Imperative |
|- |
|- |
||
− | | neg || Negative || সে যায় - He goes, সে যায়না - He does not go. |
+ | | <code>neg</code> || Negative || সে যায় - He goes, সে যায়না - He does not go. |
|} |
|} |
||
== **Noun == |
== **Noun == |
||
+ | |||
+ | === Speling Format Order === |
||
+ | |||
+ | [Stem]; [Inflection]; [animacy].[number].[case].[definite]; n.[gender] |
||
+ | |||
+ | eg. বালক; বালকটাকে; hu.sg.obj.def; n.m |
||
+ | |||
+ | == Proper nouns == |
||
+ | |||
+ | {|class=wikitable |
||
+ | ! Symbol !! Gloss !! Numeric Value !! Note |
||
+ | |- |
||
+ | | <code>ant</code> || Anthroponym || 0 || Human first names (e.g. John) |
||
+ | |- |
||
+ | | <code>top</code> || Toponym || 1 || Topological names (e.g. Dhaka, Baghdad, London) |
||
+ | |- |
||
+ | | <code>hyd</code> || Hydronym || 2 || Names of rivers etc. |
||
+ | |- |
||
+ | | <code>cog</code> || Cognomen || 3 || Second (or family) name (e.g. Smith) |
||
+ | |- |
||
+ | | <code>org</code> || Organisation || 4 || Organization (e.g. National Health Service) |
||
+ | |- |
||
+ | | <code>al</code> || Altres || 5 || Others (e.g. Brand names like Wikipedia, or Talc, etc.) |
||
+ | |} |
||
+ | |||
+ | === Note: === |
||
+ | * Cognomen Can be both singular and plural |
||
+ | * Anthroponyms are always singular |
||
+ | * Toponyms and Hydronym are always singular |
||
+ | {{comment|It is not necessary to separate out hydronyms unless they need special treatment (as in Polish) - [[User:Francis Tyers|Francis Tyers]] 07:21, 29 June 2009 (UTC)}} |
||
== Notes: == |
== Notes: == |
Latest revision as of 06:07, 17 June 2011
These symbols override anything from List_of_symbols
Categories[edit]
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
n |
Noun | 0 | |
np |
Proper noun | 1 | |
vblex |
Standard verb | 2 | |
adj |
Adjective | 3 | |
adv |
Adverb | 4 | |
det |
Determiner | 5 | |
prn |
Pronoun | 6 | |
pst |
Postposition | 7 | |
ij |
Interjection | 8 | |
cnjcoo |
Co-ordinating Conjunction | 9 | |
cnjsum |
Sub-ordinating Conjunction | 10, a | |
num |
Numeral | 11,b |
Pronouns[edit]
Pronoun Subcategory[edit]
Symbol | Gloss | Notes |
---|---|---|
Prpers |
Personal Pronoun | (use in the lemma field) |
dem |
Demonstrative Pronoun | |
rec |
Reciprocal Pronoun | |
ref |
Reflexive Pronoun | |
int |
Interrogative Pronoun | |
rel |
Relative Pronoun |
Person[edit]
Symbol | Gloss | Notes |
---|---|---|
p1 |
1st Person | |
p2 |
2nd Person | |
p3 |
3rd Person | |
all |
Applicable to all Persons | |
impers |
Impersonal | Used only in verbs eg. It rains - বৃষ্টি পড়ে |
Number[edit]
Symbol | Gloss | Notes |
---|---|---|
sg |
Singular | |
pl |
Plural | |
sp |
Singular and Plural |
Grammatical Case[edit]
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
nom |
Nominative Case | 0 | |
obj |
Objective Case | 1 | |
gen |
Genitive(possessive) Case | 2 | |
loc |
Locative Case | 3 |
Animacy[edit]
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
nn |
Inanimate | 0 | |
aa |
Animate | 1 | |
hu |
Human | 2 | |
el |
Elite | 3 | |
an |
Animate/Inanimate | 4 | |
ah |
Animate/Human | 5 | |
eh |
Elite/Human | 6 |
NOTE: hu
,el
and their hybrid are only used in case of Nouns
Gender[edit]
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
mf |
Male,Female | 0 | |
m |
Male | 1 | |
f |
Female | 2 | |
nt |
Neuter/Inanimate | 3 | |
un |
*Common and Neuter (Both animate and inanimate) | 4 |
Honorifics[edit]
Symbol | Gloss | Numeric Value | Notes |
---|---|---|---|
pol |
Polite | 2 | |
fam |
Familiar | 1 | |
infml |
Informal | 0 |
Adjective[edit]
Degree[edit]
Symbol | Gloss | Notes |
---|---|---|
sint |
Synthetic | |
psint |
Partially Synthetic | |
comp |
Comparative | |
sup |
Superlative |
Note:[edit]
- We are not tagging positive degree as its the default.
Determiner[edit]
Symbol | Gloss | Notes |
---|---|---|
det |
Determiner | |
ind |
Indefinite | eg. একটি, আরেকটি, অন্য |
pos |
Possessive | eg. তার, আমার |
dem |
Demonstrative | eg. এই, এইটা |
ord |
Ordinal | eg. প্রথম, দ্বিতীয়, তৃতীয় |
qnt |
Quantifier | eg. অনেক, বেশি |
itg |
Interrogative | eg. কী, কত |
pers |
Personal | eg. আমরা মানুষেরা - We people |
def |
Definite | Bengali does not have separate definite article, normally 'টা' is added to lemma |
Verb[edit]
Tense and Mood[edit]
Symbol | Gloss | Notes |
---|---|---|
ger |
Gerund | going -> যাওয়া |
inf |
Infinitive | to go -> যেতে |
inf2 |
Infinitive (Alternate)/Genitive Form | eg সেখানে যাবার জন্য - For going there |
pressmpl |
Present Simple | |
prescnt |
Present Continuous | |
pastsmpl |
Past Simple | |
pastcont |
Past Continuous | |
pasthbtl |
Past Habitual | |
pastcnd |
Past Conditional | |
prft |
Perfect | |
plprft |
Pluperfect | |
ftsmpl |
Future Simple | |
ftcnt |
Future Continuous | |
ppst |
Past Participle | |
pcnd |
Conditional Participle | |
presimp |
Present Imperative | |
ftimp |
Future Imperative | |
neg |
Negative | সে যায় - He goes, সে যায়না - He does not go. |
**Noun[edit]
Speling Format Order[edit]
[Stem]; [Inflection]; [animacy].[number].[case].[definite]; n.[gender]
eg. বালক; বালকটাকে; hu.sg.obj.def; n.m
Proper nouns[edit]
Symbol | Gloss | Numeric Value | Note |
---|---|---|---|
ant |
Anthroponym | 0 | Human first names (e.g. John) |
top |
Toponym | 1 | Topological names (e.g. Dhaka, Baghdad, London) |
hyd |
Hydronym | 2 | Names of rivers etc. |
cog |
Cognomen | 3 | Second (or family) name (e.g. Smith) |
org |
Organisation | 4 | Organization (e.g. National Health Service) |
al |
Altres | 5 | Others (e.g. Brand names like Wikipedia, or Talc, etc.) |
Note:[edit]
- Cognomen Can be both singular and plural
- Anthroponyms are always singular
- Toponyms and Hydronym are always singular
It is not necessary to separate out hydronyms unless they need special treatment (as in Polish) - Francis Tyers 07:21, 29 June 2009 (UTC)
Notes:[edit]
- *Final decision yet to be taken
- **Place holder