Difference between revisions of "Bengali and English/TagSets"
Darthxaher (talk | contribs) |
(→Determiner: Its dem not demo) |
||
(82 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
{{TOCD}} |
{{TOCD}} |
||
+ | |||
+ | These symbols override anything from [[List_of_symbols]] |
||
+ | |||
+ | == Categories == |
||
+ | |||
+ | {|class=wikitable |
||
+ | ! Symbol !! Gloss !! Numeric value !! Notes |
||
+ | |- |
||
+ | | <code>n</code> || Noun || 0 || |
||
+ | |- |
||
+ | | <code>np</code> || Proper noun || 1 || |
||
+ | |- |
||
+ | | <code>vblex</code> || Standard verb || 2 || |
||
+ | |- |
||
+ | | <code>adj</code> || Adjective || 3 || |
||
+ | |- |
||
+ | | <code>adv</code> || Adverb || 4 || |
||
+ | |- |
||
+ | | <code>det</code> || Determiner || 5 || |
||
+ | |- |
||
+ | | <code>prn</code> || Pronoun || 6 || |
||
+ | |- |
||
+ | | <code>pst</code> || Postposition || 7 || |
||
+ | |- |
||
+ | | <code>ij</code> || Interjection || 8 || |
||
+ | |- |
||
+ | | <code>cnjcoo</code> || Co-ordinating Conjunction || 9 || |
||
+ | |- |
||
+ | | <code>cnjsum</code> || Sub-ordinating Conjunction || 10, a || |
||
+ | |- |
||
+ | | <code>num</code> || Numeral || 11,b || |
||
+ | |- |
||
+ | |} |
||
== Pronouns == |
== Pronouns == |
||
+ | |||
+ | === Pronoun Subcategory === |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Notes |
||
+ | |- |
||
+ | | <code>Prpers</code> || Personal Pronoun || (use in the lemma field) |
||
+ | |- |
||
+ | | <code>dem</code> || Demonstrative Pronoun |
||
+ | |- |
||
+ | | <code>rec</code> || Reciprocal Pronoun |
||
+ | |- |
||
+ | | <code>ref</code> || Reflexive Pronoun |
||
+ | |- |
||
+ | | <code>int</code> || Interrogative Pronoun |
||
+ | |- |
||
+ | | <code>rel</code> || Relative Pronoun |
||
+ | |} |
||
=== Person === |
=== Person === |
||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! |
+ | ! Symbol !! Gloss !! Notes |
|- |
|- |
||
− | | p1 || 1st Person |
+ | | <code>p1</code> || 1st Person |
|- |
|- |
||
− | | p2 || 2nd Person |
+ | | <code>p2</code> || 2nd Person |
|- |
|- |
||
− | | p3 || 3rd Person |
+ | | <code>p3</code> || 3rd Person |
|- |
|- |
||
− | | all || Applicable to all Persons |
+ | | <code>all</code> || Applicable to all Persons |
+ | |- |
||
+ | | <code>impers</code> || Impersonal || Used only with verbs, e.g. It rains - বৃষ্টি পড়ে |
||
|} |
|} |
||
Line 19: | Line 72: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
− | ! |
+ | ! Symbol !! Gloss !! Notes |
|- |
|- |
||
− | | sg || Singular |
+ | | <code>sg</code> || Singular |
|- |
|- |
||
− | | pl || Plural |
+ | | <code>pl</code> || Plural |
|- |
|- |
||
− | | sp || Singular and Plural |
+ | | <code>sp</code> || Singular and Plural |
|} |
|} |
||
Line 31: | Line 84: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Numeric value !!Notes |
||
− | ! Tag !! Meaning |
||
|- |
|- |
||
− | | nom || |
+ | | <code>nom</code> || Nominative Case || 0 |
|- |
|- |
||
− | | obj || Objective Case |
+ | | <code>obj</code> || Objective Case || 1 |
|- |
|- |
||
− | | gen || Genitive(possessive) Case |
+ | | <code>gen</code> || Genitive(possessive) Case || 2 |
|- |
|- |
||
− | | loc || Locative Case |
+ | | <code>loc</code> || Locative Case || 3 |
|} |
|} |
||
Line 45: | Line 98: | ||
{| class="wikitable" border="1" |
{| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Numeric value || Notes |
||
− | ! Tag !! Meaning |
||
|- |
|- |
||
+ | | <code>nn</code> || Inanimate || 0 |
||
− | | an || Animate |
||
+ | |- |
||
+ | | <code>aa</code> || Animate || 1 |
||
|- |
|- |
||
+ | | <code>hu</code> || Human || 2 |
||
− | | aa || Animate/Inanimate |
||
|- |
|- |
||
+ | | <code>el</code> || Elite || 3 |
||
− | | nn || Inanimate |
||
+ | |- |
||
+ | | <code>an</code> || Animate/Inanimate || 4 |
||
+ | |- |
||
+ | | <code>ah</code> || Animate/Human || 5 |
||
+ | |- |
||
+ | | <code>eh</code> || Elite/Human || 6 |
||
|} |
|} |
||
+ | |||
+ | '''NOTE:''' '''<code>hu</code>''' ,'''<code>el</code>''' and their hybrid are only used in case of Nouns |
||
+ | |||
+ | === Gender === |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Numeric value || Notes |
||
+ | |- |
||
+ | | <code>mf</code> || Male,Female || 0 |
||
+ | |- |
||
+ | | <code>m</code> || Male || 1 |
||
+ | |- |
||
+ | | <code>f</code> || Female || 2 |
||
+ | |- |
||
+ | | <code>nt</code> || Neuter/Inanimate || 3 |
||
+ | |- |
||
+ | | <code>un</code> || *Common and Neuter (Both animate and inanimate) || 4 |
||
+ | |} |
||
+ | |||
+ | === Honorifics === |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Numeric Value !! Notes |
||
+ | |- |
||
+ | | <code>pol</code> || Polite || 2 |
||
+ | |- |
||
+ | | <code>fam</code> || Familiar || 1 |
||
+ | |- |
||
+ | | <code>infml</code> || Informal || 0 |
||
+ | |} |
||
+ | |||
+ | == Adjective == |
||
+ | |||
+ | === Degree === |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Notes |
||
+ | |- |
||
+ | | <code>sint</code> || Synthetic |
||
+ | |- |
||
+ | | <code>psint</code> || Partially Synthetic |
||
+ | |- |
||
+ | | <code>comp</code> || Comparative |
||
+ | |- |
||
+ | | <code>sup</code> || Superlative |
||
+ | |} |
||
+ | |||
+ | === Note: === |
||
+ | * We are not tagging positive degree, as it is the default. |
||
+ | |||
+ | == Determiner == |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Notes |
||
+ | |- |
||
+ | | <code>det</code> || Determiner |
||
+ | |- |
||
+ | | <code>ind</code> || Indefinite || eg. একটি, আরেকটি, অন্য |
||
+ | |- |
||
+ | | <code>pos</code> || Possessive || eg. তার, আমার |
||
+ | |- |
||
+ | | <code>dem</code> || Demonstrative || eg. এই, এইটা |
||
+ | |- |
||
+ | | <code>ord</code> || Ordinal || eg. প্রথম, দ্বিতীয়, তৃতীয় |
||
+ | |- |
||
+ | | <code>qnt</code> || Quantifier || eg. অনেক, বেশি |
||
+ | |- |
||
+ | | <code>itg</code> || Interrogative || eg. কী, কত |
||
+ | |- |
||
+ | | <code>pers</code> || Personal || eg. আমরা মানুষেরা - We people |
||
+ | |- |
||
+ | | <code>def</code> || Definite || Bengali has no separate definite article; normally 'টা' is added to the lemma |
||
+ | |} |
||
+ | |||
+ | == Verb == |
||
+ | |||
+ | === Tense and Mood === |
||
+ | |||
+ | {| class="wikitable" border="1" |
||
+ | ! Symbol !! Gloss !! Notes |
||
+ | |- |
||
+ | | <code>ger</code> || Gerund || going -> যাওয়া |
||
+ | |- |
||
+ | | <code>inf</code> || Infinitive || to go -> যেতে |
||
+ | |- |
||
+ | | <code>inf2</code> || Infinitive (Alternate)/Genitive Form || eg সেখানে যাবার জন্য - For going there |
||
+ | |- |
||
+ | | <code>pressmpl</code> || Present Simple |
||
+ | |- |
||
+ | | <code>prescnt</code> || Present Continuous |
||
+ | |- |
||
+ | | <code>pastsmpl</code> || Past Simple |
||
+ | |- |
||
+ | | <code>pastcont</code> || Past Continuous |
||
+ | |- |
||
+ | | <code>pasthbtl</code> || Past Habitual |
||
+ | |- |
||
+ | | <code>pastcnd</code> || Past Conditional |
||
+ | |- |
||
+ | | <code>prft</code> || Perfect |
||
+ | |- |
||
+ | | <code>plprft</code> || Pluperfect |
||
+ | |- |
||
+ | | <code>ftsmpl</code> || Future Simple |
||
+ | |- |
||
+ | | <code>ftcnt</code> || Future Continuous |
||
+ | |- |
||
+ | | <code>ppst</code> || Past Participle |
||
+ | |- |
||
+ | | <code>pcnd</code> || Conditional Participle |
||
+ | |- |
||
+ | | <code>presimp</code> || Present Imperative |
||
+ | |- |
||
+ | | <code>ftimp</code> || Future Imperative |
||
+ | |- |
||
+ | | <code>neg</code> || Negative || সে যায় - He goes, সে যায়না - He does not go. |
||
+ | |||
+ | |} |
||
+ | |||
+ | == **Noun == |
||
+ | |||
+ | === Speling Format Order === |
||
+ | |||
+ | [Stem]; [Inflection]; [animacy].[number].[case].[definite]; n.[gender] |
||
+ | |||
+ | eg. বালক; বালকটাকে; hu.sg.obj.def; n.m |
||
+ | |||
+ | == Proper nouns == |
||
+ | |||
+ | {|class=wikitable |
||
+ | ! Symbol !! Gloss !! Numeric Value !! Note |
||
+ | |- |
||
+ | | <code>ant</code> || Anthroponym || 0 || Human first names (e.g. John) |
||
+ | |- |
||
+ | | <code>top</code> || Toponym || 1 || Place names (e.g. Dhaka, Baghdad, London) |
||
+ | |- |
||
+ | | <code>hyd</code> || Hydronym || 2 || Names of rivers etc. |
||
+ | |- |
||
+ | | <code>cog</code> || Cognomen || 3 || Second (or family) name (e.g. Smith) |
||
+ | |- |
||
+ | | <code>org</code> || Organisation || 4 || Organization (e.g. National Health Service) |
||
+ | |- |
||
+ | | <code>al</code> || Altres || 5 || Others (e.g. Brand names like Wikipedia, or Talc, etc.) |
||
+ | |} |
||
+ | |||
+ | === Note: === |
||
+ | * Cognomens can be both singular and plural |
||
+ | * Anthroponyms are always singular |
||
+ | * Toponyms and hydronyms are always singular |
||
+ | |||
+ | |||
+ | {{comment|It is not necessary to separate out hydronyms unless they need special treatment (as in Polish) - [[User:Francis Tyers|Francis Tyers]] 07:21, 29 June 2009 (UTC)}} |
||
+ | |||
+ | == Notes: == |
||
+ | * *Final decision yet to be taken |
||
+ | * **Place holder |
Latest revision as of 06:07, 17 June 2011
These symbols override anything from List_of_symbols
Categories
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`n` | Noun | 0 | |
`np` | Proper noun | 1 | |
`vblex` | Standard verb | 2 | |
`adj` | Adjective | 3 | |
`adv` | Adverb | 4 | |
`det` | Determiner | 5 | |
`prn` | Pronoun | 6 | |
`pst` | Postposition | 7 | |
`ij` | Interjection | 8 | |
`cnjcoo` | Co-ordinating Conjunction | 9 | |
`cnjsum` | Sub-ordinating Conjunction | 10, a | |
`num` | Numeral | 11, b | |
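The numeric values in the Categories table can be kept in a simple lookup. The following is a minimal Python sketch, assuming that the letters in `10, a` and `11, b` are the single-digit hexadecimal forms of 10 and 11 (the page does not state this explicitly). The names `CATEGORY_VALUES` and `category_code` are hypothetical and not part of any existing tool.

```python
# Category symbols and numeric values, copied from the Categories table.
CATEGORY_VALUES = {
    "n": 0, "np": 1, "vblex": 2, "adj": 3, "adv": 4, "det": 5,
    "prn": 6, "pst": 7, "ij": 8, "cnjcoo": 9, "cnjsum": 10, "num": 11,
}


def category_code(symbol: str) -> str:
    """Return the category value as one lowercase hex digit.

    Assumption: 'a' and 'b' in the table are hex digits for 10 and 11.
    Raises KeyError for a symbol that is not in the table.
    """
    return format(CATEGORY_VALUES[symbol], "x")
```

Under that assumption, every category fits in a single character, which may be why the table lists both forms for the last two rows.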
Pronouns
Pronoun Subcategory
Symbol | Gloss | Notes |
---|---|---|
`Prpers` | Personal Pronoun | (use in the lemma field) |
`dem` | Demonstrative Pronoun | |
`rec` | Reciprocal Pronoun | |
`ref` | Reflexive Pronoun | |
`int` | Interrogative Pronoun | |
`rel` | Relative Pronoun | |
Person
Symbol | Gloss | Notes |
---|---|---|
`p1` | 1st Person | |
`p2` | 2nd Person | |
`p3` | 3rd Person | |
`all` | Applicable to all Persons | |
`impers` | Impersonal | Used only with verbs, e.g. It rains - বৃষ্টি পড়ে |
Number
Symbol | Gloss | Notes |
---|---|---|
`sg` | Singular | |
`pl` | Plural | |
`sp` | Singular and Plural | |
Grammatical Case
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`nom` | Nominative Case | 0 | |
`obj` | Objective Case | 1 | |
`gen` | Genitive (possessive) Case | 2 | |
`loc` | Locative Case | 3 | |
Animacy
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`nn` | Inanimate | 0 | |
`aa` | Animate | 1 | |
`hu` | Human | 2 | |
`el` | Elite | 3 | |
`an` | Animate/Inanimate | 4 | |
`ah` | Animate/Human | 5 | |
`eh` | Elite/Human | 6 | |
NOTE: `hu`, `el` and their hybrid are used only for nouns.
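The noun-only restriction in the note can be checked mechanically. The following is a minimal Python sketch, assuming that "their hybrid" means `eh` (Elite/Human) and that "nouns" means the category `n` only; the page does not say whether `ah` or proper nouns (`np`) are also covered. The names `ANIMACY_VALUES`, `NOUN_ONLY_ANIMACY` and `animacy_allowed` are hypothetical.

```python
# Animacy symbols and numeric values, copied from the Animacy table.
ANIMACY_VALUES = {"nn": 0, "aa": 1, "hu": 2, "el": 3, "an": 4, "ah": 5, "eh": 6}

# Assumption: the hybrid of hu and el is eh; ah is left unrestricted.
NOUN_ONLY_ANIMACY = {"hu", "el", "eh"}


def animacy_allowed(animacy: str, category: str) -> bool:
    """Return True if the animacy tag may be used with the given category."""
    if animacy not in ANIMACY_VALUES:
        raise ValueError(f"unknown animacy tag: {animacy!r}")
    if animacy in NOUN_ONLY_ANIMACY:
        # Assumption: only the plain noun category 'n' qualifies.
        return category == "n"
    return True
```

For example, `animacy_allowed("hu", "prn")` is rejected, while `animacy_allowed("aa", "prn")` is accepted.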
Gender
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`mf` | Male, Female | 0 | |
`m` | Male | 1 | |
`f` | Female | 2 | |
`nt` | Neuter/Inanimate | 3 | |
`un` | *Common and Neuter (both animate and inanimate) | 4 | |
Honorifics
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`pol` | Polite | 2 | |
`fam` | Familiar | 1 | |
`infml` | Informal | 0 | |
Adjective
Degree
Symbol | Gloss | Notes |
---|---|---|
`sint` | Synthetic | |
`psint` | Partially Synthetic | |
`comp` | Comparative | |
`sup` | Superlative | |
Note:
- We are not tagging positive degree, as it is the default.
Determiner
Symbol | Gloss | Notes |
---|---|---|
`det` | Determiner | |
`ind` | Indefinite | e.g. একটি, আরেকটি, অন্য |
`pos` | Possessive | e.g. তার, আমার |
`dem` | Demonstrative | e.g. এই, এইটা |
`ord` | Ordinal | e.g. প্রথম, দ্বিতীয়, তৃতীয় |
`qnt` | Quantifier | e.g. অনেক, বেশি |
`itg` | Interrogative | e.g. কী, কত |
`pers` | Personal | e.g. আমরা মানুষেরা - We people |
`def` | Definite | Bengali has no separate definite article; normally 'টা' is added to the lemma |
Verb
Tense and Mood
Symbol | Gloss | Notes |
---|---|---|
`ger` | Gerund | going -> যাওয়া |
`inf` | Infinitive | to go -> যেতে |
`inf2` | Infinitive (Alternate)/Genitive Form | e.g. সেখানে যাবার জন্য - For going there |
`pressmpl` | Present Simple | |
`prescnt` | Present Continuous | |
`pastsmpl` | Past Simple | |
`pastcont` | Past Continuous | |
`pasthbtl` | Past Habitual | |
`pastcnd` | Past Conditional | |
`prft` | Perfect | |
`plprft` | Pluperfect | |
`ftsmpl` | Future Simple | |
`ftcnt` | Future Continuous | |
`ppst` | Past Participle | |
`pcnd` | Conditional Participle | |
`presimp` | Present Imperative | |
`ftimp` | Future Imperative | |
`neg` | Negative | সে যায় - He goes, সে যায়না - He does not go. |
**Noun
Speling Format Order
[Stem]; [Inflection]; [animacy].[number].[case].[definite]; n.[gender]
e.g. বালক; বালকটাকে; hu.sg.obj.def; n.m
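The field order above can be read back with a small parser. The following is a minimal Python sketch, assuming that fields are separated by semicolons, that tags within a field are separated by dots, and that the `def` tag is simply left out when the form is not definite (the page does not say how indefinite forms are written). `NounEntry` and `parse_noun_line` are hypothetical names.

```python
from typing import NamedTuple


class NounEntry(NamedTuple):
    stem: str
    inflection: str
    animacy: str
    number: str
    case: str
    definite: bool
    gender: str


def parse_noun_line(line: str) -> NounEntry:
    """Parse '[Stem]; [Inflection]; [animacy].[number].[case].[definite]; n.[gender]'."""
    parts = [part.strip() for part in line.split(";")]
    if len(parts) != 4:
        raise ValueError(f"expected 4 ';'-separated fields, got {len(parts)}")
    stem, inflection, features, pos = parts

    tags = features.split(".")
    if len(tags) not in (3, 4):
        raise ValueError(f"expected 3 or 4 feature tags, got {len(tags)}")
    animacy, number, case = tags[:3]
    # Assumption: a missing fourth tag means the form is not definite.
    definite = len(tags) == 4 and tags[3] == "def"

    category, _, gender = pos.partition(".")
    if category != "n":
        raise ValueError(f"expected category 'n', got {category!r}")
    return NounEntry(stem, inflection, animacy, number, case, definite, gender)
```

Applied to the example line, this yields the stem বালক, the inflected form বালকটাকে, the tags `hu`, `sg`, `obj`, a definite flag, and the gender `m`.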
Proper nouns
Symbol | Gloss | Numeric value | Notes |
---|---|---|---|
`ant` | Anthroponym | 0 | Human first names (e.g. John) |
`top` | Toponym | 1 | Place names (e.g. Dhaka, Baghdad, London) |
`hyd` | Hydronym | 2 | Names of rivers etc. |
`cog` | Cognomen | 3 | Second (or family) name (e.g. Smith) |
`org` | Organisation | 4 | Organisation (e.g. National Health Service) |
`al` | Altres | 5 | Others (e.g. brand names like Wikipedia, or Talc, etc.) |
Note:
- Cognomens can be both singular and plural
- Anthroponyms are always singular
- Toponyms and hydronyms are always singular
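The number constraints in this note can be expressed as a lookup. The following is a minimal Python sketch, assuming the number tags `sg` and `pl` from the Number table and assuming that `org` and `al`, which the note does not mention, are unconstrained. `ALLOWED_NUMBERS` and `number_allowed` are hypothetical names.

```python
# Number tags permitted for each proper-noun subtype, per the note.
ALLOWED_NUMBERS = {
    "ant": {"sg"},        # anthroponyms are always singular
    "top": {"sg"},        # toponyms are always singular
    "hyd": {"sg"},        # hydronyms are always singular
    "cog": {"sg", "pl"},  # cognomens can be singular or plural
}


def number_allowed(subtype: str, number: str) -> bool:
    """Return True if the number tag is permitted for the subtype."""
    allowed = ALLOWED_NUMBERS.get(subtype)
    if allowed is None:
        # Assumption: subtypes not covered by the note are unconstrained.
        return True
    return number in allowed
```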
It is not necessary to separate out hydronyms unless they need special treatment (as in Polish) - Francis Tyers 07:21, 29 June 2009 (UTC)
Notes:
- * Final decision yet to be taken
- ** Placeholder