Difference between revisions of "Indonesian and Malaysian"
Jump to navigation
Jump to search
(→Tagset) |
(→Tagset) |
||
Line 1: | Line 1: | ||
==Tagset== |
==Tagset== |
||
Not fixed yet still under design |
|||
Original Part-of-Speech |
|||
{|class=wikitable |
{|class=wikitable |
||
! Tag || Description || |
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
||
|- |
|- |
||
| <code> |
| <code><adj></code> || adjective lemma || besar || besar<adj><positive> || big || |
||
|- |
|- |
||
| <code> |
| <code><n></code> || noun lemma || buku || buku<n><bare><sg> || book || |
||
|- |
|- |
||
| <code> |
| <code><num></code> || number lemma || tujuh || tujuh<num><car> || seven || |
||
|- |
|- |
||
| <code> |
| <code><prn></code> || pronoun || saya || saya<prn><p1><sg> || I, me, my (depending on the position) || |
||
|- |
|- |
||
| <code> |
| <code><det></code> || determiner || bukunya || buku<n><bare><sg><det> || can be 'his/her book' or 'the book' || |
||
|- |
|- |
||
| <code> |
| <code><cnjcoo></code> || coordinating conjunction || dan || dan<cnjcoo> || and || |
||
|- |
|- |
||
| <code> |
| <code><cnjsub></code> || subordinating conjunction || ketika || ketika<cnjsub> || when (not the WH question word) || |
||
|- |
|- |
||
| <code> |
| <code><v></code> || verb lemma || kirim || kirim<v><av><perf><sg> || deliver || |
||
|- |
|- |
||
| <code> |
| <code><par></code> || particle || lah || lah<par> || - || |
||
|- |
|- |
||
| <code> |
| <code><mod></code> || modal || akan || akan<mod> || will || |
||
|- |
|- |
||
| <code> |
| <code><inj></code> || interjection || ah || ah<inj> || - || |
||
|- |
|- |
||
| <code> |
| <code><qst></code> || question word || apa || apa<qst> || what || |
||
|- |
|- |
||
| <code><pr></code> || preposition lemma || di || di<pr> || in/on || double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below |
|||
| <code><pl></code> || plural || NUM || melompat-lompat - lompat<v><av><imp><pl> || jump repeatedly || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities |
|||
|- |
|- |
||
|} |
|||
| <code><car></code> || cardinal number || NUM || tujuh - tujuh<num><car> || seven || |
|||
Person |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|- |
||
| <code> |
| <code><p1></code> || first person || saya || saya<prn><p1><sg> || I, me, my (depending on the position) || Saya is the formal 'I'. There are plenty of unformal 'I' variations |
||
|- |
|- |
||
| <code> |
| <code><p2></code> || second person || anda || anda<prn><p2><sg> || You, your (depending on the position) || Anda is the formal 'You'. There are plenty of unformal 'You' variations |
||
|- |
|- |
||
| <code> |
| <code><p3></code> || third person || mereka || mereka<prn><p3><pl> || they, them, their (depending on the position) || |
||
|- |
|- |
||
|} |
|||
| <code><vbser></code> || verb ‘to be’ || VERB || || || |
|||
Number |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|- |
||
| <code> |
| <code><sg></code> || singular || buku || buku<n><bare><sg> || book || |
||
|- |
|- |
||
| |
| || || melompat || lompat<v><av><imp><sg> || jump || |
||
|- |
|- |
||
| <code><pl></code> || plural || buku-buku || buku<n><bare><pl> || books || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities |
|||
| <code><perf></code> || perfective aspect || ASPECT || || || |
|||
|- |
|- |
||
| || || melompat-lompat || lompat<v><av><imp><pl> || jump repeatedly || |
|||
| <code><imp></code> || imperfective aspect || ASPECT || || || |
|||
|- |
|- |
||
|} |
|||
| <code><ent></code> || derived entity noun || NOUN || kiriman - kirim<v><ent><sg> || package || |
|||
Derivation Mark (not yet added/implemented) |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|- |
||
| <code> |
| <code><der_n></code> || || || || || |
||
|- |
|- |
||
| <code> |
| <code><der_adj></code> || || || || || |
||
|- |
|- |
||
| <code>< |
| <code><der_v></code> || || || || || |
||
|- |
|- |
||
| <code><der_num></code> || || || || || |
|||
| <code><enc></code> || enclitic || CLITIC || bukuku/buku<n><sg>+aku<enc><p1> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun |
|||
|- |
|||
|} |
|||
Numeral Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><car></code> || cardinal number || tujuh || tujuh<num><car> || seven || |
|||
|- |
|||
| <code><ord></code> || ordinal number || ketujuh || tujuh<num><ord> || the seventh || |
|||
|- |
|||
| <code><coll></code> || collective number || bertujuh || tujuh<num><coll> || in group of seven || |
|||
|- |
|||
| <code><ref></code> || referential number || tujuh-tujuhnya || tujuh<num><ref> || all the seven of them || |
|||
|- |
|||
| || || ketujuh-tujuhnya || tujuh<num><ref> || all the seven of them || |
|||
|- |
|||
|} |
|||
Verb Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><vbhaver></code> || verb ‘to have’ || berpayung || payung<n><av><perf><vbhaver><bare><sg> || having umbrella(payung) || |
|||
|- |
|||
| <code><vbser></code> || verb ‘to be’ || perbesar || besar<adj><av><perf><vbser><sg> || making something to be big || |
|||
|- |
|||
|} |
|||
Voice Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><av></code> || active voice || mengirim || kirim<v><av><imp><sg> || send || |
|||
|- |
|||
| <code><pv></code> || passive voice || dikirim || kirim<v><pv><imp><sg> || being sent || |
|||
|- |
|||
|} |
|||
Aspect Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><perf></code> || perfective aspect || kirim || kirim<v><av><perf><sg> || send || |
|||
|- |
|||
| <code><imp></code> || imperfective aspect || mengirim || kirim<v><av><imp><sg> || send || |
|||
|- |
|||
|} |
|||
Noun Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><bare></code> || original bare noun type || buku || buku<n><bare><sg> || book || |
|||
|- |
|||
| <code><abstract></code> || derived abstract type || kedinginan || dingin<adj><abstract><sg> || coldness || |
|||
|- |
|||
| <code><act></code> || derived act noun || pengiriman || kirim<v><act><sg> || delivery || |
|||
|- |
|||
| || || pembukuan || buku<n><act><sg> || archiving || |
|||
|- |
|||
| <code><actor></code> || derived actor noun || pengirim || kirim<v><actor><sg> || deliverer || |
|||
|- |
|||
| <code><ent></code> || derived entity noun || kiriman || kirim<v><ent><sg> || package || |
|||
|- |
|||
| <code><theme></code> || derived theme noun || pertokoan || toko<n><theme><sg> || something related to shop || |
|||
|- |
|||
|} |
|||
Adjective Type |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><positive></code> || original bare adjective type || besar || besar<adj><bare> || big || |
|||
|- |
|||
| <code><sup></code> || superlative adjective || terbesar || besar<adj><sup> || the biggest || |
|||
|- |
|||
| <code><exceed></code> || exceed adjective || kebesaran || besar<adj><exceed> || too big || equally grammatical with "terlalu besar" |
|||
|- |
|||
| <code><manner></code> || manner adjective || keibuan || ibu<n><manner> || motherly || |
|||
|- |
|||
| <code><uni></code> || union adjective || seibu || ibu<n><bare><uni><sg> || same/one mother || |
|||
|- |
|||
| <code><possib></code> || adjectival phrase || secepat-cepatnya || cepat<adj><possib> || as quick as possible || |
|||
|- |
|||
| || || sebesar-besarnya || besar<adj><possib> || as big as possible || |
|||
|- |
|||
|} |
|||
Clitic Mark |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><enc></code> || enclitic || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun |
|||
|- |
|||
| <code><pro></code> || proclitic || kukirim || aku<prn><pro><p1><sg>+kirim<v><av><perf><sg> || I send || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence |
|||
|- |
|||
|} |
|||
Transitivity Mark |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|||
| <code><appl></code> || Applicative || kirimi || kirim<v><av><perf><appl><sg> || send || |
|||
|- |
|||
| <code><caus></code> || Causative || kirimkan || kirim<v><av><perf><caus><sg> || send || |
|||
|- |
|||
|} |
|||
Others |
|||
{|class=wikitable |
|||
! Tag || Description || Example || Example's Analysis || Translation in English || Notes |
|||
|- |
|- |
||
| <code><cap></code> || capitalized word mark || Buku || buku<n><bare><sg><cap> || Book || Capitalized word |
|||
| <code><pro></code> || proclitic || CLITIC || kucari - aku<pro><p1>+cari<v><av><perf>|| I look (for something) || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence |
|||
|- |
|- |
||
| <code><pos></code> || possesive || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun |
|||
| <code><cap></code> || capitalization mark|| MARK || Kirim - kirim<v><av><perf><cap>? || ''see above <v>'' || |
|||
|- |
|- |
||
|} |
|} |
Revision as of 17:13, 19 June 2010
Tagset
Original Part-of-Speech
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<adj> |
adjective lemma | besar | besar<adj><positive> | big | |
<n> |
noun lemma | buku | buku<n><bare><sg> | book | |
<num> |
number lemma | tujuh | tujuh<num><car> | seven | |
<prn> |
pronoun | saya | saya<prn><p1><sg> | I, me, my (depending on the position) | |
<det> |
determiner | bukunya | buku<n><bare><sg><det> | can be 'his/her book' or 'the book' | |
<cnjcoo> |
coordinating conjunction | dan | dan<cnjcoo> | and | |
<cnjsub> |
subordinating conjunction | ketika | ketika<cnjsub> | when (not the WH question word) | |
<v> |
verb lemma | kirim | kirim<v><av><perf><sg> | deliver | |
<par> |
particle | lah | lah<par> | - | |
<mod> |
modal | akan | akan<mod> | will | |
<inj> |
interjection | ah | ah<inj> | - | |
<qst> |
question word | apa | apa<qst> | what | |
<pr> |
preposition lemma | di | di<pr> | in/on | double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below |
Person
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<p1> |
first person | saya | saya<prn><p1><sg> | I, me, my (depending on the position) | Saya is the formal 'I'. There are plenty of unformal 'I' variations |
<p2> |
second person | anda | anda<prn><p2><sg> | You, your (depending on the position) | Anda is the formal 'You'. There are plenty of unformal 'You' variations |
<p3> |
third person | mereka | mereka<prn><p3><pl> | they, them, their (depending on the position) |
Number
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<sg> |
singular | buku | buku<n><bare><sg> | book | |
melompat | lompat<v><av><imp><sg> | jump | |||
<pl> |
plural | buku-buku | buku<n><bare><pl> | books | this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities |
melompat-lompat | lompat<v><av><imp><pl> | jump repeatedly |
Derivation Mark (not yet added/implemented)
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<der_n> |
|||||
<der_adj> |
|||||
<der_v> |
|||||
<der_num> |
Numeral Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<car> |
cardinal number | tujuh | tujuh<num><car> | seven | |
<ord> |
ordinal number | ketujuh | tujuh<num><ord> | the seventh | |
<coll> |
collective number | bertujuh | tujuh<num><coll> | in group of seven | |
<ref> |
referential number | tujuh-tujuhnya | tujuh<num><ref> | all the seven of them | |
ketujuh-tujuhnya | tujuh<num><ref> | all the seven of them |
Verb Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<vbhaver> |
verb ‘to have’ | berpayung | payung<n><av><perf><vbhaver><bare><sg> | having umbrella(payung) | |
<vbser> |
verb ‘to be’ | perbesar | besar<adj><av><perf><vbser><sg> | making something to be big |
Voice Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<av> |
active voice | mengirim | kirim<v><av><imp><sg> | send | |
<pv> |
passive voice | dikirim | kirim<v><pv><imp><sg> | being sent |
Aspect Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<perf> |
perfective aspect | kirim | kirim<v><av><perf><sg> | send | |
<imp> |
imperfective aspect | mengirim | kirim<v><av><imp><sg> | send |
Noun Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<bare> |
original bare noun type | buku | buku<n><bare><sg> | book | |
<abstract> |
derived abstract type | kedinginan | dingin<adj><abstract><sg> | coldness | |
<act> |
derived act noun | pengiriman | kirim<v><act><sg> | delivery | |
pembukuan | buku<n><act><sg> | archiving | |||
<actor> |
derived actor noun | pengirim | kirim<v><actor><sg> | deliverer | |
<ent> |
derived entity noun | kiriman | kirim<v><ent><sg> | package | |
<theme> |
derived theme noun | pertokoan | toko<n><theme><sg> | something related to shop |
Adjective Type
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<positive> |
original bare adjective type | besar | besar<adj><bare> | big | |
<sup> |
superlative adjective | terbesar | besar<adj><sup> | the biggest | |
<exceed> |
exceed adjective | kebesaran | besar<adj><exceed> | too big | equally grammatical with "terlalu besar" |
<manner> |
manner adjective | keibuan | ibu<n><manner> | motherly | |
<uni> |
union adjective | seibu | ibu<n><bare><uni><sg> | same/one mother | |
<possib> |
adjectival phrase | secepat-cepatnya | cepat<adj><possib> | as quick as possible | |
sebesar-besarnya | besar<adj><possib> | as big as possible |
Clitic Mark
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<enc> |
enclitic | bukuku | buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> | my book | The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun |
<pro> |
proclitic | kukirim | aku<prn><pro><p1><sg>+kirim<v><av><perf><sg> | I send | The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence |
Transitivity Mark
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<appl> |
Applicative | kirimi | kirim<v><av><perf><appl><sg> | send | |
<caus> |
Causative | kirimkan | kirim<v><av><perf><caus><sg> | send |
Others
Tag | Description | Example | Example's Analysis | Translation in English | Notes |
---|---|---|---|---|---|
<cap> |
capitalized word mark | Buku | buku<n><bare><sg><cap> | Book | Capitalized word |
<pos> |
possesive | bukuku | buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> | my book | The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun |