Difference between revisions of "Indonesian and Malaysian"

From Apertium
Jump to navigation Jump to search
Line 1: Line 1:
==Tagset==
==Tagset==
Not fixed yet still under design


Original Part-of-Speech
{|class=wikitable
{|class=wikitable
! Tag || Description || Tag Type || Example || Meaning || Notes
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
|-
| <code><prn></code> || pronoun || POS || saya - saya<prn><p1><sg> || I, me, my (depending on the position) ||
| <code>&lt;adj&gt;</code> || adjective lemma || besar || besar<adj><positive> || big ||
|-
|-
| <code><v></code> || verb lemma || POS || kirim - kirim<v><av><perf>? || deliver ||
| <code>&lt;n&gt;</code> || noun lemma || buku || buku<n><bare><sg> || book ||
|-
|-
| <code><n></code> || noun lemma || POS || pesan-pesan - pesan<n><pl> || messages ||
| <code>&lt;num&gt;</code> || number lemma || tujuh || tujuh<num><car> || seven ||
|-
|-
| <code><adj></code> || adjective lemma || POS || tinggi - tinggi<adj> || tall, high ||
| <code>&lt;prn&gt;</code> || pronoun || saya || saya<prn><p1><sg> || I, me, my (depending on the position) ||
|-
|-
| <code><num></code> || number lemma || POS || tujuh - tujuh<num><car> || seven ||
| <code>&lt;det&gt;</code> || determiner || bukunya || buku<n><bare><sg><det> || can be 'his/her book' or 'the book' ||
|-
|-
| <code><per></code> || preposition lemma || POS || dalam - dalam<per> || in ||
| <code>&lt;cnjcoo&gt;</code> || coordinating conjunction || dan || dan<cnjcoo> || and ||
|-
|-
| <code><cnjsub></code> || subordinating conjunction lemma || POS || dan - dan<cnjsub> || and ||
| <code>&lt;cnjsub&gt;</code> || subordinating conjunction || ketika || ketika<cnjsub> || when (not the WH question word) ||
|-
|-
| <code><cnjcoo></code> || coordinating conjunction lemma || POS || ketika - ketika<cnjcoo> || when (not the WH question word) ||
| <code>&lt;v&gt;</code> || verb lemma || kirim || kirim<v><av><perf><sg> || deliver ||
|-
|-
| <code><p1></code> || first person || PERSON || saya - saya<prn><p1><sg> || I, me, my (depending on the position) ||
| <code>&lt;par&gt;</code> || particle || lah || lah<par> || - ||
|-
|-
| <code><p2></code> || second person || PERSON || anda - anda<prn><p2><sg> || You, your (depending on the position) ||
| <code>&lt;mod&gt;</code> || modal || akan || akan<mod> || will ||
|-
|-
| <code><p3></code> || third person || PERSON || mereka - mereka<prn><p3><pl> || they, them, their (depending on the position) ||
| <code>&lt;inj&gt;</code> || interjection || ah || ah<inj> || - ||
|-
|-
| <code><sg></code> || singular || NUM || melompat - lompat<v><av><imp><sg> || jump || this number plurality feature not only occurs in nouns
| <code>&lt;qst&gt;</code> || question word || apa || apa<qst> || what ||
|-
|-
| <code>&lt;pr&gt;</code> || preposition lemma || di || di<pr> || in/on || double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below
| <code><pl></code> || plural || NUM || melompat-lompat - lompat<v><av><imp><pl> || jump repeatedly || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
|-
|-
|}
| <code><car></code> || cardinal number || NUM || tujuh - tujuh<num><car> || seven ||


Person
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
|-
| <code><ord></code> || ordinal number || NUM || ketujuh - tujuh<num><ord> || the seventh ||
| <code>&lt;p1&gt;</code> || first person || saya || saya<prn><p1><sg> || I, me, my (depending on the position) || Saya is the formal 'I'. There are plenty of unformal 'I' variations
|-
|-
| <code><coll></code> || collective number || NUM || bertujuh - tujuh<num><coll> || in group of seven ||
| <code>&lt;p2&gt;</code> || second person || anda || anda<prn><p2><sg> || You, your (depending on the position) || Anda is the formal 'You'. There are plenty of unformal 'You' variations
|-
|-
| <code><vbhaver></code> || verb ‘to have’ || VERB || berpayung - payung<n><av><vbhaver><sg> || having umbrella(payung) ||
| <code>&lt;p3&gt;</code> || third person || mereka || mereka<prn><p3><pl> || they, them, their (depending on the position) ||
|-
|-
|}
| <code><vbser></code> || verb ‘to be’ || VERB || || ||

Number
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
|-
| <code><av></code> || active voice || VOICE || || ||
| <code>&lt;sg&gt;</code> || singular || buku || buku<n><bare><sg> || book ||
|-
|-
| <code><pv></code> || passive voice || VOICE || || ||
| || || melompat || lompat<v><av><imp><sg> || jump ||
|-
|-
| <code>&lt;pl&gt;</code> || plural || buku-buku || buku<n><bare><pl> || books || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
| <code><perf></code> || perfective aspect || ASPECT || || ||
|-
|-
| || || melompat-lompat || lompat<v><av><imp><pl> || jump repeatedly ||
| <code><imp></code> || imperfective aspect || ASPECT || || ||
|-
|-
|}
| <code><ent></code> || derived entity noun || NOUN || kiriman - kirim<v><ent><sg> || package ||


Derivation Mark (not yet added/implemented)
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
|-
| <code><actor></code> || derived actor noun || NOUN || pengirim - kirim<v><actor><sg> || deliverer ||
| <code>&lt;der_n&gt;</code> || || || || ||
|-
|-
| <code><act></code> || derived act noun || NOUN || pengiriman - kirim<v><act><sg> || delivery ||
| <code>&lt;der_adj&gt;</code> || || || || ||
|-
|-
| <code>&lt;sup&gt;</code> || superlative adjective || ADJ || tertinggi - tinggi<adj><sup> || the tallest ||
| <code>&lt;der_v&gt;</code> || || || || ||
|-
|-
| <code>&lt;der_num&gt;</code> || || || || ||
| <code><enc></code> || enclitic || CLITIC || bukuku/buku<n><sg>+aku<enc><p1> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
|-
|}

Numeral Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;car&gt;</code> || cardinal number || tujuh || tujuh<num><car> || seven ||
|-
| <code>&lt;ord&gt;</code> || ordinal number || ketujuh || tujuh<num><ord> || the seventh ||
|-
| <code>&lt;coll&gt;</code> || collective number || bertujuh || tujuh<num><coll> || in group of seven ||
|-
| <code>&lt;ref&gt;</code> || referential number || tujuh-tujuhnya || tujuh<num>&lt;ref&gt; || all the seven of them ||
|-
| || || ketujuh-tujuhnya || tujuh<num>&lt;ref&gt; || all the seven of them ||
|-
|}


Verb Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;vbhaver&gt;</code> || verb ‘to have’ || berpayung || payung<n><av><perf><vbhaver><bare><sg> || having umbrella(payung) ||
|-
| <code>&lt;vbser&gt;</code> || verb ‘to be’ || perbesar || besar<adj><av><perf><vbser><sg> || making something to be big ||
|-
|}

Voice Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;av&gt;</code> || active voice || mengirim || kirim<v><av><imp><sg> || send ||
|-
| <code>&lt;pv&gt;</code> || passive voice || dikirim || kirim<v><pv><imp><sg> || being sent ||
|-
|}

Aspect Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;perf&gt;</code> || perfective aspect || kirim || kirim<v><av><perf><sg> || send ||
|-
| <code>&lt;imp&gt;</code> || imperfective aspect || mengirim || kirim<v><av><imp><sg> || send ||
|-
|}

Noun Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;bare&gt;</code> || original bare noun type || buku || buku<n><bare><sg> || book ||
|-
| <code>&lt;abstract&gt;</code> || derived abstract type || kedinginan || dingin<adj><abstract><sg> || coldness ||
|-
| <code>&lt;act&gt;</code> || derived act noun || pengiriman || kirim<v><act><sg> || delivery ||
|-
| || || pembukuan || buku<n><act><sg> || archiving ||
|-
| <code>&lt;actor&gt;</code> || derived actor noun || pengirim || kirim<v><actor><sg> || deliverer ||
|-
| <code>&lt;ent&gt;</code> || derived entity noun || kiriman || kirim<v><ent><sg> || package ||
|-
| <code>&lt;theme&gt;</code> || derived theme noun || pertokoan || toko<n><theme><sg> || something related to shop ||
|-
|}

Adjective Type
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;positive&gt;</code> || original bare adjective type || besar || besar<adj><bare> || big ||
|-
| <code>&lt;sup&gt;</code> || superlative adjective || terbesar || besar<adj>&lt;sup&gt; || the biggest ||
|-
| <code>&lt;exceed&gt;</code> || exceed adjective || kebesaran || besar<adj><exceed> || too big || equally grammatical with "terlalu besar"
|-
| <code>&lt;manner&gt;</code> || manner adjective || keibuan || ibu<n><manner> || motherly ||
|-
| <code>&lt;uni&gt;</code> || union adjective || seibu || ibu<n><bare><uni><sg> || same/one mother ||
|-
| <code>&lt;possib&gt;</code> || adjectival phrase || secepat-cepatnya || cepat<adj><possib> || as quick as possible ||
|-
| || || sebesar-besarnya || besar<adj><possib> || as big as possible ||
|-
|}

Clitic Mark
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;enc&gt;</code> || enclitic || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
|-
| <code>&lt;pro&gt;</code> || proclitic || kukirim || aku<prn><pro><p1><sg>+kirim<v><av><perf><sg> || I send || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence
|-
|}

Transitivity Mark
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
| <code>&lt;appl&gt;</code> || Applicative || kirimi || kirim<v><av><perf><appl><sg> || send ||
|-
| <code>&lt;caus&gt;</code> || Causative || kirimkan || kirim<v><av><perf><caus><sg> || send ||
|-
|}

Others
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
|-
|-
| <code>&lt;cap&gt;</code> || capitalized word mark || Buku || buku<n><bare><sg><cap> || Book || Capitalized word
| <code><pro></code> || proclitic || CLITIC || kucari - aku<pro><p1>+cari<v><av><perf>|| I look (for something) || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence
|-
|-
| <code>&lt;pos&gt;</code> || possesive || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
| <code><cap></code> || capitalization mark|| MARK || Kirim - kirim<v><av><perf><cap>? || ''see above <v>'' ||
|-
|-
|}
|}

Revision as of 17:13, 19 June 2010

Tagset

Original Part-of-Speech

Tag Description Example Example's Analysis Translation in English Notes
<adj> adjective lemma besar besar<adj><positive> big
<n> noun lemma buku buku<n><bare><sg> book
<num> number lemma tujuh tujuh<num><car> seven
<prn> pronoun saya saya<prn><p1><sg> I, me, my (depending on the position)
<det> determiner bukunya buku<n><bare><sg><det> can be 'his/her book' or 'the book'
<cnjcoo> coordinating conjunction dan dan<cnjcoo> and
<cnjsub> subordinating conjunction ketika ketika<cnjsub> when (not the WH question word)
<v> verb lemma kirim kirim<v><av><perf><sg> deliver
<par> particle lah lah<par> -
<mod> modal akan akan<mod> will
<inj> interjection ah ah<inj> -
<qst> question word apa apa<qst> what
<pr> preposition lemma di di<pr> in/on double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below


Person

Tag Description Example Example's Analysis Translation in English Notes
<p1> first person saya saya<prn><p1><sg> I, me, my (depending on the position) Saya is the formal 'I'. There are plenty of unformal 'I' variations
<p2> second person anda anda<prn><p2><sg> You, your (depending on the position) Anda is the formal 'You'. There are plenty of unformal 'You' variations
<p3> third person mereka mereka<prn><p3><pl> they, them, their (depending on the position)

Number

Tag Description Example Example's Analysis Translation in English Notes
<sg> singular buku buku<n><bare><sg> book
melompat lompat<v><av><imp><sg> jump
<pl> plural buku-buku buku<n><bare><pl> books this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
melompat-lompat lompat<v><av><imp><pl> jump repeatedly


Derivation Mark (not yet added/implemented)

Tag Description Example Example's Analysis Translation in English Notes
<der_n>
<der_adj>
<der_v>
<der_num>

Numeral Type

Tag Description Example Example's Analysis Translation in English Notes
<car> cardinal number tujuh tujuh<num><car> seven
<ord> ordinal number ketujuh tujuh<num><ord> the seventh
<coll> collective number bertujuh tujuh<num><coll> in group of seven
<ref> referential number tujuh-tujuhnya tujuh<num><ref> all the seven of them
ketujuh-tujuhnya tujuh<num><ref> all the seven of them


Verb Type

Tag Description Example Example's Analysis Translation in English Notes
<vbhaver> verb ‘to have’ berpayung payung<n><av><perf><vbhaver><bare><sg> having umbrella(payung)
<vbser> verb ‘to be’ perbesar besar<adj><av><perf><vbser><sg> making something to be big

Voice Type

Tag Description Example Example's Analysis Translation in English Notes
<av> active voice mengirim kirim<v><av><imp><sg> send
<pv> passive voice dikirim kirim<v><pv><imp><sg> being sent

Aspect Type

Tag Description Example Example's Analysis Translation in English Notes
<perf> perfective aspect kirim kirim<v><av><perf><sg> send
<imp> imperfective aspect mengirim kirim<v><av><imp><sg> send

Noun Type

Tag Description Example Example's Analysis Translation in English Notes
<bare> original bare noun type buku buku<n><bare><sg> book
<abstract> derived abstract type kedinginan dingin<adj><abstract><sg> coldness
<act> derived act noun pengiriman kirim<v><act><sg> delivery
pembukuan buku<n><act><sg> archiving
<actor> derived actor noun pengirim kirim<v><actor><sg> deliverer
<ent> derived entity noun kiriman kirim<v><ent><sg> package
<theme> derived theme noun pertokoan toko<n><theme><sg> something related to shop

Adjective Type

Tag Description Example Example's Analysis Translation in English Notes
<positive> original bare adjective type besar besar<adj><bare> big
<sup> superlative adjective terbesar besar<adj><sup> the biggest
<exceed> exceed adjective kebesaran besar<adj><exceed> too big equally grammatical with "terlalu besar"
<manner> manner adjective keibuan ibu<n><manner> motherly
<uni> union adjective seibu ibu<n><bare><uni><sg> same/one mother
<possib> adjectival phrase secepat-cepatnya cepat<adj><possib> as quick as possible
sebesar-besarnya besar<adj><possib> as big as possible

Clitic Mark

Tag Description Example Example's Analysis Translation in English Notes
<enc> enclitic bukuku buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> my book The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
<pro> proclitic kukirim aku<prn><pro><p1><sg>+kirim<v><av><perf><sg> I send The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence

Transitivity Mark

Tag Description Example Example's Analysis Translation in English Notes
<appl> Applicative kirimi kirim<v><av><perf><appl><sg> send
<caus> Causative kirimkan kirim<v><av><perf><caus><sg> send

Others

Tag Description Example Example's Analysis Translation in English Notes
<cap> capitalized word mark Buku buku<n><bare><sg><cap> Book Capitalized word
<pos> possesive bukuku buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> my book The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun

See also

External links