Difference between revisions of "Indonesian and Malaysian"

From Apertium
Jump to navigation Jump to search
(diper-...-kan)
 
(56 intermediate revisions by 2 users not shown)
Line 1: Line 1:
  +
This is a language pair translating between Indonesian and Malaysian.
  +
<!--
  +
'''Community Bonding Period (April 24, 2012 − May 20, 2012)'''<br/>
  +
'''Week 1 (May 21, 2012 − May 27, 2012''')<br/>
  +
* Completing the Indonesian morphotactic and morphophonemic rules.<br/>
  +
* Incorporating the extracted lemma list into the Indonesian monolingual dictionary.<br/>
  +
* Adding more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 2 (May 28, 2012 − June 3, 2012)'''<br/>
  +
* Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 3 (June 4, 2012 − June 10, 2012)'''<br/>
  +
* Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
* Completing the Malaysian morphotactic and morphophonemic rules. A significant portion of the completed Indonesian rules can be reused and modified as necessary, and moreover, new rules can be augmented.<br/>
  +
'''Week 4 (June 11, 2012 − June 17, 2012)'''<br/>
  +
* Extracting Malaysian stem words from the Kamus Dewan.<br/>
  +
* Incorporating the extracted lemma list into the Malaysian monolingual dictionary<br/>
  +
* Adding more derivational subentries to the Malaysian monolingual dictionary.<br/>
  +
A more detailed work plan can be found in [[/Work plan|Work plan]]
  +
-->
  +
 
==Tagset==
 
==Tagset==
  +
===Noun affixes===
 
  +
{|class="wikitable"
Original Part-of-Speech
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
! style="background:#efefef;" | Type of noun affixes
| <code>&lt;adj&gt;</code> || adjective lemma || besar || besar<adj><positive> || big ||
 
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
 
|-
 
|-
  +
|Prefix
| <code>&lt;n&gt;</code> || noun lemma || buku || buku<n><bare><sg> || book ||
 
  +
|per-
  +
|''ajar'' (teaching)
  +
|''pelajar'' (student)
  +
| {{tag|per}}
 
|-
 
|-
  +
|
| <code>&lt;num&gt;</code> || number lemma || tujuh || tujuh<num><card> || seven ||
 
  +
|peN-
  +
|''duduk'' (sit)
  +
|''penduduk'' (population)
  +
| {{tag|peN}}
 
|-
 
|-
  +
|
| <code>&lt;prn&gt;</code> || pronoun || saya || saya<prn><p1><sg> || I, me, my (depending on the position) ||
 
  +
|pe-
  +
|''nyanyi'' (sing)
  +
|''penyanyi'' (singer)
  +
| {{tag|pe}}
 
|-
 
|-
  +
|Suffix
| <code>&lt;det&gt;</code> || determiner || bukunya || buku<n><bare><sg><det> || can be 'his/her book' or 'the book' ||
 
  +
| -an
  +
|''bangun'' (wake up, raise)
  +
|''bangunan'' (building)
  +
| {{tag|san}}
 
|-
 
|-
  +
|Circumfix
| <code>&lt;cnjcoo&gt;</code> || coordinating conjunction || dan || dan<cnjcoo> || and ||
 
  +
|ke-...-an
  +
|''raja'' (king)
  +
|''kerajaan'' (kingdom)
  +
| {{tag|ke-an}}
 
|-
 
|-
  +
|
| <code>&lt;cnjsub&gt;</code> || subordinating conjunction || ketika || ketika<cnjsub> || when (not the WH question word) ||
 
  +
|per-...-an
  +
|''kerja'' (work)
  +
|''pekerjaan'' (occupation)
  +
| {{tag|per-an}}
 
|-
 
|-
  +
|
| <code>&lt;v&gt;</code> || verb lemma || kirim || kirim<v><actv><perf><sg> || deliver ||
 
  +
|peN-...-an
|-
 
  +
|''buat'' (make)
| <code>&lt;part&gt;</code> || particle || lah || lah<part> || - ||
 
  +
|''pembuatan'' (production)
|-
 
  +
| {{tag|peN-an}}
| <code>&lt;mod&gt;</code> || modal || akan || akan<mod> || will ||
 
|-
 
| <code>&lt;ij&gt;</code> || interjection || ah || ah<ij> || - ||
 
|-
 
| <code>&lt;qst&gt;</code> || question word || apa || apa<qst> || what ||
 
|-
 
| <code>&lt;pr&gt;</code> || preposition lemma || di || di<pr> || in/on || double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below
 
 
|-
 
|-
  +
|
  +
|pe-...-an
  +
|''layan'' (serve)
  +
|''pelayanan'' (service)
  +
| {{tag|pe-an}}
 
|}
 
|}
   
  +
===Verb affixes===
 
  +
{|class="wikitable"
Person
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
! style="background:#efefef;" | Type of verb affixes
| <code>&lt;p1&gt;</code> || first person || saya || saya<prn><p1><sg> || I, me, my (depending on the position) || Saya is the formal 'I'. There are plenty of unformal 'I' variations
 
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
 
|-
 
|-
  +
|Prefix
| <code>&lt;p2&gt;</code> || second person || anda || anda<prn><p2><sg> || You, your (depending on the position) || Anda is the formal 'You'. There are plenty of unformal 'You' variations
 
  +
|ber-
  +
|''ajar'' (teach)
  +
|''belajar'' (to study)
  +
|{{tag|ber}}
 
|-
 
|-
  +
|
| <code>&lt;p3&gt;</code> || third person || mereka || mereka<prn><p3><pl> || they, them, their (depending on the position) ||
 
  +
|meN-
  +
|''tolong'' (help)
  +
|''menolong'' (to help)
  +
|{{tag|actv}}
 
|-
 
|-
|}
+
|
  +
|di-
 
  +
|''ambil'' (take)
Number
 
  +
|''diambil'' (be taken)
{|class=wikitable
 
  +
|{{tag|pasv}}
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
|
| <code>&lt;sg&gt;</code> || singular || buku || buku<n><bare><sg> || book ||
 
  +
|memper-
  +
|''kemas'' (tidy up, orderly)
  +
|''memperkemas'' (to arrange further)
  +
|{{tag|actv}}{{tag|per}}
 
|-
 
|-
  +
|
| || || melompat || lompat<v><actv><imp><sg> || jump ||
 
  +
|diper-
  +
|''dalam'' (deep)
  +
|''diperdalam'' (be deepened)
  +
|{{tag|pasv}}{{tag|per}}
 
|-
 
|-
  +
|
| <code>&lt;pl&gt;</code> || plural || buku-buku || buku<n><bare><pl> || books || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
 
  +
|ter-
  +
|''makan'' (eat)
  +
|''termakan'' (to have accidentally eaten)
  +
|{{tag|ter}}
 
|-
 
|-
  +
|Suffix
| || || melompat-lompat || lompat<v><actv><imp><pl> || jump repeatedly ||
 
  +
| -kan
  +
|''letak'' (place, keep)
  +
|''letakkan'' (keep)
  +
|{{tag|kan}}
 
|-
 
|-
|}
+
|
  +
| -i
 
  +
|''jauh'' (far)
 
  +
|''jauhi'' (avoid)
Derivation Mark (not yet added/implemented)
 
  +
|{{tag|si}}
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
|Circumfix
| <code>&lt;der_n&gt;</code> || || || || ||
 
  +
|ber-...-an
  +
|''pasang'' (pair)
  +
|''berpasangan'' (in pairs)
  +
|{{tag|ber-an}}
 
|-
 
|-
  +
|
| <code>&lt;der_adj&gt;</code> || || || || ||
 
  +
|ber-...-kan
  +
|''dasar'' (base)
  +
|''berdasarkan'' (based on)
  +
|{{tag|ber-kan}}
 
|-
 
|-
  +
|
| <code>&lt;der_v&gt;</code> || || || || ||
 
  +
|meN-...-kan
  +
|''pasti'' (sure)
  +
|''memastikan'' (to make sure)
  +
|{{tag|actv}}{{tag|kan}}
 
|-
 
|-
  +
|
| <code>&lt;der_num&gt;</code> || || || || ||
 
  +
|meN-...-i
  +
|''teman'' (company)
  +
|''menemani'' (to accompany)
  +
|{{tag|actv}}{{tag|si}}
 
|-
 
|-
|}
+
|
  +
|memper-...-kan
 
  +
|''guna'' (use)
Numeral Type
 
  +
|''mempergunakan'' (to utilise, to exploit)
{|class=wikitable
 
  +
|{{tag|actv}}{{tag|per-kan}}
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;card&gt;</code> || cardinal number || tujuh || tujuh<num><card> || seven ||
 
|-
 
| <code>&lt;ord&gt;</code> || ordinal number || ketujuh || tujuh<num><ord> || the seventh ||
 
 
|-
 
|-
  +
|
| <code>&lt;coll&gt;</code> || collective number || bertujuh || tujuh<num><coll> || in group of seven ||
 
  +
|memper-...-i
  +
|''ajar'' (teach)
  +
|''mempelajari'' (to study)
  +
|{{tag|actv}}{{tag|per-i}}
 
|-
 
|-
  +
|
| <code>&lt;ref&gt;</code> || referential number || tujuh-tujuhnya || tujuh<num>&lt;ref&gt; || all the seven of them ||
 
  +
|di-...-i
  +
|''sakit'' (pain)
  +
|''disakiti'' (to be hurt by)
  +
|{{tag|pasv}}{{tag|si}}
 
|-
 
|-
  +
|
| || || ketujuh-tujuhnya || tujuh<num>&lt;ref&gt; || all the seven of them ||
 
  +
|di-...-kan
  +
|''benar'' (right)
  +
|''dibenarkan'' (is allowed to)
  +
|{{tag|pasv}}{{tag|kan}}
 
|-
 
|-
  +
|
  +
|diper-...-kan
  +
|''kenal'' (know, recognise)
  +
|''diperkenalkan'' (is being introduced)
  +
|{{tag|pasv}}{{tag|per-kan}}
 
|}
 
|}
   
  +
===Adjective affixes===
 
  +
{|class="wikitable"
Verb Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
! style="background:#efefef;" | Type of adjective affixes
| <code>&lt;vbhaver&gt;</code> || verb ‘to have’ || berpayung || payung<n><actv><perf><vbhaver><bare><sg> || having umbrella(payung) ||
 
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
 
|-
 
|-
  +
|Prefix
| <code>&lt;vbser&gt;</code> || verb ‘to be’ || perbesar || besar<adj><actv><perf><vbser><sg> || making something to be big ||
 
  +
|ter-
  +
|''kenal'' (know)
  +
|''terkenal'' (famous)
  +
|{{tag|ter}}
 
|-
 
|-
  +
|
  +
|se-
  +
|''lari'' (run)
  +
|''selari'' (parallel)
  +
|{{tag|se}}
 
|}
 
|}
  +
<!--
 
  +
===Verb-forming affixes===
Voice Type
 
 
{|class=wikitable
 
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
+
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
| <code>&lt;actv&gt;</code> || active voice || mengirim || kirim<v><actv><imp><sg> || send ||
+
| meN- || active voice || membaca || meN+baca || baca (v.read) || v.read || baca<vblex><actv>
 
|-
 
|-
  +
| di- || to be done by someone || dibaca || di+baca|| baca (v.read)|| read [by someone] || baca<vblex><di>
| <code>&lt;pasv&gt;</code> || passive voice || dikirim || kirim<v><pasv><imp><sg> || being sent ||
 
 
|-
 
|-
  +
| ber- || to convert noun to verb || berdiri || ber+diri || diri (n.self)|| stand || diri<n><vblex><ber>
|}
 
 
Aspect Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
| ber-R || || bermain-main || ber+main+main || main (v.play) || playing around || main<vblex><ber><pl>
| <code>&lt;perf&gt;</code> || perfective aspect || kirim || kirim<v><actv><perf><sg> || send ||
 
 
|-
 
|-
  +
| ber-an || reciprocal action || bertabrakan || ber+tabrak+an || tabrak (v.hit) || collide || tabrak<vblex><ber><an>
| <code>&lt;imp&gt;</code> || imperfective aspect || mengirim || kirim<v><actv><imp><sg> || send ||
 
 
|-
 
|-
  +
| ber-R-an || reciprocal action || berpeluk-pelukkan || ber+peluk+peluk+kan || peluk (v.hug) || hugging each other || peluk<vblex><ber><pl><an>
|}
 
 
Noun Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
| ter- || unintentional action || termakan || ter+makan|| makan (v.eat)|| eaten (unintentionally) || makan<vblex><ter>
| <code>&lt;bare&gt;</code> || original bare noun type || buku || buku<n><bare><sg> || book ||
 
 
|-
 
|-
  +
| ter-R || || terapung-apung || ter+apung+apung || apung (v.float) || float || apung<vblex><ter><pl>
| <code>&lt;abstract&gt;</code> || derived abstract type || kedinginan || dingin<adj><abstract><sg> || coldness ||
 
  +
|-
  +
| meN-i || || mendatangi || meN+datang+i || datang (v.come)|| come (to a place) || datang<vblex><actv>&lt;i&gt;
 
|-
 
|-
  +
| di-i || passive || didatangi|| di+datang+i || datang (v.come)|| visited || datang<vblex><di>&lt;i&gt;
| <code>&lt;actio&gt;</code> || derived act noun || pengiriman || kirim<v><actio><sg> || delivery ||
 
 
|-
 
|-
  +
| meN-kan || causative || mendatangkan || meN+datang+kan || datang (v.come)|| make someone come || datang<vblex><actv><kan>
| || || pembukuan || buku<n><actio><sg> || archiving ||
 
 
|-
 
|-
  +
| di-kan || causative, passive || didatangkan|| di+datang+kan || datang (v.come)|| imported || datang<vblex><di><kan>
| <code>&lt;actor&gt;</code> || derived actor noun || pengirim || kirim<v><actor><sg> || deliverer ||
 
 
|-
 
|-
  +
| memper- || to make less or more || memperbesar || meN+per+besar || besar (a.big) || make something bigger || besar<adj><vblex><actv><per>
| <code>&lt;ent&gt;</code> || derived entity noun || kiriman || kirim<v><ent><sg> || package ||
 
 
|-
 
|-
  +
| diper- || to make less or more, passive || diperbesar || di+per+besar || besar (a.big) || made bigger || besar<vblex><di><per>
| <code>&lt;theme&gt;</code> || derived theme noun || pertokoan || toko<n><theme><sg> || something related to shop ||
 
 
|-
 
|-
  +
| memper-kan || to change noun to transitive verb || mempertanyakan || meN+per+tanya+kan || tanya (n.question) || v.question || tanya<n><vblex><actv><per-kan>
|}
 
 
Adjective Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;positive&gt;</code> || original bare adjective type || besar || besar<adj><bare> || big ||
 
|-
 
| <code>&lt;sup&gt;</code> || superlative adjective || terbesar || besar<adj>&lt;sup&gt; || the biggest ||
 
|-
 
| <code>&lt;exceed&gt;</code> || exceed adjective || kebesaran || besar<adj><exceed> || too big || equally grammatical with "terlalu besar"
 
|-
 
| <code>&lt;manner&gt;</code> || manner adjective || keibuan || ibu<n><manner> || motherly ||
 
|-
 
| <code>&lt;uni&gt;</code> || union adjective || seibu || ibu<n><bare><uni><sg> || same/one mother ||
 
|-
 
| <code>&lt;possib&gt;</code> || adjectival phrase || secepat-cepatnya || cepat<adj><possib> || as quick as possible ||
 
|-
 
| || || sebesar-besarnya || besar<adj><possib> || as big as possible ||
 
 
|-
 
|-
  +
| diper-kan || || dipertanyakan || di+per+tanya+kan || tanya (n.question) || questioned || tanya<n><vblex><di><per-kan>
 
|}
 
|}
   
  +
===Adjective formation===
Clitic Mark
 
 
{|class=wikitable
 
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
+
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
  +
| ter- || superlative || terbesar || ter+besar || besar (a.big) || biggest || besar<adj><ter>
| <code>&lt;enc&gt;</code> || enclitic || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
 
 
|-
 
|-
  +
| se- || same || sebesar || se+besar || besar (a.big) || as big as || besar<adj><se>
| <code>&lt;pro&gt;</code> || proclitic || kukirim || aku<prn><pro><p1><sg>+kirim<v><actv><perf><sg> || I send || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence
 
 
|-
 
|-
 
|}
 
|}
   
  +
===Noun formation===
Transitivity Mark
 
 
{|class=wikitable
 
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
+
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
  +
| -an || entity || tulisan || tulis+an || tulis (v.write) || writings || tulis<vblex><n><sg><an>
| <code>&lt;appl&gt;</code> || Applicative || kirimi || kirim<v><actv><perf><appl><sg> || send ||
 
 
|-
 
|-
  +
| peN- || actor || penjual || peN+jual || jual (v.sell) || seller || jual<vblex><n><sg><peN>
| <code>&lt;caus&gt;</code> || Causative || kirimkan || kirim<v><actv><perf><caus><sg> || send ||
 
|-
 
|}
 
 
Others
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
 
|-
 
|-
  +
| ke-an || abstract || kepandaian || ke+pandai+an || pandai (a.intelligent) || intelligence || pandai<adj><n><sg><ke-an>
| <code>&lt;cap&gt;</code> || capitalized word mark || Buku || buku<n><bare><sg><cap> || Book || Capitalized word
 
 
|-
 
|-
  +
| peN-an || process || pengiriman || pe+kirim+an || kirim (v.send) || delivery || kirim<vblex><n><sg><peN-an>
| <code>&lt;pos&gt;</code> || possesive || bukuku || buku<n><bare><sg>+aku<prn><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
 
 
|-
 
|-
  +
| per-an || theme || pertokoan || per+toko+an || toko (n.store) || shopping centre || toko<n><sg><per-an>
 
|}
 
|}
  +
-->
   
 
==See also==
 
==See also==
   
 
* [[/Pending tests|Pending tests]]
 
* [[/Pending tests|Pending tests]]
  +
* [[/Previous morphology|Previous morphology]]
  +
* [[/Work plan|Work plan]]
   
 
==External links==
 
==External links==
   
 
* [http://en.wikipedia.org/wiki/Differences_between_Malay_and_Indonesian Wikipedia: Differences between Malaysian and Indonesian]
 
* [http://en.wikipedia.org/wiki/Differences_between_Malay_and_Indonesian Wikipedia: Differences between Malaysian and Indonesian]
  +
* [http://en.wikipedia.org/wiki/Malay_grammar Malay grammar]
   
 
[[Category:Indonesian and Malaysian|*]]
 
[[Category:Indonesian and Malaysian|*]]

Latest revision as of 18:02, 22 August 2012

This is a language pair translating between Indonesian and Malaysian.

Tagset[edit]

Noun affixes[edit]

Type of noun affixes Affix Example of root word Example of derived word Tag(s)
Prefix per- ajar (teaching) pelajar (student) <per>
peN- duduk (sit) penduduk (population) <peN>
pe- nyanyi (sing) penyanyi (singer) <pe>
Suffix -an bangun (wake up, raise) bangunan (building) <san>
Circumfix ke-...-an raja (king) kerajaan (kingdom) <ke-an>
per-...-an kerja (work) pekerjaan (occupation) <per-an>
peN-...-an buat (make) pembuatan (production) <peN-an>
pe-...-an layan (serve) pelayanan (service) <pe-an>

Verb affixes[edit]

Type of verb affixes Affix Example of root word Example of derived word Tag(s)
Prefix ber- ajar (teach) belajar (to study) <ber>
meN- tolong (help) menolong (to help) <actv>
di- ambil (take) diambil (be taken) <pasv>
memper- kemas (tidy up, orderly) memperkemas (to arrange further) <actv><per>
diper- dalam (deep) diperdalam (be deepened) <pasv><per>
ter- makan (eat) termakan (to have accidentally eaten) <ter>
Suffix -kan letak (place, keep) letakkan (keep) <kan>
-i jauh (far) jauhi (avoid) <si>
Circumfix ber-...-an pasang (pair) berpasangan (in pairs) <ber-an>
ber-...-kan dasar (base) berdasarkan (based on) <ber-kan>
meN-...-kan pasti (sure) memastikan (to make sure) <actv><kan>
meN-...-i teman (company) menemani (to accompany) <actv><si>
memper-...-kan guna (use) mempergunakan (to utilise, to exploit) <actv><per-kan>
memper-...-i ajar (teach) mempelajari (to study) <actv><per-i>
di-...-i sakit (pain) disakiti (to be hurt by) <pasv><si>
di-...-kan benar (right) dibenarkan (is allowed to) <pasv><kan>
diper-...-kan kenal (know, recognise) diperkenalkan (is being introduced) <pasv><per-kan>

Adjective affixes[edit]

Type of adjective affixes Affix Example of root word Example of derived word Tag(s)
Prefix ter- kenal (know) terkenal (famous) <ter>
se- lari (run) selari (parallel) <se>

See also[edit]

External links[edit]