Difference between revisions of "Indonesian and Malaysian"

From Apertium
(diper-...-kan)
 
(58 intermediate revisions by 2 users not shown)
Line 1: Line 1:
  +
This is a language pair translating between Indonesian and Malaysian.
  +
<!--
  +
'''Community Bonding Period (April 24, 2012 − May 20, 2012)'''<br/>
  +
'''Week 1 (May 21, 2012 − May 27, 2012)'''<br/>
  +
* Completing the Indonesian morphotactic and morphophonemic rules.<br/>
  +
* Incorporating the extracted lemma list into the Indonesian monolingual dictionary.<br/>
  +
* Adding more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 2 (May 28, 2012 − June 3, 2012)'''<br/>
  +
* Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 3 (June 4, 2012 − June 10, 2012)'''<br/>
  +
* Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
* Completing the Malaysian morphotactic and morphophonemic rules. A significant portion of the completed Indonesian rules can be reused and modified as necessary, and new rules can be added where needed.<br/>
  +
'''Week 4 (June 11, 2012 − June 17, 2012)'''<br/>
  +
* Extracting Malaysian stem words from the Kamus Dewan.<br/>
  +
* Incorporating the extracted lemma list into the Malaysian monolingual dictionary.<br/>
  +
* Adding more derivational subentries to the Malaysian monolingual dictionary.<br/>
  +
A more detailed work plan can be found in [[/Work plan|Work plan]].
  +
-->
  +
 
==Tagset==
 
  +
===Noun affixes===
Not fixed yet; still under design.
 
  +
{|class="wikitable"
  +
|-
  +
! style="background:#efefef;" | Type of noun affixes
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
  +
|-
  +
|Prefix
  +
|per-
  +
|''ajar'' (teaching)
  +
|''pelajar'' (student)
  +
| {{tag|per}}
  +
|-
  +
|
  +
|peN-
  +
|''duduk'' (sit)
  +
|''penduduk'' (population)
  +
| {{tag|peN}}
  +
|-
  +
|
  +
|pe-
  +
|''nyanyi'' (sing)
  +
|''penyanyi'' (singer)
  +
| {{tag|pe}}
  +
|-
  +
|Suffix
  +
| -an
  +
|''bangun'' (wake up, raise)
  +
|''bangunan'' (building)
  +
| {{tag|san}}
  +
|-
  +
|Circumfix
  +
|ke-...-an
  +
|''raja'' (king)
  +
|''kerajaan'' (kingdom)
  +
| {{tag|ke-an}}
  +
|-
  +
|
  +
|per-...-an
  +
|''kerja'' (work)
  +
|''pekerjaan'' (occupation)
  +
| {{tag|per-an}}
  +
|-
  +
|
  +
|peN-...-an
  +
|''buat'' (make)
  +
|''pembuatan'' (production)
  +
| {{tag|peN-an}}
  +
|-
  +
|
  +
|pe-...-an
  +
|''layan'' (serve)
  +
|''pelayanan'' (service)
  +
| {{tag|pe-an}}
  +
|}
   
  +
===Verb affixes===
{|class=wikitable
 
  +
{|class="wikitable"
! Tag || Description || Tag Type || Example || Meaning || Notes
 
 
|-
 
|-
  +
! style="background:#efefef;" | Type of verb affixes
| <code><prn></code> || pronoun || POS || saya - saya<prn><p1><sg> || I, me, my (depending on the position) ||
 
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
 
|-
 
|-
  +
|Prefix
| <code><v></code> || verb lemma || POS || kirim - kirim<v><av><perf>? || deliver ||
 
  +
|ber-
  +
|''ajar'' (teach)
  +
|''belajar'' (to study)
  +
|{{tag|ber}}
 
|-
 
|-
  +
|
| <code><n></code> || noun lemma || POS || pesan-pesan - pesan<n><pl> || messages ||
 
  +
|meN-
  +
|''tolong'' (help)
  +
|''menolong'' (to help)
  +
|{{tag|actv}}
 
|-
 
|-
  +
|
| <code><adj></code> || adjective lemma || POS || tinggi - tinggi<adj> || tall, high ||
 
  +
|di-
  +
|''ambil'' (take)
  +
|''diambil'' (be taken)
  +
|{{tag|pasv}}
 
|-
 
|-
  +
|
| <code><num></code> || number lemma || POS || tujuh - tujuh<num><car> || seven ||
 
  +
|memper-
  +
|''kemas'' (tidy up, orderly)
  +
|''memperkemas'' (to arrange further)
  +
|{{tag|actv}}{{tag|per}}
 
|-
 
|-
  +
|
| <code><per></code> || preposition lemma || POS || dalam - dalam<per> || in ||
 
  +
|diper-
  +
|''dalam'' (deep)
  +
|''diperdalam'' (be deepened)
  +
|{{tag|pasv}}{{tag|per}}
 
|-
 
|-
  +
|
| <code><cnjsub></code> || subordinating conjunction lemma || POS || ketika - ketika<cnjsub> || when (not the WH question word) ||
 
  +
|ter-
  +
|''makan'' (eat)
  +
|''termakan'' (to have accidentally eaten)
  +
|{{tag|ter}}
 
|-
 
|-
  +
|Suffix
| <code><cnjcoo></code> || coordinating conjunction lemma || POS || dan - dan<cnjcoo> || and ||
 
  +
| -kan
  +
|''letak'' (place, keep)
  +
|''letakkan'' (keep)
  +
|{{tag|kan}}
 
|-
 
|-
  +
|
| <code><p1></code> || first person || PERSON || saya - saya<prn><p1><sg> || I, me, my (depending on the position) ||
 
  +
| -i
  +
|''jauh'' (far)
  +
|''jauhi'' (avoid)
  +
|{{tag|si}}
 
|-
 
|-
  +
|Circumfix
| <code><p2></code> || second person || PERSON || anda - anda<prn><p2><sg> || You, your (depending on the position) ||
 
  +
|ber-...-an
  +
|''pasang'' (pair)
  +
|''berpasangan'' (in pairs)
  +
|{{tag|ber-an}}
 
|-
 
|-
  +
|
| <code><p3></code> || third person || PERSON || mereka - mereka<prn><p3><pl> || they, them, their (depending on the position) ||
 
  +
|ber-...-kan
  +
|''dasar'' (base)
  +
|''berdasarkan'' (based on)
  +
|{{tag|ber-kan}}
 
|-
 
|-
  +
|
| <code><sg></code> || singular || NUM || melompat - lompat<v><av><imp><sg> || jump || This number feature does not occur only in nouns.
 
  +
|meN-...-kan
  +
|''pasti'' (sure)
  +
|''memastikan'' (to make sure)
  +
|{{tag|actv}}{{tag|kan}}
 
|-
 
|-
  +
|
| <code><pl></code> || plural || NUM || melompat-lompat - lompat<v><av><imp><pl> || jump repeatedly || This number feature does not occur only in nouns. It marks not only plurality of entities but also plurality of the event (repetition) or reference to plural entities.
 
  +
|meN-...-i
  +
|''teman'' (company)
  +
|''menemani'' (to accompany)
  +
|{{tag|actv}}{{tag|si}}
 
|-
 
|-
  +
|
| <code><car></code> || cardinal number || NUM || tujuh - tujuh<num><car> || seven ||
 
  +
|memper-...-kan
  +
|''guna'' (use)
  +
|''mempergunakan'' (to utilise, to exploit)
  +
|{{tag|actv}}{{tag|per-kan}}
 
|-
 
|-
  +
|
| <code><ord></code> || ordinal number || NUM || ketujuh - tujuh<num><ord> || the seventh ||
 
  +
|memper-...-i
  +
|''ajar'' (teach)
  +
|''mempelajari'' (to study)
  +
|{{tag|actv}}{{tag|per-i}}
 
|-
 
|-
  +
|
| <code><coll></code> || collective number || NUM || bertujuh - tujuh<num><coll> || in a group of seven ||
 
  +
|di-...-i
  +
|''sakit'' (pain)
  +
|''disakiti'' (to be hurt by)
  +
|{{tag|pasv}}{{tag|si}}
 
|-
 
|-
  +
|
| <code><vbhaver></code> || verb ‘to have’ || VERB || berpayung - payung<n><av><vbhaver><sg> || having an umbrella (''payung'') ||
 
  +
|di-...-kan
  +
|''benar'' (right)
  +
|''dibenarkan'' (is allowed to)
  +
|{{tag|pasv}}{{tag|kan}}
 
|-
 
|-
  +
|
| <code><vbser></code> || verb ‘to be’ || VERB || || ||
 
  +
|diper-...-kan
  +
|''kenal'' (know, recognise)
  +
|''diperkenalkan'' (is being introduced)
  +
|{{tag|pasv}}{{tag|per-kan}}
  +
|}
  +
  +
===Adjective affixes===
  +
{|class="wikitable"
 
|-
 
|-
  +
! style="background:#efefef;" | Type of adjective affixes
| <code><av></code> || active voice || VOICE || || ||
 
  +
! style="background:#efefef;" | Affix
  +
! style="background:#efefef;" | Example of root word
  +
! style="background:#efefef;" | Example of derived word
  +
! style="background:#efefef;" | Tag(s)
 
|-
 
|-
  +
|Prefix
| <code><pv></code> || passive voice || VOICE || || ||
 
  +
|ter-
  +
|''kenal'' (know)
  +
|''terkenal'' (famous)
  +
|{{tag|ter}}
 
|-
 
|-
  +
|
| <code><perf></code> || perfective aspect || ASPECT || || ||
 
  +
|se-
  +
|''lari'' (run)
  +
|''selari'' (parallel)
  +
|{{tag|se}}
  +
|}
  +
<!--
  +
===Verb-forming affixes===
  +
{|class=wikitable
  +
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
| <code><imp></code> || imperfective aspect || ASPECT || || ||
+
| meN- || active voice || membaca || meN+baca || baca (v.read) || v.read || baca<vblex><actv>
 
|-
 
|-
  +
| di- || to be done by someone || dibaca || di+baca|| baca (v.read)|| read [by someone] || baca<vblex><di>
| <code><ent></code> || derived entity noun || NOUN || kiriman - kirim<v><ent><sg> || package ||
 
 
|-
 
|-
| <code><actor></code> || derived actor noun || NOUN || pengirim - kirim<v><actor><sg> || deliverer ||
+
| ber- || to convert noun to verb || berdiri || ber+diri || diri (n.self)|| stand || diri<n><vblex><ber>
 
|-
 
|-
  +
| ber-R || || bermain-main || ber+main+main || main (v.play) || playing around || main<vblex><ber><pl>
| <code><act></code> || derived act noun || NOUN || pengiriman - kirim<v><act><sg> || delivery ||
 
  +
|-
  +
| ber-an || reciprocal action || bertabrakan || ber+tabrak+an || tabrak (v.hit) || collide || tabrak<vblex><ber><an>
  +
|-
  +
| ber-R-an || reciprocal action || berpeluk-pelukan || ber+peluk+peluk+an || peluk (v.hug) || hugging each other || peluk<vblex><ber><pl><an>
  +
|-
  +
| ter- || unintentional action || termakan || ter+makan|| makan (v.eat)|| eaten (unintentionally) || makan<vblex><ter>
  +
|-
  +
| ter-R || || terapung-apung || ter+apung+apung || apung (v.float) || float || apung<vblex><ter><pl>
  +
|-
  +
| meN-i || || mendatangi || meN+datang+i || datang (v.come)|| come (to a place) || datang<vblex><actv>&lt;i&gt;
  +
|-
  +
| di-i || passive || didatangi|| di+datang+i || datang (v.come)|| visited || datang<vblex><di>&lt;i&gt;
  +
|-
  +
| meN-kan || causative || mendatangkan || meN+datang+kan || datang (v.come)|| make someone come || datang<vblex><actv><kan>
  +
|-
  +
| di-kan || causative, passive || didatangkan|| di+datang+kan || datang (v.come)|| imported || datang<vblex><di><kan>
  +
|-
  +
| memper- || to make less or more || memperbesar || meN+per+besar || besar (a.big) || make something bigger || besar<adj><vblex><actv><per>
  +
|-
  +
| diper- || to make less or more, passive || diperbesar || di+per+besar || besar (a.big) || made bigger || besar<vblex><di><per>
  +
|-
  +
| memper-kan || to change noun to transitive verb || mempertanyakan || meN+per+tanya+kan || tanya (n.question) || v.question || tanya<n><vblex><actv><per-kan>
  +
|-
  +
| diper-kan || || dipertanyakan || di+per+tanya+kan || tanya (n.question) || questioned || tanya<n><vblex><di><per-kan>
  +
|}
  +
  +
===Adjective formation===
  +
{|class=wikitable
  +
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
  +
|-
  +
| ter- || superlative || terbesar || ter+besar || besar (a.big) || biggest || besar<adj><ter>
  +
|-
  +
| se- || same || sebesar || se+besar || besar (a.big) || as big as || besar<adj><se>
  +
|-
  +
|}
  +
  +
===Noun formation===
  +
{|class=wikitable
  +
! Affix || Function || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
  +
| -an || entity || tulisan || tulis+an || tulis (v.write) || writings || tulis<vblex><n><sg><an>
| <code>&lt;sup&gt;</code> || superlative adjective || ADJ || tertinggi - tinggi<adj><sup> || the tallest ||
 
 
|-
 
|-
  +
| peN- || actor || penjual || peN+jual || jual (v.sell) || seller || jual<vblex><n><sg><peN>
| <code><enc></code> || enclitic || CLITIC || bukuku/buku<n><sg>+aku<enc><p1> || my book || The clitics are pronouns. In this example the alternative form "buku aku" is also grammatically correct. In a sequence of a noun followed by a pronoun, the pronoun acts as a possessive pronoun.
 
 
|-
 
|-
  +
| ke-an || abstract || kepandaian || ke+pandai+an || pandai (a.intelligent) || intelligence || pandai<adj><n><sg><ke-an>
| <code><pro></code> || proclitic || CLITIC || kucari - aku<pro><p1>+cari<v><av><perf>|| I look (for something) || The clitics are pronouns. In this example the alternative form "aku cari" is also grammatically correct. In a sequence of a pronoun followed by a verb, the pronoun is in most cases the subject of the sentence.
 
 
|-
 
|-
  +
| peN-an || process || pengiriman || peN+kirim+an || kirim (v.send) || delivery || kirim<vblex><n><sg><peN-an>
| <code><cap></code> || capitalization mark|| MARK || Kirim - kirim<v><av><perf><cap>? || ''see above <v>'' ||
 
 
|-
 
|-
  +
| per-an || theme || pertokoan || per+toko+an || toko (n.store) || shopping centre || toko<n><sg><per-an>
 
|}
 
|}
  +
-->
   
 
==See also==
 
   
 
* [[/Pending tests|Pending tests]]
 
  +
* [[/Previous morphology|Previous morphology]]
  +
* [[/Work plan|Work plan]]
   
 
==External links==
 
   
 
* [http://en.wikipedia.org/wiki/Differences_between_Malay_and_Indonesian Wikipedia: Differences between Malaysian and Indonesian]
 
  +
* [http://en.wikipedia.org/wiki/Malay_grammar Malay grammar]
   
 
[[Category:Indonesian and Malaysian|*]]
 

Latest revision as of 18:02, 22 August 2012

This is a language pair translating between Indonesian and Malaysian.

==Tagset==

===Noun affixes===

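{|class="wikitable"
|-
! Type of noun affixes !! Affix !! Example of root word !! Example of derived word !! Tag(s)
|-
| Prefix || per- || ''ajar'' (teaching) || ''pelajar'' (student) || {{tag|per}}
|-
| || peN- || ''duduk'' (sit) || ''penduduk'' (population) || {{tag|peN}}
|-
| || pe- || ''nyanyi'' (sing) || ''penyanyi'' (singer) || {{tag|pe}}
|-
| Suffix || -an || ''bangun'' (wake up, raise) || ''bangunan'' (building) || {{tag|san}}
|-
| Circumfix || ke-...-an || ''raja'' (king) || ''kerajaan'' (kingdom) || {{tag|ke-an}}
|-
| || per-...-an || ''kerja'' (work) || ''pekerjaan'' (occupation) || {{tag|per-an}}
|-
| || peN-...-an || ''buat'' (make) || ''pembuatan'' (production) || {{tag|peN-an}}
|-
| || pe-...-an || ''layan'' (serve) || ''pelayanan'' (service) || {{tag|pe-an}}
|}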
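The noun affix tags above can be read back out of an analysis string. The following is a minimal Python sketch, not part of the language pair itself: the helper names <code>split_analysis</code> and <code>noun_affix_of</code> are hypothetical, and the example string <code>raja<n><ke-an></code> (in particular the presence and position of <code><n></code>) is an assumption, since the table only fixes the affix tags.

```python
import re

# Noun affix -> tag, copied from the noun affix table above.
NOUN_AFFIX_TAGS = {
    "per-": "per",
    "peN-": "peN",
    "pe-": "pe",
    "-an": "san",
    "ke-...-an": "ke-an",
    "per-...-an": "per-an",
    "peN-...-an": "peN-an",
    "pe-...-an": "pe-an",
}

# Matches one <tag> and captures the text between the angle brackets.
TAG_RE = re.compile(r"<([^<>]+)>")


def split_analysis(analysis):
    """Split 'lemma<tag1><tag2>' into (lemma, [tag1, tag2])."""
    lemma, sep, rest = analysis.partition("<")
    return lemma, TAG_RE.findall(sep + rest)


def noun_affix_of(analysis):
    """Return the noun affix whose tag occurs in the analysis, or None."""
    tag_to_affix = {tag: affix for affix, tag in NOUN_AFFIX_TAGS.items()}
    _, tags = split_analysis(analysis)
    for tag in tags:
        if tag in tag_to_affix:
            return tag_to_affix[tag]
    return None


# Hypothetical analysis of "kerajaan" (kingdom); the tag order is assumed.
print(split_analysis("raja<n><ke-an>"))
print(noun_affix_of("raja<n><ke-an>"))
```

Because tags such as <code>ke-an</code> contain a hyphen, the sketch matches everything between angle brackets rather than only word characters.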

===Verb affixes===

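{|class="wikitable"
|-
! Type of verb affixes !! Affix !! Example of root word !! Example of derived word !! Tag(s)
|-
| Prefix || ber- || ''ajar'' (teach) || ''belajar'' (to study) || {{tag|ber}}
|-
| || meN- || ''tolong'' (help) || ''menolong'' (to help) || {{tag|actv}}
|-
| || di- || ''ambil'' (take) || ''diambil'' (be taken) || {{tag|pasv}}
|-
| || memper- || ''kemas'' (tidy up, orderly) || ''memperkemas'' (to arrange further) || {{tag|actv}}{{tag|per}}
|-
| || diper- || ''dalam'' (deep) || ''diperdalam'' (be deepened) || {{tag|pasv}}{{tag|per}}
|-
| || ter- || ''makan'' (eat) || ''termakan'' (to have accidentally eaten) || {{tag|ter}}
|-
| Suffix || -kan || ''letak'' (place, keep) || ''letakkan'' (keep) || {{tag|kan}}
|-
| || -i || ''jauh'' (far) || ''jauhi'' (avoid) || {{tag|si}}
|-
| Circumfix || ber-...-an || ''pasang'' (pair) || ''berpasangan'' (in pairs) || {{tag|ber-an}}
|-
| || ber-...-kan || ''dasar'' (base) || ''berdasarkan'' (based on) || {{tag|ber-kan}}
|-
| || meN-...-kan || ''pasti'' (sure) || ''memastikan'' (to make sure) || {{tag|actv}}{{tag|kan}}
|-
| || meN-...-i || ''teman'' (company) || ''menemani'' (to accompany) || {{tag|actv}}{{tag|si}}
|-
| || memper-...-kan || ''guna'' (use) || ''mempergunakan'' (to utilise, to exploit) || {{tag|actv}}{{tag|per-kan}}
|-
| || memper-...-i || ''ajar'' (teach) || ''mempelajari'' (to study) || {{tag|actv}}{{tag|per-i}}
|-
| || di-...-i || ''sakit'' (pain) || ''disakiti'' (to be hurt by) || {{tag|pasv}}{{tag|si}}
|-
| || di-...-kan || ''benar'' (right) || ''dibenarkan'' (is allowed to) || {{tag|pasv}}{{tag|kan}}
|-
| || diper-...-kan || ''kenal'' (know, recognise) || ''diperkenalkan'' (is being introduced) || {{tag|pasv}}{{tag|per-kan}}
|}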
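Several verb affixes map to a sequence of tags rather than a single tag, and active and passive forms differ only in <code><actv></code> versus <code><pasv></code>. The Python sketch below illustrates this under stated assumptions: the function names <code>format_analysis</code> and <code>to_passive</code> are hypothetical, and prefixing the part-of-speech tag <code><vblex></code> before the affix tags is an assumed ordering, not something this tagset table specifies.

```python
# Verb affix -> tag sequence, copied from the verb affix table above.
VERB_AFFIX_TAGS = {
    "ber-": ["ber"],
    "meN-": ["actv"],
    "di-": ["pasv"],
    "memper-": ["actv", "per"],
    "diper-": ["pasv", "per"],
    "ter-": ["ter"],
    "-kan": ["kan"],
    "-i": ["si"],
    "ber-...-an": ["ber-an"],
    "ber-...-kan": ["ber-kan"],
    "meN-...-kan": ["actv", "kan"],
    "meN-...-i": ["actv", "si"],
    "memper-...-kan": ["actv", "per-kan"],
    "memper-...-i": ["actv", "per-i"],
    "di-...-i": ["pasv", "si"],
    "di-...-kan": ["pasv", "kan"],
    "diper-...-kan": ["pasv", "per-kan"],
}


def format_analysis(lemma, affix, pos="vblex"):
    """Build 'lemma<pos><tag>...' for an affix listed in the table.

    Placing the POS tag first is an assumption made for illustration.
    """
    tags = [pos] + VERB_AFFIX_TAGS[affix]
    return lemma + "".join(f"<{tag}>" for tag in tags)


def to_passive(affix):
    """Return the passive counterpart of an active affix, or None
    if the table does not list one."""
    if affix.startswith("memper-"):
        candidate = "diper-" + affix[len("memper-"):]
    elif affix.startswith("meN-"):
        candidate = "di-" + affix[len("meN-"):]
    else:
        return None
    return candidate if candidate in VERB_AFFIX_TAGS else None


print(format_analysis("datang", "meN-...-kan"))
print(format_analysis("datang", to_passive("meN-...-kan")))
```

Note that <code>to_passive("memper-...-i")</code> returns <code>None</code>, because the table lists no ''diper-...-i'' row.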

===Adjective affixes===

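{|class="wikitable"
|-
! Type of adjective affixes !! Affix !! Example of root word !! Example of derived word !! Tag(s)
|-
| Prefix || ter- || ''kenal'' (know) || ''terkenal'' (famous) || {{tag|ter}}
|-
| || se- || ''lari'' (run) || ''selari'' (parallel) || {{tag|se}}
|}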

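==See also==

* [[/Pending tests|Pending tests]]
* [[/Previous morphology|Previous morphology]]
* [[/Work plan|Work plan]]

==External links==

* [http://en.wikipedia.org/wiki/Differences_between_Malay_and_Indonesian Wikipedia: Differences between Malaysian and Indonesian]
* [http://en.wikipedia.org/wiki/Malay_grammar Malay grammar]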