Difference between revisions of "Indonesian and Malaysian"

From Apertium
Jump to navigation Jump to search
(Personal pronoun)
Line 1: Line 1:
  +
''Description about this project will be added soon.''
==Tagset==
 
   
  +
==Work Plan==
Original Part-of-Speech
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;adj&gt;</code> || adjective lemma || besar || besar<adj><pst> || big ||
 
|-
 
| <code>&lt;n&gt;</code> || noun lemma || buku || buku<n><bare><sg> || book ||
 
|-
 
| <code>&lt;num&gt;</code> || number lemma || tujuh || tujuh<num><card> || seven ||
 
|-
 
| <code>&lt;prn&gt;</code> || pronoun || saya || saya<prn><pers><p1><sg> || I, me, my (depending on the position) ||
 
|-
 
| <code>&lt;det&gt;</code> || determiner || bukunya || buku<n><bare><sg><det> || can be 'his/her book' or 'the book' ||
 
|-
 
| <code>&lt;cnjcoo&gt;</code> || coordinating conjunction || dan || dan<cnjcoo> || and ||
 
|-
 
| <code>&lt;cnjsub&gt;</code> || subordinating conjunction || ketika || ketika<cnjsub> || when (not the WH question word) ||
 
|-
 
| <code>&lt;vblex&gt;</code> || verb lemma || kirim || kirim<vblex><actv><perf><sg> || deliver ||
 
|-
 
| <code>&lt;part&gt;</code> || particle || lah || lah<part> || - ||
 
|-
 
| <code>&lt;mod&gt;</code> || modal || akan || akan<mod> || will ||
 
|-
 
| <code>&lt;ij&gt;</code> || interjection || ah || ah<ij> || - ||
 
|-
 
| <code>&lt;qst&gt;</code> || question word || apa || apa<qst> || what ||
 
|-
 
| <code>&lt;pr&gt;</code> || preposition lemma || di || di<pr> || in/on || double preposition: 'di dalam' = inside, 'di atas' = above, 'di bawah' = below
 
|-
 
|}
 
   
  +
'''Community Bonding Period (April 24, 2012 − May 20, 2012)'''<br/>
  +
'''Week 1 (May 21, 2012 − May 27, 2012''')<br/>
  +
• Completing the Indonesian morphotactic and morphophonemic rules.<br/>
  +
• Incorporating the extracted lemma list into the Indonesian monolingual dictionary.<br/>
  +
• Adding more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 2 (May 28, 2012 − June 3, 2012)'''<br/>
  +
• Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
'''Week 3 (June 4, 2012 − June 10, 2012)'''<br/>
  +
• Continuing to add more derivational subentries to the Indonesian monolingual dictionary.<br/>
  +
• Completing the Malaysian morphotactic and morphophonemic rules. A significant portion of the completed Indonesian rules can be reused and modified as necessary, and moreover, new rules can be augmented.<br/>
  +
'''Week 4 (June 11, 2012 − June 17, 2012)'''<br/>
  +
• Extracting Malaysian stem words from the Kamus Dewan.<br/>
  +
• Incorporating the extracted lemma list into the Malaysian monolingual dictionary<br/>
  +
• Adding more derivational subentries to the Malaysian monolingual dictionary.<br/>
   
Personal Pronoun
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;p1&gt;</code> || first person || saya || saya<prn><pers><p1><sg> || I, me, my (depending on the position) || Saya is the formal 'I'. There are plenty of unformal 'I' variations
 
|-
 
| <code>&lt;p2&gt;</code> || second person || anda || anda<prn><pers><p2><sg> || You, your (depending on the position) || Anda is the formal 'You'. There are plenty of unformal 'You' variations
 
|-
 
| <code>&lt;p3&gt;</code> || third person || mereka || mereka<prn><pers><p3><pl> || they, them, their (depending on the position) ||
 
|-
 
|}
 
   
Number
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;sg&gt;</code> || singular || buku || buku<n><bare><sg> || book ||
 
|-
 
| || || melompat || lompat<vblex><actv><imperf><sg> || jump ||
 
|-
 
| <code>&lt;pl&gt;</code> || plural || buku-buku || buku<n><bare><pl> || books || this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
 
|-
 
| || || melompat-lompat || lompat<vblex><actv><imperf><pl> || jump repeatedly ||
 
|-
 
|}
 
   
  +
==Morphology==
   
  +
'''Verb-forming affixes'''<br/>
Derivation Mark (not yet added/implemented)
 
  +
Prefix meN-
 
{|class=wikitable
 
{|class=wikitable
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
+
! Tags || Meaning || Example || Segmentation || Lemma || Translation || Analysis
 
|-
 
|-
  +
| <vblex> || do something intentionally || membaca || meN+baca || baca (v.read) || v.read || baca<vblex><men>
| <code>&lt;der_n&gt;</code> || || || || ||
 
 
|-
 
|-
  +
| <vblex> || use something as a tool || memalu || meN+palu || palu (n.hammer)|| v.hammer || palu<vblex><men>
| <code>&lt;der_adj&gt;</code> || || || || ||
 
 
|-
 
|-
  +
| <vblex> || become || mengecil || meN+kecil || kecil (a.small)|| become smaller || kecil<vblex><men>
| <code>&lt;der_v&gt;</code> || || || || ||
 
|-
 
| <code>&lt;der_num&gt;</code> || || || || ||
 
 
|-
 
|-
  +
| <vblex> || go to a place || melaut || meN+laut || laut (n.sea) || go to the sea || laut<vblex><men>
 
|}
 
|}
   
Numeral Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;card&gt;</code> || cardinal number || tujuh || tujuh<num><card> || seven ||
 
|-
 
| <code>&lt;ord&gt;</code> || ordinal number || ketujuh || tujuh<num><ord> || the seventh ||
 
|-
 
| <code>&lt;coll&gt;</code> || collective number || bertujuh || tujuh<num><coll> || in group of seven ||
 
|-
 
| <code>&lt;refer&gt;</code> || referential number || tujuh-tujuhnya || tujuh<num>&lt;refer&gt; || all the seven of them ||
 
|-
 
| || || ketujuh-tujuhnya || tujuh<num>&lt;refer&gt; || all the seven of them ||
 
|-
 
|}
 
   
   
Verb Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;vbhaver&gt;</code> || verb ‘to have’ || berpayung || payung<n><actv><perf><vbhaver><bare><sg> || having umbrella(payung) ||
 
|-
 
| <code>&lt;vbser&gt;</code> || verb ‘to be’ || perbesar || besar<adj><actv><perf><vbser><sg> || making something to be big ||
 
|-
 
|}
 
 
Voice Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;actv&gt;</code> || active voice || mengirim || kirim<vblex><actv><imperf><sg> || send ||
 
|-
 
| <code>&lt;pasv&gt;</code> || passive voice || dikirim || kirim<vblex><pasv><imperf><sg> || being sent ||
 
|-
 
|}
 
 
Aspect Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;perf&gt;</code> || perfective aspect || kirim || kirim<vblex><actv><perf><sg> || send ||
 
|-
 
| <code>&lt;imperf&gt;</code> || imperfective aspect || mengirim || kirim<vblex><actv><imperf><sg> || send ||
 
|-
 
|}
 
 
Noun Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;bare&gt;</code> || original bare noun type || buku || buku<n><bare><sg> || book ||
 
|-
 
| <code>&lt;abstr&gt;</code> || derived abstract type || kedinginan || dingin<adj><abstr><sg> || coldness ||
 
|-
 
| <code>&lt;actio&gt;</code> || derived act noun || pengiriman || kirim<vblex><actio><sg> || delivery ||
 
|-
 
| || || pembukuan || buku<n><actio><sg> || archiving ||
 
|-
 
| <code>&lt;actor&gt;</code> || derived actor noun || pengirim || kirim<vblex><actor><sg> || deliverer ||
 
|-
 
| <code>&lt;ent&gt;</code> || derived entity noun || kiriman || kirim<vblex><ent><sg> || package ||
 
|-
 
| <code>&lt;theme&gt;</code> || derived theme noun || pertokoan || toko<n><theme><sg> || something related to shop ||
 
|-
 
|}
 
 
Adjective Type
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;pst&gt;</code> || original bare adjective type || besar || besar<adj><bare> || big ||
 
|-
 
| <code>&lt;sup&gt;</code> || superlative adjective || terbesar || besar<adj>&lt;sup&gt; || the biggest ||
 
|-
 
| <code>&lt;exceed&gt;</code> || exceed adjective || kebesaran || besar<adj><exceed> || too big || equally grammatical with "terlalu besar"
 
|-
 
| <code>&lt;manner&gt;</code> || manner adjective || keibuan || ibu<n><manner> || motherly ||
 
|-
 
| <code>&lt;uni&gt;</code> || union adjective || seibu || ibu<n><bare><uni><sg> || same/one mother ||
 
|-
 
| <code>&lt;possib&gt;</code> || adjectival phrase || secepat-cepatnya || cepat<adj><possib> || as quick as possible ||
 
|-
 
| || || sebesar-besarnya || besar<adj><possib> || as big as possible ||
 
|-
 
|}
 
 
Clitic Mark
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;enc&gt;</code> || enclitic || bukuku || buku<n><bare><sg>+aku<prn><pers><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
 
|-
 
| <code>&lt;pro&gt;</code> || proclitic || kukirim || aku<prn><pers><pro><p1><sg>+kirim<vblex><actv><perf><sg> || I send || The clitics are pronoun. In this example another form will be "aku cari", where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence
 
|-
 
|}
 
 
Transitivity Mark
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;appl&gt;</code> || Applicative || kirimi || kirim<vblex><actv><perf><appl><sg> || send ||
 
|-
 
| <code>&lt;caus&gt;</code> || Causative || kirimkan || kirim<vblex><actv><perf><caus><sg> || send ||
 
|-
 
|}
 
 
Prefix
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English
 
|-
 
| <code>&lt;men&gt;</code> || 1. inflecting transitive verbs. || melihat || lihat<vblex><men> || to see
 
|-
 
| || 2. deriving intransitive verbs with noun or adjective bases. || || ||
 
|-
 
| <code>&lt;di&gt;</code> || inflecting a verb, indicating passive voice. || || ||
 
|-
 
| <code>&lt;ber&gt;</code> || deriving intransitive verbs. || bermain || main<vblex><ber> || to play
 
|-
 
| <code>&lt;ter&gt;</code> || 1. deriving verbs. || tertawa || tawa<vblex><ter> || to laugh
 
|-
 
| || 2. deriving nouns. || || ||
 
|-
 
| <code>&lt;ke&gt;</code> || 1. deriving nouns. || || ||
 
|-
 
| || 2. deriving verbs. || || ||
 
|-
 
| || 3. deriving numbers. || || ||
 
|-
 
| <code>&lt;se&gt;</code> || to indicate one item (like "a" or "the" in English) || sebuah || buah<n><sg><se>|| one piece
 
|-
 
|}
 
 
Others
 
{|class=wikitable
 
! Tag || Description || Example || Example's Analysis || Translation in English || Notes
 
|-
 
| <code>&lt;cap&gt;</code> || capitalized word mark || Buku || buku<n><bare><sg><cap> || Book || Capitalized word
 
|-
 
| <code>&lt;pos&gt;</code> || possesive || bukuku || buku<n><bare><sg>+aku<prn><pers><enc><p1><sg><pos> || my book || The clitics are pronoun. In this example another form will be "buku aku", where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
 
|-
 
|}
 
   
 
==See also==
 
==See also==
   
 
* [[/Pending tests|Pending tests]]
 
* [[/Pending tests|Pending tests]]
  +
* [[/Previous Morphology|Previous Morphology]]
  +
   
 
==External links==
 
==External links==

Revision as of 07:47, 7 May 2012

Description about this project will be added soon.

Work Plan

Community Bonding Period (April 24, 2012 − May 20, 2012)
Week 1 (May 21, 2012 − May 27, 2012)
• Completing the Indonesian morphotactic and morphophonemic rules.
• Incorporating the extracted lemma list into the Indonesian monolingual dictionary.
• Adding more derivational subentries to the Indonesian monolingual dictionary.
Week 2 (May 28, 2012 − June 3, 2012)
• Continuing to add more derivational subentries to the Indonesian monolingual dictionary.
Week 3 (June 4, 2012 − June 10, 2012)
• Continuing to add more derivational subentries to the Indonesian monolingual dictionary.
• Completing the Malaysian morphotactic and morphophonemic rules. A significant portion of the completed Indonesian rules can be reused and modified as necessary, and moreover, new rules can be augmented.
Week 4 (June 11, 2012 − June 17, 2012)
• Extracting Malaysian stem words from the Kamus Dewan.
• Incorporating the extracted lemma list into the Malaysian monolingual dictionary
• Adding more derivational subentries to the Malaysian monolingual dictionary.


Morphology

Verb-forming affixes
Prefix meN-

Tags Meaning Example Segmentation Lemma Translation Analysis
<vblex> do something intentionally membaca meN+baca baca (v.read) v.read baca<vblex><men>
<vblex> use something as a tool memalu meN+palu palu (n.hammer) v.hammer palu<vblex><men>
<vblex> become mengecil meN+kecil kecil (a.small) become smaller kecil<vblex><men>
<vblex> go to a place melaut meN+laut laut (n.sea) go to the sea laut<vblex><men>



See also


External links