Difference between revisions of "Indonesian"
Jump to navigation
Jump to search
(start new article for Indonesian) |
(→Corpora: added Leipzig Corpora Collection) |
||
(7 intermediate revisions by the same user not shown) | |||
Line 5: | Line 5: | ||
== Resources == |
== Resources == |
||
This section lists resources available on Indonesian. |
This section lists resources available on Indonesian. |
||
=== General === |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa] |
|||
* [http://bahasa.cs.ui.ac.id/about.php bahasa.cs.ui.ac.id] |
|||
=== Comparative studies === |
|||
* Baso Andi-Pallawa, Andi Fiptar Abdi Alam. "A Comparative Analysis between English and Indonesian Phonological Systems." International Journal of English Language Education, 2013. |
|||
* FX. Nadar. "A Comparative Study of The Indonesian and English Articles." Humaniora, 2013. |
|||
=== Corpora === |
|||
* [https://id.wikipedia.org/wiki/Halaman_Utama Wikipedia Bahasa Indonesia] |
|||
* [https://github.com/UniversalDependencies/UD_Indonesian-GSD Indonesian UD (treebank)] |
|||
* [https://github.com/famrashel/idn-treebank famrashel/idn-treebank] |
|||
* [https://github.com/famrashel/idn-tagged-corpus famrashel/idn-tagged-corpus] |
|||
* [https://github.com/geovedi/indonesian-wordlist geovedi/indonesian-wordlist] |
|||
* [https://github.com/sastrawi/sastrawi/tree/master/data sastrawi/data] |
|||
* [http://sealang.net/indonesia/corpus.htm SEAlang Library Indonesian Corpus] |
|||
* [https://www.sketchengine.eu/indonesianwac-corpus/ IndonesianWaC] |
|||
* [https://github.com/kmkurn/id-nlp-resource kmkurn/id-nlp-resource] |
|||
* [https://corpora.uni-leipzig.de/en?corpusId=ind_mixed_2013 Leipzig Corpora Collection - Indonesian] |
|||
=== Dictionaries === |
|||
* [https://kbbi.kemdikbud.go.id/ Kamus Besar Bahasa Indonesia Daring] |
|||
* [http://spai.kemdikbud.go.id/ Senarai Padan Asing Indonesia] |
|||
* [http://kateglo.com/ Kateglo] |
|||
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia] |
|||
* [http://bahasa.cs.ui.ac.id/iwn/ WordNet Bahasa Indonesia] |
|||
=== Grammar === |
|||
* James Neil Sneddon, K Alexander Adelaar, Dwi N. Djenar, Michael Ewing. ''Indonesian: A Comprehensive Grammar''. Routledge, 2010. |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/sites/default/files/PUEBI.pdf Tim Pengembang Pedoman Bahasa Indonesia. ''Pedoman Umum Ejaan Bahasa Indonesia''. Badan Bahasa, 2016.] |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/sites/default/files/Buku%20Penyuluhan%20BPK.pdf Mustakim. ''Bentuk dan Pilihan Kata''. Badan Bahasa, 2014.] |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/sites/default/files/Buku%20Penyuluhan%20Ejaan.pdf Sriyanto. ''Ejaan''. Badan Bahasa, 2014.] |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/sites/default/files/Buku%20Penyuluhan%20Kalimat.pdf Sry Satriya Tjatur Wisnu Sasangka. ''Kalimat''. Badan Bahasa, 2014.] |
|||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/sites/default/files/Buku%20Penyuluhan%20Paragraf.pdf Suladi. ''Paragraf''. Badan Bahasa, 2014.] |
|||
* [https://id.wikisource.org/wiki/Pedoman_Umum_Pembentukan_Istilah ''Pedoman Umum Pembentukan Istilah''. Pusat Bahasa, 2007.] |
|||
* [https://docs.google.com/viewerng/viewer?url=http://wacana.ui.ac.id/index.php/wjhi/article/viewFile/119/112 H. Steinhauer. "Gender and the Indonesian Pronouns." Wacana, 2010.] |
|||
=== Machine Translation Tools and Techniques === |
|||
* [https://www.researchgate.net/publication/318670685_Predicting_the_Gender_of_Indonesian_Names Ali Akbar Septiandri. "Predicting the Gender of Indonesian Names." arXiv, 2017.] |
|||
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print]) |
|||
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics. |
|||
* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print]) |
|||
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print]) |
|||
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.] |
|||
* [http://www.aclweb.org/anthology/Y03-1007 Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association of Computational Linguistics.] |
|||
=== Morphology === |
|||
* [https://github.com/ivanlanin/pengakar ivanlanin/pengakar] |
|||
=== Miscellaneous === |
|||
* [https://rubrikbahasa.wordpress.com/ Rubrik Bahasa] |
|||
* [https://id.wikisource.org/wiki/Kategori:Bahasa_Indonesia Kategori Bahasa Indonesia di Wikisource] |
|||
[[Category:Indonesian]] |
|||
[[Category:Languages]] |
Latest revision as of 13:08, 21 December 2019
Indonesian (Wikipedia:Indonesian language) is an Austronesian language and the official language of Indonesia. Since it is a register of Malay, it is also often generally understood by Malay speakers, who primarily are in Malaysia, Brunei, and Singapore.
In Apertium, there is a language pair of Indonesian and Malaysian already in the trunk category as well as contrastive grammar for English and Indonesian.
Contents
Resources[edit]
This section lists resources available on Indonesian.
General[edit]
Comparative studies[edit]
- Baso Andi-Pallawa, Andi Fiptar Abdi Alam. "A Comparative Analysis between English and Indonesian Phonological Systems." International Journal of English Language Education, 2013.
- FX. Nadar. "A Comparative Study of The Indonesian and English Articles." Humaniora, 2013.
Corpora[edit]
- Wikipedia Bahasa Indonesia
- Indonesian UD (treebank)
- famrashel/idn-treebank
- famrashel/idn-tagged-corpus
- geovedi/indonesian-wordlist
- sastrawi/data
- SEAlang Library Indonesian Corpus
- IndonesianWaC
- kmkurn/id-nlp-resource
- Leipzig Corpora Collection - Indonesian
Dictionaries[edit]
- Kamus Besar Bahasa Indonesia Daring
- Senarai Padan Asing Indonesia
- Kateglo
- Wiktionary bahasa Indonesia
- WordNet Bahasa Indonesia
Grammar[edit]
- James Neil Sneddon, K Alexander Adelaar, Dwi N. Djenar, Michael Ewing. Indonesian: A Comprehensive Grammar. Routledge, 2010.
- Tim Pengembang Pedoman Bahasa Indonesia. Pedoman Umum Ejaan Bahasa Indonesia. Badan Bahasa, 2016.
- Mustakim. Bentuk dan Pilihan Kata. Badan Bahasa, 2014.
- Sriyanto. Ejaan. Badan Bahasa, 2014.
- Sry Satriya Tjatur Wisnu Sasangka. Kalimat. Badan Bahasa, 2014.
- Suladi. Paragraf. Badan Bahasa, 2014.
- Pedoman Umum Pembentukan Istilah. Pusat Bahasa, 2007.
- H. Steinhauer. "Gender and the Indonesian Pronouns." Wacana, 2010.
Machine Translation Tools and Techniques[edit]
- Ali Akbar Septiandri. "Predicting the Gender of Indonesian Names." arXiv, 2017.
- Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. (arXiv pre-print)
- Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics.
- Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. (arXiv pre-print)
- Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. (arXiv pre-print)
- Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.
- Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association of Computational Linguistics.