Difference between revisions of "Indonesian"
Jump to navigation
Jump to search
Line 8: | Line 8: | ||
=== General === |
=== General === |
||
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa] |
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa] |
||
* [http://bahasa.cs.ui.ac.id/about.php bahasa.cs.ui.ac.id] |
|||
=== Corpora === |
=== Corpora === |
||
Line 25: | Line 26: | ||
* [http://kateglo.com/ Kateglo] |
* [http://kateglo.com/ Kateglo] |
||
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia] |
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia] |
||
* [http://bahasa.cs.ui.ac.id/iwn/ WordNet Bahasa Indonesia] |
|||
=== Grammar === |
=== Grammar === |
||
Line 40: | Line 42: | ||
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print]) |
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print]) |
||
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics. |
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics. |
||
⚫ | |||
⚫ | |||
* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print]) |
* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print]) |
||
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print]) |
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print]) |
||
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.] |
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.] |
||
* [http://www.aclweb.org/anthology/Y03-1007 Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association of Computational Linguistics.] |
|||
⚫ | |||
⚫ | |||
=== Miscellaneous === |
=== Miscellaneous === |
Revision as of 11:54, 21 November 2018
Indonesian (Wikipedia:Indonesian language) is an Austronesian language and the official language of Indonesia. Since it is a register of Malay, it is also often generally understood by Malay speakers, who primarily are in Malaysia, Brunei, and Singapore.
In Apertium, there is a language pair of Indonesian and Malaysian already in the trunk category as well as contrastive grammar for English and Indonesian.
Contents
Resources
This section lists resources available on Indonesian.
General
Corpora
- Wikipedia Bahasa Indonesia
- Indonesian UD (treebank)
- famrashel/idn-treebank
- famrashel/idn-tagged-corpus
- geovedi/indonesian-wordlist
- sastrawi/data
- SEAlang Library Indonesian Corpus
- IndonesianWaC
- kmkurn/id-nlp-resource
Dictionaries
- Kamus Besar Bahasa Indonesia Daring
- Senarai Padan Asing Indonesia
- Kateglo
- Wiktionary bahasa Indonesia
- WordNet Bahasa Indonesia
Grammar
- James Neil Sneddon, K Alexander Adelaar, Dwi N. Djenar, Michael Ewing. Indonesian: A Comprehensive Grammar. Routledge, 2010.
- Tim Pengembang Pedoman Bahasa Indonesia. Pedoman Umum Ejaan Bahasa Indonesia. Badan Bahasa, 2016.
- Mustakim. Bentuk dan Pilihan Kata. Badan Bahasa, 2014.
- Sriyanto. Ejaan. Badan Bahasa, 2014.
- Sry Satriya Tjatur Wisnu Sasangka. Kalimat. Badan Bahasa, 2014.
- Suladi. Paragraf. Badan Bahasa, 2014.
- Pedoman Umum Pembentukan Istilah. Pusat Bahasa, 2007.
- H. Steinhauer. "Gender and the Indonesian Pronouns." Wacana, 2010.
Machine Translation Tools and Techniques
- Ali Akbar Septiandri. "Predicting the Gender of Indonesian Names." arXiv, 2017.
- Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. (arXiv pre-print)
- Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics.
- Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. (arXiv pre-print)
- Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. (arXiv pre-print)
- Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.
- Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association of Computational Linguistics.