Difference between revisions of "Indonesian"

From Apertium
Jump to navigation Jump to search
Line 8: Line 8:
=== General ===
=== General ===
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa]
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa]
* [http://bahasa.cs.ui.ac.id/about.php bahasa.cs.ui.ac.id]


=== Corpora ===
=== Corpora ===
Line 25: Line 26:
* [http://kateglo.com/ Kateglo]
* [http://kateglo.com/ Kateglo]
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia]
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia]
* [http://bahasa.cs.ui.ac.id/iwn/ WordNet Bahasa Indonesia]


=== Grammar ===
=== Grammar ===
Line 40: Line 42:
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print])
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print])
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics.
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics.

=== Morphology ===
* [https://github.com/ivanlanin/pengakar ivanlanin/pengakar]
* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print])
* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print])
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print])
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print])
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.]
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association of Computational Linguistics.]
* [http://www.aclweb.org/anthology/Y03-1007 Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association of Computational Linguistics.]

=== Morphology ===
* [https://github.com/ivanlanin/pengakar ivanlanin/pengakar]


=== Miscellaneous ===
=== Miscellaneous ===

Revision as of 11:54, 21 November 2018

Indonesian (Wikipedia:Indonesian language) is an Austronesian language and the official language of Indonesia. Since it is a register of Malay, it is also often generally understood by Malay speakers, who primarily are in Malaysia, Brunei, and Singapore.

In Apertium, there is a language pair of Indonesian and Malaysian already in the trunk category as well as contrastive grammar for English and Indonesian.

Resources

This section lists resources available on Indonesian.

General

Corpora

Dictionaries

Grammar

Machine Translation Tools and Techniques

Morphology

Miscellaneous