Difference between revisions of "Indonesian"

From Apertium
(→‎Corpora: added Leipzig Corpora Collection)
 
(4 intermediate revisions by the same user not shown)
Line 8: Line 8:
=== General ===
* [http://badanbahasa.kemdikbud.go.id/lamanbahasa/ Badan Pengembangan dan Pembinaan Bahasa]
* [http://bahasa.cs.ui.ac.id/about.php bahasa.cs.ui.ac.id]

=== Comparative studies ===
* Baso Andi-Pallawa, Andi Fiptar Abdi Alam. "A Comparative Analysis between English and Indonesian Phonological Systems." International Journal of English Language Education, 2013.
* FX. Nadar. "A Comparative Study of The Indonesian and English Articles." Humaniora, 2013.


=== Corpora ===
Line 19: Line 24:
* [https://www.sketchengine.eu/indonesianwac-corpus/ IndonesianWaC]
* [https://github.com/kmkurn/id-nlp-resource kmkurn/id-nlp-resource]
* [https://corpora.uni-leipzig.de/en?corpusId=ind_mixed_2013 Leipzig Corpora Collection - Indonesian]


=== Dictionaries ===
Line 25: Line 31:
* [http://kateglo.com/ Kateglo]
* [https://id.wiktionary.org/wiki/Halaman_Utama Wiktionary bahasa Indonesia]
* [http://bahasa.cs.ui.ac.id/iwn/ WordNet Bahasa Indonesia]


=== Grammar ===
Line 40: Line 47:
* Kemal Kurniawan, Samuel Louvan. "IndoSum: A New Benchmark Dataset for Indonesian Text Summarization." IALP, 2018. ([https://arxiv.org/abs/1810.05334 arXiv pre-print])
* Ayu Purwarianti, Alvin Andhika, Alfan Farizki Wicaksono, Irfan Afif, Filman Ferdian. "InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification." 2016 International Conference On Advanced Informatics.

* Kemal Kurniawan, Samuel Louvan. "Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts." EMNLP, 2018. ([https://arxiv.org/abs/1805.12291 arXiv pre-print])
* Kemal Kurniawan, Alham Fikri Aji. "Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging." IALP, 2018. ([https://arxiv.org/abs/1809.03391 arXiv pre-print])
* [http://www.aclweb.org/anthology/U08-1018 Femphy Pisceldo, Rahmad Mahendra, Ruli Manurung, I Wayan Arka. "A Two-Level Morphological Analyser for the Indonesian Language." Association for Computational Linguistics.]
* [http://www.aclweb.org/anthology/Y03-1007 Lily Suryana Indradjaja, Stéphane Bressan. "Automatic Learning of Stemming Rules for the Indonesian Language." Association for Computational Linguistics.]

=== Morphology ===
* [https://github.com/ivanlanin/pengakar ivanlanin/pengakar]


=== Miscellaneous ===
* [https://rubrikbahasa.wordpress.com/ Rubrik Bahasa]
* [https://id.wikisource.org/wiki/Kategori:Bahasa_Indonesia Kategori Bahasa Indonesia di Wikisource]

[[Category:Indonesian]]
[[Category:Languages]]

Latest revision as of 13:08, 21 December 2019

Indonesian (Wikipedia:Indonesian language) is an Austronesian language and the official language of Indonesia. Because it is a register of Malay, it is generally understood by Malay speakers, who live primarily in Malaysia, Brunei, and Singapore.

Apertium already has an Indonesian and Malaysian language pair in trunk, as well as a contrastive grammar for English and Indonesian.

Resources

This section lists resources available for Indonesian.

General

Comparative studies

  • Baso Andi-Pallawa, Andi Fiptar Abdi Alam. "A Comparative Analysis between English and Indonesian Phonological Systems." International Journal of English Language Education, 2013.
  • FX. Nadar. "A Comparative Study of The Indonesian and English Articles." Humaniora, 2013.

Corpora

Dictionaries

Grammar

Machine Translation Tools and Techniques

Morphology

Miscellaneous