Difference between revisions of "Xhosa"

From Apertium
Jump to navigation Jump to search
 
(28 intermediate revisions by one other user not shown)
Line 3: Line 3:
'''Xhosa''' ([[Wikipedia: Xhosa language]]) is a ([[Wikipedia: Nguni Bantu]]) language spoken mainly in Africa. Its widespread use is not very common and only has a small number of individuals enacting the language (11 million)
'''Xhosa''' ([[Wikipedia: Xhosa language]]) is a ([[Wikipedia: Nguni Bantu]]) language spoken mainly in Africa. Its widespread use is not very common and only has a small number of individuals enacting the language (11 million)


* Zulu and Xhosa are relatably the most similar due to them both being Nguni transcripts (predominant in Southern Africa)

* Xhosa is also similar to the languages of Swati and Ndebele which are spoken among the Bantu people

'''Apertium Pairs'''


== Apertium Language Pairs ==
Currently, on [[Apertium]], the language of Xhosa is recorded to have one language pair:
Currently, on [[Apertium]], the language of Xhosa is recorded to have one language pair:
*'''[[apertium-zu-xh|Zulu-?-Xhosa]]''' (08 Nov 2010) (most likely incubator)
*'''[[apertium-zu-xh|Zulu-?-Xhosa]]''' (08 Nov 2010) (incubator)


([[list of language pairs]])
([[list of language pairs]])


== Computational Linguistics/Comparative Studies ==
== Computational Linguistics ==
'''Comparative Study: Zulu - Xhosa'''
* [http://www.aclweb.org/anthology/W09-0714 - The similarities between the two languages based on a morphological analyzer (ZuluMorph) cross-linguistically]


'''Xhosa Cross Linguistics'''
'''Xhosa Cross Linguistics'''
* [https://www.uwc.ac.za/Faculties/ART/Xhosa/Pages/Research-.aspx - Research by the University of Western Cape providing cross linguistics upon Xhosa corpora]
* [https://www.uwc.ac.za/Faculties/ART/Xhosa/Pages/Research-.aspx "Cross linguistics upon Xhosa Corpora Research"]

'''The Tone Analysis of Xhosa'''
*[http://www.tandfonline.com/doi/abs/10.1080/00020185808707048 "The Tonemes of Xhosa"]

== Comparative Studies ==



'''Xhosa Literary + Academic Study (Rhodes University)
'''Xhosa Literary + Academic Study (Rhodes University)
*[http://www.dispatchlive.co.za/lifestyle/2017/05/02/xhosa-holds-academic-studies/ Providing Academic Studies on Literary Aspects of Xhosa]
*[http://www.dispatchlive.co.za/lifestyle/2017/05/02/xhosa-holds-academic-studies/ "Providing Academic Studies on Literary Aspects of Xhosa"]


'''The Tone Analysis of Xhosa'''
'''Comparative Study: Zulu - Xhosa'''
*[http://www.tandfonline.com/doi/abs/10.1080/00020185808707048 The Tonemes of Xhosa]
* [http://www.aclweb.org/anthology/W09-0714 "ZuluMorph's Analysis on Xhosa to Zulu Translation"]






== Linguistic Grammar ==
== Linguistic Grammar ==
*http://www.academicroom.com/topics/what-is-xhosa-language - a concept summary providing full context over the following grammars stated and discussed
*[http://www.academicroom.com/topics/what-is-xhosa-language "What is Xhosa?"]
*[http://www.lingref.com/cpp/galana/2/paper1552.pdf "The Acquisition of Subject Agreement in Xhosa"]

*[http://www.athingforwords.com/words/learning-xhosa/ "Let’s Talk isiXhosa", ''the process of mastering a foreign tongue'']
*[http://www.omniglot.com/writing/xhosa.htm "Omniglot Writing Systems and Languages"]
Xhosa contains multiple prefixes and suffixes which are attached to root words. Thus, the language is declassified into fifteen morphological classes or genders. Furthermore, the language is unique based on its tones, the phonemic low, and high tones;
Xhosa contains multiple prefixes and suffixes which are attached to root words. Thus, the language is declassified into fifteen morphological classes or genders. Furthermore, the language is unique based on its tones, the phonemic low, and high tones;


Line 34: Line 44:
The usage of uncommon consonants is dominant throughout the language in the version of clicks. The language uses 21 clicks (7 dental), however, the number of clicks varies based on each region (Namibia and Botswana primarily)
The usage of uncommon consonants is dominant throughout the language in the version of clicks. The language uses 21 clicks (7 dental), however, the number of clicks varies based on each region (Namibia and Botswana primarily)


Full List of Grammar Examples (Apertium Page): [[Xhosa Grammar Examples]]
'''Examples'''

*ukudlala - to play

*ukubona - to see
*umntwana - a child
*abantwana - children
*umntwana uyadlala - the child plays
*abantwana bayadlala - the children play
*indoda - a man
*amadoda - men
*indoda iyambona umntwana - the man sees the child
*amadoda ayababona abantwana - the men see the children
*Zonke zinto ezilungile zivela kuThixo - all things that are good proceed from God.
Xhosa Pronunciation + Language Set:[http://www.omniglot.com/writing/xhosa.htm Omniglot Writing Systems and Languages]


Update: Zulu is a large complement to the language of Xhosa, so most common corpora, morphologies, etc. contain Zulu




Line 55: Line 54:
(most are mainly decoded within English subtexts and contexts)
(most are mainly decoded within English subtexts and contexts)


* [http://tshwanedje.com/members/gmds/documents/Prinsloo_on_OZSD.pdf Oxford Studies] http://tshwanedje.com/members/gmds/documents/Prinsloo_on_OZSD.pdf
* [http://tshwanedje.com/members/gmds/documents/Prinsloo_on_OZSD.pdf "Oxford Studies"]
* [https://glosbe.com/en/xh/ English to Xhosa Bilingual Dictionary] https://glosbe.com/en/xh/ - Includes Zulu transcripts
* [https://glosbe.com/en/xh/ "English to Xhosa Bilingual Dictionary"]
* [http://www.gononda.com/xhosa/ - Xhosa to English Dictionary] http://www.gononda.com/xhosa/
* [http://www.gononda.com/xhosa/ "Xhosa to English Dictionary"]
* [http://ufh.ac.za/centres/xnlu/introduction "isiXhosa National Lexicography Unit"]
*[http://dreuarchive.cra.org/2011/Johnson/kkjohnson_report.pdf "Xhosa-English Machine Translation: Working with a Low-Resource Language"]


''Elaboration on Machine Translation'':
Due to the language being indigenous and mostly forgotten as a dialect of the older remnants of Africa, many dictionaries could not be found so many of these dictionaries are transcribed to the modern English meanings

* Machine translation is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another.
* The isiXhosa language is a low resource language for which there are not many morphological analyzers, so much of the research is compiled into putting together a parallel text for which a translation model can be trained


== Monolingual/Parallel Corpora ==
== Monolingual/Parallel Corpora ==


* [http://corpora.uni-leipzig.de/en?corpusId=deu_newscrawl_2011 ''Leipzig Corpora Dictionaries''] http://corpora.uni-leipzig.de/en?corpusId=deu_newscrawl_2011
* [http://corpora.uni-leipzig.de/en?corpusId=deu_newscrawl_2011"Leipzig Corpora Dictionaries"]
* [http://curl.corpora.uni-leipzig.de/ - Crawling Under-Resourced Languages, Focusing upon Xhosa to Zulu Corpora]
* [http://curl.corpora.uni-leipzig.de/ "Crawling Under-Resourced Languages"]
* [http://uir.unisa.ac.za/bitstream/handle/10500/5762/MOROPA%20ARTICLEs7.pdf?sequence=1 Koliswa Moropa (2007) - English/Xhosa Parallel Corpus]
* [http://uir.unisa.ac.za/bitstream/handle/10500/5762/MOROPA%20ARTICLEs7.pdf?sequence=1 Koliswa Moropa (2007) - "English/Xhosa Parallel Corpus"]
* [https://books.google.com/books/about/Corpus_Linguistics_and_World_Englishes.html?id=SFpusC25Z44C - Corpus Linguistics (focusing on English Xhosan)]
* [https://books.google.com/books/about/Corpus_Linguistics_and_World_Englishes.html?id=SFpusC25Z44C "Corpus Linguistics"] -
* [http://researchspace.csir.co.za/dspace/bitstream/handle/10204/5589/Govender2_2005.pdf;sequence=1 Natasha Govender, Etienne Barnard, Marelie Davel: Developing Intonation Corpora for isiXhosa and isiZulu]
* [http://researchspace.csir.co.za/dspace/bitstream/handle/10204/5589/Govender2_2005.pdf;sequence=1 Natasha Govender, Etienne Barnard, Marelie Davel - "Developing Intonation Corpora for isiXhosa and isiZulu"]
* [https://muse.jhu.edu/article/405145/pdf "Corpus Linguistics and World Englishes"]

[[Category:Nguni Bantu languages]]

Latest revision as of 05:57, 18 April 2020

The Xhosa Language[edit]

Xhosa (Wikipedia: Xhosa language) is a (Wikipedia: Nguni Bantu) language spoken mainly in Africa. Its widespread use is not very common and only has a small number of individuals enacting the language (11 million)

  • Zulu and Xhosa are relatably the most similar due to them both being Nguni transcripts (predominant in Southern Africa)
  • Xhosa is also similar to the languages of Swati and Ndebele which are spoken among the Bantu people

Apertium Pairs

Currently, on Apertium, the language of Xhosa is recorded to have one language pair:

(list of language pairs)

Computational Linguistics[edit]

Xhosa Cross Linguistics

The Tone Analysis of Xhosa

Comparative Studies[edit]

Xhosa Literary + Academic Study (Rhodes University)

Comparative Study: Zulu - Xhosa


Linguistic Grammar[edit]

Xhosa contains multiple prefixes and suffixes which are attached to root words. Thus, the language is declassified into fifteen morphological classes or genders. Furthermore, the language is unique based on its tones, the phonemic low, and high tones;

  • they are a [à], á [á], â [áà], ä [àá]. Long vowels are phonemic but are usually not written, except for â and ä

The usage of uncommon consonants is dominant throughout the language in the version of clicks. The language uses 21 clicks (7 dental), however, the number of clicks varies based on each region (Namibia and Botswana primarily)

Full List of Grammar Examples (Apertium Page): Xhosa Grammar Examples



Bilingual/Monolingual Dictionaries[edit]

(most are mainly decoded within English subtexts and contexts)

Elaboration on Machine Translation:

  • Machine translation is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another.
  • The isiXhosa language is a low resource language for which there are not many morphological analyzers, so much of the research is compiled into putting together a parallel text for which a translation model can be trained

Monolingual/Parallel Corpora[edit]