Difference between revisions of "Languages of India"

From Apertium
Jump to navigation Jump to search
(Corrected spellings and added Also see)
 
(47 intermediate revisions by one other user not shown)
Line 24: Line 24:
|}
|}



== Transducers ==
{| class="wikitable sortable"
|-
!rowspan=2| name
!rowspan=2| Language
!rowspan=2| native name
!colspan=2 class="unsortable"| ISO 639
!rowspan=2| formalism
!rowspan=2| state
!rowspan=2| stems
!rowspan=2| coverage
!rowspan=2| location
!rowspan=2 class="unsortable"| primary authors
|-class="sortbottom"
! -1
! -3
|-
|| <code>[[apertium-hin]]</code>
|| [[Hindi]]
|| हिन्दी
|| <code>hi</code>
|| <code>hin</code>
|| HFST (lexc+twol)
|| working
|align="right"| {{#lst:apertium-hin/stats|stems}}
|align="center"| [[Apertium-hin#Current_State|~{{:Apertium-kaz/stats/average}}%]]
|-
|}


== Existing language pairs ==
== Existing language pairs ==
Text in ''italics'' denotes language pairs in the incubator. Regular text denotes a developing language pair in staging, while text in '''bold''' denotes a stable well-working language pair in trunk and text in '''''bold and italics''''' denotes a pair in staging. Bidix stems as counted with [[dixcounter]] are displayed below.
Text in ''italics'' denotes language pairs in the incubator. Regular text denotes a developing language pair in staging, while text in '''bold''' denotes a stable well-working language pair in trunk and text in '''''bold and italics''''' denotes a pair in staging. Bidix stems as counted with [[dixcounter]] are displayed below.



{| style="text-align: center;" class="wikitable dixtable"
{| style="text-align: center;" class="wikitable dixtable"
|- style="background: #ececec"
|- style="background: #ececec"
! !! eng !! hin
! !! eng !! hin !! asm !! ben !! guj !! mal !! mar !! pan !! tel !! urd
|-
|-
| '''eng''' || - || [[Apertium-eng-hin|eng-hin]]<br>{{#lst:Apertium-eng-hin/stat
| '''eng''' || - || [[Apertium-eng-hin|eng-hin]]<br>{{#lst:Apertium-eng-hin/stat
s|eng-hin_stems}}
s|eng-hin_stems}} || ''[[Apertium-asm-eng|asm-eng]]''<br>{{#lst:Apertium-asm-eng
/stats|asm-eng_stems}} || ''[[Apertium-bn-en|bn-en]]''<br>{{#lst:Apertium-bn-en/
stats|bn-en_stems}} || || ''[[Apertium-mal-eng|mal-eng]]''<br>{{#lst:Apertium-m
al-eng/stats|mal-eng_stems}} || ''[[Apertium-mar-eng|mar-eng]]''<br>{{#lst:Apert
ium-mar-eng/stats|mar-eng_stems}} || || ''[[Apertium-eng-tel|eng-tel]]''||
|-
|-
| '''hin''' || [[Apertium-eng-hin|eng-hin]]<br>{{#lst:Apertium-eng-hin/stats|eng
| '''hin''' || [[Apertium-eng-hin|eng-hin]]<br>{{#lst:Apertium-eng-hin/stats|eng
-hin_stems}} || -
-hin_stems}} || - || ''[[Apertium-as-hi|as-hi]]''<br>{{#lst:Apertium-as-hi/stats
|as-hi_stems}} || ''[[Apertium-bn-hi|bn-hi]]''<br>{{#lst:Apertium-bn-hi/stats|bn
|-
-hi_stems}} || ''[[Apertium-guj-hin|guj-hin]]''<br>
| || ||
|| || ''[[Apertium-mar-hin|mar-hin]]''<br>{{#lst:Apertium-mar-
hin/stats|mar-hin_stems}} || ''[[Apertium-pa-hi|pa-hi]]''<br>{{#lst:Apertium-pa-
hi/stats|pa-hi_stems}} || || '''[[Apertium-urd-hin|urd-hin]]'''<br>'''{{#lst:Ap
ertium-urd-hin/stats|urd-hin_stems}}'''
|-
|-
| '''asm''' || ''[[Apertium-asm-eng|asm-eng]]''<br>{{#lst:Apertium-asm-eng/stats
| '''asm''' || ''[[Apertium-asm-eng|asm-eng]]''<br>{{#lst:Apertium-asm-eng/stats
|asm-eng_stems}} || ''[[Apertium-as-hi|as-hi]]''<br>{{#lst:Apertium-as-hi/stats|
|asm-eng_stems}} || ''[[Apertium-as-hi|as-hi]]''<br>{{#lst:Apertium-as-hi/stats|
as-hi_stems}} || - || ''[[Apertium-asm-ben|asm-ben]]''<br>{{#lst:Apertium-asm-be
as-hi_stems}}
n/stats|asm-ben_stems}} || || || || || ||
|-
|-
| '''ben''' || ''[[Apertium-bn-en|bn-en]]''<br>{{#lst:Apertium-bn-en/stats|bn-en
| '''ben''' || ''[[Apertium-bn-en|bn-en]]''<br>{{#lst:Apertium-bn-en/stats|bn-en
_stems}} || ''[[Apertium-bn-hi|bn-hi]]''<br>{{#lst:Apertium-bn-hi/stats|bn-hi_st
_stems}} || ''[[Apertium-bn-hi|bn-hi]]''<br>{{#lst:Apertium-bn-hi/stats|bn-hi_st
ems}} || ''[[Apertium-asm-ben|asm-ben]]''<br>{{#lst:Apertium-asm-ben/stats|asm-b
ems}}
en_stems}} || - || || || || || ||
|-
|-
| '''guj''' || || ''[[Apertium-guj-hin|guj-hin]]''<br>{{#lst:Apertium-guj-hin/s
| '''guj''' || || ''[[Apertium-guj-hin|guj-hin]]''<br>{{#lst:Apertium-guj-hin/s
tats|guj-hin_stems}}
tats|guj-hin_stems}} || || || - || || || || ||
|-
|-
| '''mal''' || ''[[Apertium-mal-eng|mal-eng]]''<br>{{#lst:Apertium-mal-eng/stats
| '''mal''' || ''[[Apertium-mal-eng|mal-eng]]''<br>{{#lst:Apertium-mal-eng/stats
|mal-eng_stems}} ||
|mal-eng_stems}} || || || || || - || || || ||
|-
|-
| '''mar''' || ''[[Apertium-mar-eng|mar-eng]]''<br>{{#lst:Apertium-mar-eng/stats
| '''mar''' || ''[[Apertium-mar-eng|mar-eng]]''<br>{{#lst:Apertium-mar-eng/stats
|mar-eng_stems}} || ''[[Apertium-mar-hin|mar-hin]]''<br>{{#lst:Apertium-mar-hin/
|mar-eng_stems}} || ''[[Apertium-mar-hin|mar-hin]]''<br>{{#lst:Apertium-mar-hin/
stats|mar-hin_stems}}
stats|mar-hin_stems}} || || || || || - || || ''[[Apertium-tel-mar|tel-mar]]'' ||
|-
|-
| '''pan''' || || ''[[Apertium-pa-hi|pa-hi]]''<br>{{#lst:Apertium-pa-hi/stats|p
| '''pan''' || || ''[[Apertium-pa-hi|pa-hi]]''<br>{{#lst:Apertium-pa-hi/stats|p
a-hi_stems}} || || || || || || - || || ''[[Apertium-ur-pa|ur-pa]]''
a-hi_stems}}
|-
| '''snd''' || || ''[[Apertium-snd-hin|snd-hin]]''<br>{{#lst:Apertium-snd-hin/s
tats|snd-hin_stems}}
|-
|-
| '''tel''' || ''[[Apertium-eng-tel|eng-tel]]''<br>{{#lst:Apertium-eng-tel/stats
| '''tel''' || ''[[Apertium-eng-tel|eng-tel]]''<br>{{#lst:Apertium-eng-tel/stats
|eng-tel_stems}} ||
|eng-tel_stems}} || || || || || || || || - ||
|-
|-
| '''urd''' || || '''[[Apertium-urd-hin|urd-hin]]'''<br>'''{{#lst:Apertium-urd-
| '''urd''' || || '''[[Apertium-urd-hin|urd-hin]]'''<br>'''{{#lst:Apertium-urd-
hin/stats|urd-hin_stems}}'''
hin/stats|urd-hin_stems}}''' || || || || || || ''[[Apertium-ur-pa|ur-pa]]''
|| || -
|-
|-
|}
|}




== Indian languages by subgroup ==
== Indian languages by subgroup ==


*Indo-Aryan language family: [[Hindi]], [[Bengali]], [[Marathi]], [[Urdu]], [[Gujarati]], [[Punjabi]], [[Kashmiri]], [[Rajasthani]], [[Sindhi]],[[Assamese]], [[Maithili]] and [[Odia]].
== Vulnerability ==
*Dravidian language family: [[Telugu]], [[Tamil]], [[Kannada]] and [[Malayalam]].
This table summarizes the vulnerability of various Indian languages. Vulnerability data is derived from the ‘Atlas of the World’s Languages in Danger, © UNESCO, http://www.unesco.org/culture/languages-atlas’,[https://iasscore.in/national-issues/protection-of-endangered-languages IASSCORE] and [https://en.wikipedia.org/wiki/List_of_endangered_languages_in_India Wikipedia].
*Austroasiatic language family: [[Khasi]],[[Munda]] and [[Santali]].

*Sino-Tibetan language family: [[Meitei]],[[Bodo]], [[Karbi]] and [[Lepcha]].

*Tai-Kadai language family: [[Ahom]]
*Great Andamanese language family: [[Sentinelese]],[[Önge]] and [[Jarawa]].




Line 154: Line 134:
|-
|-
|}
|}

== Also See ==
*[[Indic languages]]

[[Category:Language families]]
[[Category:Language families]]

Latest revision as of 19:27, 12 January 2022

The Indian languages belong to six language families: Indo-Aryan language family,Austroasiatic language family,Dravidian language family, Sino-Tibetan language family,Tai–Kadai language family and Great Andamanese languages.

Status[edit]

Name Language Stems Covergae Loctaion
Apertium-hin Hindi 37,833 ~83.1% Apertium-hin
Apertium-urd Urdu 14,943 ~64.6% Apertium-urd


Existing language pairs[edit]

Text in italics denotes language pairs in the incubator. Regular text denotes a developing language pair in staging, while text in bold denotes a stable well-working language pair in trunk and text in bold and italics denotes a pair in staging. Bidix stems as counted with dixcounter are displayed below.


eng hin asm ben guj mal mar pan tel urd
eng - eng-hin
asm-eng
bn-en
mal-eng
mar-eng
eng-tel
hin eng-hin
- as-hi
833
bn-hi
guj-hin
mar-hin
pa-hi
'urd-hin
'
asm asm-eng
?
as-hi
833
- asm-ben
ben bn-en
bn-hi
asm-ben
-
guj guj-hin
-
mal mal-eng
9,095
-
mar mar-eng
102
mar-hin
- tel-mar
pan pa-hi
- ur-pa
tel eng-tel
1
-
urd 'urd-hin
'
ur-pa -

Indian languages by subgroup[edit]


Most widely spoken languages of India

  1. Hindi
  2. English
  3. Bengali
  4. Marathi
  5. Telegu
  6. Tamil
  7. Malayalam
  8. Kashmiri
  9. Urdu
  10. Sanskrit

Samples[edit]

Article 1 of the Universal Declaration of Human Rights:

All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.

Language Text
Hindi सभी मनुष्यों को गौरव और अधिकारों के मामले में जन्मजात स्वतन्त्रता और समानता प्राप्त है । उन्हें बुद्धि और अन्तरात्मा की देन प्राप्त है और परस्पर उन्हें भाईचारे के भाव से बर्ताव करना चाहिए ।
Marathi सर्व मानवी व्यक्ति जन्मतःच स्वतंत्र आहेत व त्यांना समान प्रतिष्ठा व समान अधिकार आहेत. त्यांना विचारशक्ति व सदसविद्वेकबुद्धि लाभलेली आहे. व त्यांनी एकमेकांशी बंधुत्याच्या भावनेने आचरण करावे.
Sanskrit सर्वे मानवाः स्वतन्त्राः समुत्पन्नाः वर्तन्ते अपि च, गौरवदृशा अधिकारदृशा च समानाः एव वर्तन्ते। एते सर्वे चेतना-तर्क-शक्तिभ्यां सुसम्पन्नाः सन्ति। अपि च, सर्वेऽपि बन्धुत्व-भावनया परस्परं व्यवहरन्तु।
Urdu تمام انسان آزاد اور حقوق و عزت کے اعتبار سے برابر پیدا ہوئے ہیں۔ انہیں ضمیر اور عقل ودیعت ہوئی ہے۔ اس لئے انہیں ایک دوسرے کے ساتھ بھائی چارے کا سلوک کرنا چاہیئے۔
Bangla সমস্ত মানুষ স্বাধীনভাবে সমান মর্যাদা এবং অধিকার নিয়ে জন্মগ্রহণ করে। তাঁদের বিবেক এবং বুদ্ধি আছে; সুতরাং সকলেরই একে অপরের প্রতি ভ্রাতৃত্বসুলভ মনোভাব নিয়ে আচরণ করা উচিত।
Gujarati પ્રતિષ્ઠા અને અધિકારોની દૃષ્ટિએ સર્વ માનવો જન્મથી સ્વતંત્ર અને સમાન હોય છે. તેમનામાં વિચારશક્તિ અને અંતઃકરણ હોય છે અને તેમણે પરસ્પર બંધુત્વની ભાવનાથી વર્તવું જોઇએ.
Punjabi ਸਾਰਾ ਮਨੁੱਖੀ ਪਰਿਵਾਰ ਆਪਣੀ ਮਹਿਮਾ, ਸ਼ਾਨ ਅਤੇ ਹੱਕਾਂ ਦੇ ਪੱਖੋਂ ਜਨਮ ਤੋਂ ਹੀ ਆਜ਼ਾਦ ਹੈ ਅਤੇ ਸੁਤੇ ਸਿੱਧ ਸਾਰੇ ਲੋਕ ਬਰਾਬਰ ਹਨ । ਉਨ੍ਹਾਂ ਸਭਨਾ ਨੂੰ ਤਰਕ ਅਤੇ ਜ਼ਮੀਰ ਦੀ ਸੌਗਾਤ ਮਿਲੀ ਹੋਈ ਹੈ ਅਤੇ ਉਨ੍ਹਾਂ ਨੂੰ ਭਰਾਤਰੀਭਾਵ ਦੀ ਭਾਵਨਾ ਰਖਦਿਆਂ ਆਪਸ ਵਿਚ ਵਿਚਰਣਾ ਚਾਹੀਦਾ ਹੈ ।
Kannada ಎಲ್ಲಾ ಮಾನವರೂ ಸ್ವತಂತ್ರರಾಗಿಯೇ ಜನಿಸಿದ್ದಾರೆ. ಹಾಗೂ ಘನತೆ ಮತ್ತು ಹಕ್ಕುಗಳಲ್ಲಿ ಸಮಾನರಾಗಿದ್ದಾರೆ. ವಿವೇಕ ಮತ್ತು ಅಂತಃಕರಣಗಳನ್ನು ಪಡೆದವರಾದ್ದರಿಂದ ಅವರು ಪರಸ್ಪರ ಸಹೋದರ ಭಾವದಿಂದ ವರ್ತಿಸಬೇಕು.
Malayalam മനുഷ്യരെല്ലാവരും തുല്യാവകാശങ്ങളോടും അന്തസ്സോടും സ്വാതന്ത്ര്യത്തോടുംകൂടി ജനിച്ചിട്ടുള്ളവരാണ്‌. അന്യോന്യം ഭ്രാതൃഭാവത്തോടെ പെരുമാറുവാനാണ്‌ മനുഷ്യന്നു വിവേകബുദ്ധിയും മനസ്സാക്ഷിയും സിദ്ധമായിരിക്കുന്നത്‌.
Telegu ప్రతిపత్తిస్వత్వముల విషయమున మానవులెల్లరును జన్మతః స్వతంత్రులును సమానులును నగుదురు. వారు వివేచన-అంతఃకరణ సంపన్నులగుటచే పరస్పరము భ్రాతృభావముతో వర్తింపవలయును.
Tamil எல்லா மனிதர்களும் ஒருவருக்கொருவர் தொடர்பில் சுதந்திரமாகவும், சமமாகவும், சமமாகவும் பிறந்தவர்கள். அவர்கள் புத்திஜீவிகள் வாரியாக இருக்க பரஸ்பரம் சாய்ந்திருக்க வேண்டும்.

Also See[edit]