Difference between revisions of "Languages of the Caucasus"

From Apertium
(pairs)
 
(25 intermediate revisions by 4 users not shown)
Line 1: Line 1:
  +
{{TOCD}}
The Caucasus is home to a number of languages and language families. The term "Languages of the Caucasus" is mostly a geographic grouping and does not imply anything about the relatedness of the languages or even their similarity (due to processes like language contact). A division may be made between North Caucasus languages and South Caucasus languages. North Caucasus languages include [[Kumyk]], [[Nogay]], [[Karachay-Balkar]], [[Avar]], [[Chechen]], [[Ossetian]], [[Abkhaz]], [[Adyghe]], [[Ingushetian]], etc., while South Caucasus languages include [[Georgian]], [[Azeri]], and [[Armenian]].
 
  +
  +
The Caucasus is home to a number of languages and language families. The term "Languages of the Caucasus" is mostly a geographic grouping and does not imply anything about the relatedness of the languages or even their similarity (due to processes like language contact). A division may be made between North Caucasus languages and South Caucasus languages. North Caucasus languages include [[Kumyk]], [[Nogay]], [[Karachay-Balkar]], [[Avar]], [[Chechen]], [[Kabardian]], [[Lak]], [[Lezgian]], [[Ossetian]], [[Abkhaz]], [[Adyghe]], [[Ingushetian]], etc., while South Caucasus languages include [[Georgian]], [[Azeri]], and [[Armenian]].
   
 
== Status ==
 
== Status ==
kum, nog, krc, ava, che, oss, abk, ady, inh, kat, aze, hye
+
kum, nog, krc, ava, che, oss, abk, ady, inh, kbd, lbe, lez, kat, aze, hye
  +
 
=== Transducers ===
 
=== Transducers ===
  +
{| class="wikitable sortable"
  +
|-
  +
!rowspan=2| name
  +
!rowspan=2| Language
  +
!rowspan=2| native name
  +
!colspan=2 class="unsortable"| ISO 639
  +
!rowspan=2| formalism
  +
!rowspan=2| state
  +
!rowspan=2| stems
  +
!rowspan=2| coverage
  +
!rowspan=2| location
  +
!rowspan=2 class="unsortable"| primary authors
  +
|-class="sortbottom"
  +
! -2
  +
! -3
  +
|-
  +
|| <code>[[apertium-kum]]</code>
  +
|| [[Kumyk]]
  +
|| къумукъ тил
  +
|| <code>-</code>
  +
|| <code>kum</code>
  +
|| HFST (lexc+twol)
  +
|| working
  +
|align="right"| {{#lst:apertium-kum/stats|stems}}
  +
|align="center"| [[Apertium-kum#Current_State|~{{:Apertium-kum/stats/average}}%]]
  +
|| [[apertium-kum]]&nbsp;([[languages]])
  +
|| [[User:Francis Tyers|Fran]], [[User:Firespeaker|Jonathan]]
  +
|-
  +
| <code>[[apertium-ava]]</code>
  +
|| [[Avar]]
  +
|| Магӏарул мацӏ
  +
|| <code>-</code>
  +
|| <code>ava</code>
  +
|| HFST (lexc+twol)
  +
|| development
  +
|align="right"|{{#lst:apertium-ava/stats|stems}}
  +
|align="center"| [[apertium-ava#Current_State|~{{:apertium-ava/stats/average}}%]]
  +
|| [[apertium-ava]] ([[languages]])
  +
|| [[User:Francis Tyers|Fran]]
  +
|-
  +
| <code>[[apertium-nog]]</code>
  +
|| [[Nogay]]
  +
|| Ногай тили
  +
|| <code>-</code>
  +
|| <code>nog</code>
  +
|| HFST (lexc+twol)
  +
|| development
  +
|align="right"|{{#lst:apertium-nog/stats|stems}}
  +
|align="center"| [[apertium-nog#Current_State|~{{:apertium-nog/stats/average}}%]]
  +
|| [[apertium-nog]] ([[languages]])
  +
|| [[User:Francis Tyers|Fran]], [[User:Firespeaker|Jonathan]]
  +
|-
  +
| <code>[[apertium-hye]]</code>
  +
|| [[Armenian]]
  +
||
  +
|| <code>-</code>
  +
|| <code>hye</code>
  +
|| HFST (lexc+twol)
  +
|| development
  +
|align="right"|{{#lst:apertium-hye/stats|stems}}
  +
|align="center"| [[apertium-hye#Current_State|~{{:apertium-hye/stats/average}}%]]
  +
|| [[apertium-hye]] ([[languages]])
  +
|| [[User:Francis Tyers|Fran]]
  +
|-
  +
|| <code>[[apertium-oss]]</code>
  +
|| [[Ossetian]]
  +
||
  +
|| <code>os</code>
  +
|| <code>oss</code>
  +
|| [[lttoolbox]]
  +
|| development
  +
  +
|align="right"|{{#lst:apertium-oss/stats|stems}}
  +
|align="center"| [[apertium-oss#Current_State|~{{:apertium-oss/stats/average}}%]]
  +
|| [[apertium-oss]]&nbsp;([[incubator]])
  +
||
  +
|-
  +
| <code>[[apertium-aze]]</code>
  +
|| [[Azerbaijani]]
  +
|| Azərbaycan dili
  +
|| <code>az</code>
  +
|| <code>aze</code>
  +
|| SFST
  +
|| not known to work
  +
||
  +
||
  +
|| [[apertium-tur-aze]] ([[staging]])
  +
|| [[User:zfe|Gianluca]]
  +
  +
|}
  +
  +
=== Languages by family ===
  +
* [[Turkic languages|Turkic]]: [[Kumyk]], [[Nogay]], Azerbaijani
  +
* Nakh-Daghestanian: [[Avar]], [[Chechen]], [[Ingush]], Lak, Lezgian
  +
* Indo-European: [[Armenian]], [[Ossetian]]
  +
* Kartvelian: Georgian
  +
* North-West Caucasian: Abkhaz, Adyghe
   
 
=== Existing language pairs ===
 
=== Existing language pairs ===
{| style="text-align: center;" class="wikitable"
+
{| style="text-align: center;" class="wikitable dixtable"
 
|- style="background: #ececec"
 
|- style="background: #ececec"
 
! !! kum !! nog !! krc !! ava !! che !! oss !! abk !! ady !! inh !! kat !! aze !! hye
 
! !! kum !! nog !! krc !! ava !! che !! oss !! abk !! ady !! inh !! kat !! aze !! hye
Line 36: Line 136:
 
| || || || || || || || || || || || ||
 
| || || || || || || || || || || || ||
 
|-
 
|-
| '''eng''' || || || || || || || || || || || || [[Apertium-hye-eng|hye-eng]]<br>12,218
+
| '''eng''' || || || || || || || || || || || || [[Apertium-hye-eng|hye-eng]]<br>{{#lst:Apertium-hye-eng/stats|hye-eng_stems}}
 
|-
 
|-
| '''kaz''' || ''[[Apertium-kaz-kum|kaz-kum]]''<br>8 || ''[[Apertium-nog-kaz|nog-kaz]]''<br>9 || || || || || || || || || ||
+
| '''kaz''' || ''[[Apertium-kaz-kum|kaz-kum]]''<br>{{#lst:Apertium-kaz-kum/stats|kaz-kum_stems}} || ''[[Apertium-nog-kaz|nog-kaz]]''<br>{{#lst:Apertium-nog-kaz/stats|nog-kaz_stems}} || || || || || || || || || ||
 
|-
 
|-
| '''rus''' || || || || ''[[Apertium-ava-rus|ava-rus]]''<br>1,432 || || || || || || || ||
+
| '''rus''' || || || || ''[[Apertium-ava-rus|ava-rus]]''<br>{{#lst:Apertium-ava-rus/stats|ava-rus_stems}} || || || || || || || ||
 
|-
 
|-
| '''tur''' || || || || || || || || || || || '''''[[Apertium-tur-aze|tur-aze]]'''''<br>8,194 ||
+
| '''tur''' || || || || || || || || || || || '''''[[Apertium-tur-aze|tur-aze]]'''''<br>{{#lst:Apertium-tur-aze/stats|tur-aze_stems}} ||
 
|}
 
|}
   
 
== Endangerment ==
 
== Endangerment ==
  +
{| class="wikitable sortable"
  +
!rowspan=2| Language
  +
!rowspan=2| ISO639-3
  +
!rowspan=2| Location
  +
!rowspan=2| Speakers
  +
!colspan=2|Status
  +
|-class="sortbottom"
  +
! Ethnologue
  +
! UNESCO
  +
|-
  +
|| Nogay
  +
|align="center"| <code>[http://www.ethnologue.com/language/nog nog]</code>
  +
|| Russian Federation
  +
|align="right"| 87,410
  +
|| 5 (Developing)
  +
|| 2 (Definitely endangered)
  +
|-
  +
|| Kumyk
  +
|align="center"| <code>[http://www.ethnologue.com/language/kum kum]</code>
  +
|| Russian Federation
  +
|align="right"| 426,550
  +
|| 5 (Developing)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Ossetic
  +
|align="center"| <code>[http://www.ethnologue.com/language/oss oss]</code>
  +
|| Georgia, Russian Federation
  +
|align="right"| 577,450
  +
|| 5 (Developing)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Kabardian
  +
|align="center"| <code>[http://www.ethnologue.com/language/kbd kbd]</code>
  +
|| Russian Federation, Turkey
  +
|align="right"| 1,628,500
  +
|| 5 (Developing)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Lak
  +
|align="center"| <code>[http://www.ethnologue.com/language/lbe lbe]</code>
  +
|| Russian Federation
  +
|align="right"| 153,170
  +
|| 4 (Educational)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Karachay-Balkar
  +
|align="center"| <code>[http://www.ethnologue.com/language/krc krc]</code>
  +
|| Russian Federation
  +
|align="right"| 310,730
  +
|| 4 (Educational)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Ingush
  +
|align="center"| <code>[http://www.ethnologue.com/language/inh inh]</code>
  +
|| Russian Federation
  +
|align="right"| 322,900
  +
|| 4 (Educational)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Avar
  +
|align="center"| <code>[http://www.ethnologue.com/language/ava ava]</code>
  +
|| Russian Federation
  +
|align="right"| 761,960
  +
|| 4 (Educational)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Lezgi
  +
|align="center"| <code>[http://www.ethnologue.com/language/lez lez]</code>
  +
|| Azerbaijan, Russian Federation
  +
|align="right"| 788,720
  +
|| 4 (Educational)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Abkhaz
  +
|align="center"| <code>[http://www.ethnologue.com/language/abk abk]</code>
  +
|| Georgia, Russian Federation, Turkey
  +
|align="right"| 112,740
  +
|| 2 (Provincial)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Adyghe
  +
|align="center"| <code>[http://www.ethnologue.com/language/ady ady]</code>
  +
|| Iraq, Israel, Jordan, The former Yugoslav Republic of Macedonia, Russian Federation, Syrian Arab Republic, Turkey
  +
|align="right"| 491,800
  +
|| 2 (Provincial)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Chechen
  +
|align="center"| <code>[http://www.ethnologue.com/language/che che]</code>
  +
|| Russian Federation
  +
|align="right"| 1,361,000
  +
|| 2 (Provincial)
  +
|| 1 (Vulnerable)
  +
|-
  +
|| Georgian
  +
|align="center"| <code>[http://www.ethnologue.com/language/kat kat]</code>
  +
|| Georgia
  +
|align="right"| 4,237,710
  +
|| 1 (National)
  +
|| -
  +
|-
  +
|| Armenian
  +
|align="center"| <code>[http://www.ethnologue.com/language/hye hye]</code>
  +
|| Armenia & Georgia, Russian Federation
  +
|align="right"| 5,924,320
  +
|| 1 (National)
  +
|| 3 (Severely endangered)
  +
|-
  +
|| Azerbaijani, Northern
  +
|align="center"| <code>[http://www.ethnologue.com/language/azj azj]</code>
  +
|| Azerbaijan
  +
|align="right"| 7,324,270
  +
|| 1 (National)
  +
|| -
  +
|}
  +
  +
== Examples ==
  +
* [[UDHR]]
  +
  +
{|class=wikitable
  +
! Language !! Text
  +
|-
  +
|| Ossetian || Адӕймӕгтӕ се' ппӕт дӕр райгуырынц сӕрибарӕй ӕмӕ ӕмхуызонӕй сӕ барты. Уыдон ӕххӕст сты зонд ӕмӕ намысӕй, ӕмӕ кӕрӕдзийӕн хъуамӕ уой ӕфсымӕрты хуызӕн.
  +
|-
  +
|| Abkhaz || Дарбанзаалак ауаҩы дшоуп ихы дақәиҭны. Ауаа зегь зинлеи патулеи еиҟароуп. Урҭ ирымоуп ахшыҩи аламыси, дара дарагь аешьеи аешьеи реиҧш еизыҟазароуп.
  +
|-
  +
|| Georgian || ყველა ადამიანი იბადება თავისუფალი და თანასწორი თავისი ღირსებითა და უფლებებით. მათ მინიჭებული აქვთ გონება და სინდისი და ერთმანეთის მიმართ უნდა იქცეოდნენ ძმობის სულისკვეთებით.
  +
|-
  +
|| Armenian || Բոլոր մարդիկ ծնվում են ազատ ու հավասար իրենց արժանապատվությամբ ու իրավունքներով։ Նրանք ունեն բանականություն ու խիղճ և միմյանց պետք է եղբայրաբար վերաբերվեն։
  +
|}
  +
  +
[[Category:Languages of the Caucasus|*]]

Latest revision as of 17:51, 27 August 2017

The Caucasus is home to a number of languages and language families. The term "Languages of the Caucasus" is mostly a geographic grouping: it implies nothing about whether the languages are related, or even whether they resemble one another (for instance through language contact). A division may be made between North Caucasus languages and South Caucasus languages. North Caucasus languages include Kumyk, Nogay, Karachay-Balkar, Avar, Chechen, Kabardian, Lak, Lezgian, Ossetian, Abkhaz, Adyghe, Ingush, etc., while South Caucasus languages include Georgian, Azeri, and Armenian.

Status

kum, nog, krc, ava, che, oss, abk, ady, inh, kbd, lbe, lez, kat, aze, hye

Transducers

{| class="wikitable sortable"
! name !! Language !! native name !! ISO 639-2 !! ISO 639-3 !! formalism !! state !! stems !! coverage !! location !! primary authors
|-
| apertium-kum || Kumyk || къумукъ тил || - || kum || HFST (lexc+twol) || working || 4,918 || ~90.2% || apertium-kum (languages) || Fran, Jonathan
|-
| apertium-ava || Avar || Магӏарул мацӏ || - || ava || HFST (lexc+twol) || development || 4,904 || ~86.5% || apertium-ava (languages) || Fran
|-
| apertium-nog || Nogay || Ногай тили || - || nog || HFST (lexc+twol) || development || 1,385 || ~81.4% || apertium-nog (languages) || Fran, Jonathan
|-
| apertium-hye || Armenian || || - || hye || HFST (lexc+twol) || development || 8,247 || ~63.5% || apertium-hye (languages) || Fran
|-
| apertium-oss || Ossetian || || os || oss || lttoolbox || development || 111 || ~17% || apertium-oss (incubator) ||
|-
| apertium-aze || Azerbaijani || Azərbaycan dili || az || aze || SFST || not known to work || || || apertium-tur-aze (staging) || Gianluca
|}

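Languages by family

* Turkic: Kumyk, Nogay, Azerbaijani
* Nakh-Daghestanian: Avar, Chechen, Ingush, Lak, Lezgian
* Indo-European: Armenian, Ossetian
* Kartvelian: Georgian
* North-West Caucasian: Abkhaz, Adyghe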

Existing language pairs

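{| style="text-align: center;" class="wikitable dixtable"
|- style="background: #ececec"
! !! kum !! nog !! krc !! ava !! che !! oss !! abk !! ady !! inh !! kat !! aze !! hye
|-
| '''kum''' || - || || || || || || || || || || ||
|-
| '''nog''' || || - || || || || || || || || || ||
|-
| '''krc''' || || || - || || || || || || || || ||
|-
| '''ava''' || || || || - || || || || || || || ||
|-
| '''che''' || || || || || - || || || || || || ||
|-
| '''oss''' || || || || || || - || || || || || ||
|-
| '''abk''' || || || || || || || - || || || || ||
|-
| '''ady''' || || || || || || || || - || || || ||
|-
| '''inh''' || || || || || || || || || - || || ||
|-
| '''kat''' || || || || || || || || || || - || ||
|-
| '''aze''' || || || || || || || || || || || - ||
|-
| '''hye''' || || || || || || || || || || || || -
|-
| '''eng''' || || || || || || || || || || || || hye-eng<br>12,218
|-
| '''kaz''' || ''kaz-kum''<br>561 || ''nog-kaz''<br>9 || || || || || || || || || ||
|-
| '''rus''' || || || || ''ava-rus''<br>5,509 || || || || || || || ||
|-
| '''tur''' || || || || || || || || || || || '''''tur-aze'''''<br>8,194 ||
|}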

Endangerment

{| class="wikitable sortable"
! Language !! ISO 639-3 !! Location !! Speakers !! Status (Ethnologue) !! Status (UNESCO)
|-
| Nogay || nog || Russian Federation || 87,410 || 5 (Developing) || 2 (Definitely endangered)
|-
| Kumyk || kum || Russian Federation || 426,550 || 5 (Developing) || 1 (Vulnerable)
|-
| Ossetic || oss || Georgia, Russian Federation || 577,450 || 5 (Developing) || 1 (Vulnerable)
|-
| Kabardian || kbd || Russian Federation, Turkey || 1,628,500 || 5 (Developing) || 1 (Vulnerable)
|-
| Lak || lbe || Russian Federation || 153,170 || 4 (Educational) || 1 (Vulnerable)
|-
| Karachay-Balkar || krc || Russian Federation || 310,730 || 4 (Educational) || 1 (Vulnerable)
|-
| Ingush || inh || Russian Federation || 322,900 || 4 (Educational) || 1 (Vulnerable)
|-
| Avar || ava || Russian Federation || 761,960 || 4 (Educational) || 1 (Vulnerable)
|-
| Lezgi || lez || Azerbaijan, Russian Federation || 788,720 || 4 (Educational) || 1 (Vulnerable)
|-
| Abkhaz || abk || Georgia, Russian Federation, Turkey || 112,740 || 2 (Provincial) || 1 (Vulnerable)
|-
| Adyghe || ady || Iraq, Israel, Jordan, The former Yugoslav Republic of Macedonia, Russian Federation, Syrian Arab Republic, Turkey || 491,800 || 2 (Provincial) || 1 (Vulnerable)
|-
| Chechen || che || Russian Federation || 1,361,000 || 2 (Provincial) || 1 (Vulnerable)
|-
| Georgian || kat || Georgia || 4,237,710 || 1 (National) || -
|-
| Armenian || hye || Armenia & Georgia, Russian Federation || 5,924,320 || 1 (National) || 3 (Severely endangered)
|-
| Azerbaijani, Northern || azj || Azerbaijan || 7,324,270 || 1 (National) || -
|}

Examples

{| class="wikitable"
! Language !! Text
|-
| Ossetian || Адӕймӕгтӕ се' ппӕт дӕр райгуырынц сӕрибарӕй ӕмӕ ӕмхуызонӕй сӕ барты. Уыдон ӕххӕст сты зонд ӕмӕ намысӕй, ӕмӕ кӕрӕдзийӕн хъуамӕ уой ӕфсымӕрты хуызӕн.
|-
| Abkhaz || Дарбанзаалак ауаҩы дшоуп ихы дақәиҭны. Ауаа зегь зинлеи патулеи еиҟароуп. Урҭ ирымоуп ахшыҩи аламыси, дара дарагь аешьеи аешьеи реиҧш еизыҟазароуп.
|-
| Georgian || ყველა ადამიანი იბადება თავისუფალი და თანასწორი თავისი ღირსებითა და უფლებებით. მათ მინიჭებული აქვთ გონება და სინდისი და ერთმანეთის მიმართ უნდა იქცეოდნენ ძმობის სულისკვეთებით.
|-
| Armenian || Բոլոր մարդիկ ծնվում են ազատ ու հավասար իրենց արժանապատվությամբ ու իրավունքներով։ Նրանք ունեն բանականություն ու խիղճ և միմյանց պետք է եղբայրաբար վերաբերվեն։
|}