Difference between revisions of "WX notation"
Jump to navigation
Jump to search
(New page: '''WX notation''' is used to represent the Devanagari alphabet, used by Hindi, Nepali, Marathi, Bengali and many other Indian languages in ASCII. ==External links== * [http://sanskrit.i...) |
(Link to French page) |
||
(17 intermediate revisions by 8 users not shown) | |||
Line 1: | Line 1: | ||
[[Notation WX|En français]] |
|||
'''WX notation''' is used to represent the Devanagari alphabet, used by Hindi, Nepali, Marathi, Bengali and many other Indian languages in ASCII. |
|||
{{TOCD}} |
|||
'''WX notation''' is used to represent the Devanagari alphabet, which is used by [http://en.wikipedia.org/wiki/Sanskrit Sanskrit], [http://en.wikipedia.org/wiki/Standard_Hindi Hindi], [http://en.wikipedia.org/wiki/Nepali_language Nepali], [http://en.wikipedia.org/wiki/Marathi_language Marathi], [http://en.wikipedia.org/wiki/Bengali_language Bengali] and many other Indian languages in ASCII. Devanagari script also has [http://en.wikipedia.org/wiki/Unicode Unicode] support. |
|||
==Table== |
|||
==Details== |
|||
<pre> |
|||
<anudev> There are some issues regarding assigning some letters of Hindi with Unicode. |
|||
<anudev> They are still unresolved. |
|||
<anudev> Actually there is the issue of separate vowels and matras. |
|||
<avinesh_> Could you give an example? |
|||
<anudev> We don't need the vowels and matras(markers) differently. |
|||
<avinesh_> For every matra there is a mapping in wx. |
|||
<anudev> Like a, aa, ii. |
|||
<avinesh_> yeah |
|||
<anudev> but again ी ू े |
|||
<anudev> are not needed |
|||
<spectie> matras = ? |
|||
<avinesh_> matra is the later representation |
|||
<anudev> matras= markers |
|||
<anudev> ka |
|||
<anudev> kaa |
|||
<anudev> we will write kaa as kA in wx |
|||
<anudev> in unicode there is a separate place for both A and the marker aa |
|||
<anudev> we need a same code for both of them, |
|||
<avinesh_> sry still not getting ur point why should we use wx instead of unicode? |
|||
<avinesh_> but people only follow one convention either the A or aa |
|||
<avinesh_> not both |
|||
<avinesh_> i mean if u see a document |
|||
<avinesh_> it will generally be consistent |
|||
<anudev> I mean we write A for both the vowel and matra |
|||
<avinesh_> oh.. |
|||
<avinesh_> ok |
|||
<avinesh_> got it |
|||
<anudev> but unicode will write differently for A as a vowel and matra |
|||
<avinesh_> k got it |
|||
<anudev> so it creates unnecessary complication |
|||
<spectie> so the problem is that in unicode |
|||
<spectie> combining characters have a separate code point |
|||
<spectie> and in WX they are unified to one code point? |
|||
<spectie> = letter |
|||
<anudev> yes |
|||
<spectie> why not use unicode normalisation ? |
|||
</pre> |
|||
==Examples== |
|||
* राम = र्+आ+म्+अ (rAma) |
|||
* कृष्ण = क्+ऋ+ष्+ण्+अ (kqRNa) |
|||
==External links== |
==External links== |
||
* [http://sanskrit.inria.fr/DATA/wx.html WX notation: Overview] |
* [http://sanskrit.inria.fr/DATA/wx.html WX notation: Overview] |
||
* [http://mirror.umoss.org/mozdev/indicime/wx_keyboard.html WX Keyboard Mappings] |
|||
* [http://ltrc.iiit. |
* [http://ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf NLP: A paninian perspective] (page 191) |
||
---- |
|||
[[Category:Terminology]] |
[[Category:Terminology]] |
||
[[Category:Documentation in English]] |
|||
[[Category:Hindi]] |
|||
[[Category:IIIT]] |
Latest revision as of 12:24, 7 October 2014
Contents |
WX notation is used to represent the Devanagari alphabet, which is used by Sanskrit, Hindi, Nepali, Marathi, Bengali and many other Indian languages in ASCII. Devanagari script also has Unicode support.
Table[edit]
Details[edit]
<anudev> There are some issues regarding assigning some letters of Hindi with Unicode. <anudev> They are still unresolved. <anudev> Actually there is the issue of separate vowels and matras. <avinesh_> Could you give an example? <anudev> We don't need the vowels and matras(markers) differently. <avinesh_> For every matra there is a mapping in wx. <anudev> Like a, aa, ii. <avinesh_> yeah <anudev> but again ी ू े <anudev> are not needed <spectie> matras = ? <avinesh_> matra is the later representation <anudev> matras= markers <anudev> ka <anudev> kaa <anudev> we will write kaa as kA in wx <anudev> in unicode there is a separate place for both A and the marker aa <anudev> we need a same code for both of them, <avinesh_> sry still not getting ur point why should we use wx instead of unicode? <avinesh_> but people only follow one convention either the A or aa <avinesh_> not both <avinesh_> i mean if u see a document <avinesh_> it will generally be consistent <anudev> I mean we write A for both the vowel and matra <avinesh_> oh.. <avinesh_> ok <avinesh_> got it <anudev> but unicode will write differently for A as a vowel and matra <avinesh_> k got it <anudev> so it creates unnecessary complication <spectie> so the problem is that in unicode <spectie> combining characters have a separate code point <spectie> and in WX they are unified to one code point? <spectie> = letter <anudev> yes <spectie> why not use unicode normalisation ?
Examples[edit]
- राम = र्+आ+म्+अ (rAma)
- कृष्ण = क्+ऋ+ष्+ण्+अ (kqRNa)