Difference between revisions of "WX notation"

From Apertium
Jump to navigation Jump to search
(unicode support is available)
(Link to French page)
 
(8 intermediate revisions by 7 users not shown)
Line 1: Line 1:
[[Notation WX|En français]]
'''WX notation''' is used to represent the Devanagari alphabet, which is used by [http://en.wikipedia.org/wiki/Sanskrit Sanskrit], [http://en.wikipedia.org/wiki/Hindi Hindi], Nepali, [http://en.wikipedia.org/wiki/Marathi Marathi], Bengali and many other Indian languages in ASCII. Devanagari script also has [http://en.wikipedia.org/wiki/Unicode Unicode] support.

{{TOCD}}
'''WX notation''' is used to represent the Devanagari alphabet, which is used by [http://en.wikipedia.org/wiki/Sanskrit Sanskrit], [http://en.wikipedia.org/wiki/Standard_Hindi Hindi], [http://en.wikipedia.org/wiki/Nepali_language Nepali], [http://en.wikipedia.org/wiki/Marathi_language Marathi], [http://en.wikipedia.org/wiki/Bengali_language Bengali] and many other Indian languages in ASCII. Devanagari script also has [http://en.wikipedia.org/wiki/Unicode Unicode] support.




Line 6: Line 9:
==Details==
==Details==
<pre>
<pre>
<anudev> there r some issues of assigning some letters of hindi with Unicode
<anudev> There are some issues regarding assigning some letters of Hindi with Unicode.
<anudev> still unresolved
<anudev> They are still unresolved.
<anudev> actually there is the issue of separate vowels and matras
<anudev> Actually there is the issue of separate vowels and matras.
<avinesh_> could u give an example
<avinesh_> Could you give an example?
<anudev> we don't need the vowels and matras(markers) differently
<anudev> We don't need the vowels and matras(markers) differently.
<avinesh_> because for every matra there is a mapping in wx
<avinesh_> For every matra there is a mapping in wx.
<anudev> like a, aa, ii, u r there
<anudev> Like a, aa, ii.
<avinesh_> yeah
<avinesh_> yeah
<anudev> but again ी ू े
<anudev> but again ी ू े
Line 53: Line 56:
* [http://sanskrit.inria.fr/DATA/wx.html WX notation: Overview]
* [http://sanskrit.inria.fr/DATA/wx.html WX notation: Overview]
* [http://mirror.umoss.org/mozdev/indicime/wx_keyboard.html WX Keyboard Mappings]
* [http://mirror.umoss.org/mozdev/indicime/wx_keyboard.html WX Keyboard Mappings]
* [http://ltrc.iiit.net/downloads/nlpbook/nlp-panini.pdf NLP: A paninian perspective] (page 191) [comment: this link does not work. ----svaksha]
* [http://ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf NLP: A paninian perspective] (page 191)


----
[[Category:Terminology]]
[[Category:Terminology]]
[[Category:Documentation in English]]
[[Category:Hindi]]
[[Category:IIIT]]

Latest revision as of 12:24, 7 October 2014

En français

WX notation is used to represent the Devanagari alphabet, which is used by Sanskrit, Hindi, Nepali, Marathi, Bengali and many other Indian languages in ASCII. Devanagari script also has Unicode support.


Table[edit]

Details[edit]

<anudev> There are some issues regarding assigning some letters of Hindi with Unicode. 
<anudev> They are still unresolved.
<anudev> Actually there is the issue of separate vowels and matras.
<avinesh_> Could you give an example?
<anudev> We don't need the vowels and matras(markers) differently.
<avinesh_> For every matra there is a mapping in wx.
<anudev> Like a, aa, ii.
<avinesh_> yeah 
<anudev> but again ी ू े 
<anudev> are not needed
<spectie> matras = ?
<avinesh_> matra is the later representation
<anudev> matras= markers
<anudev> ka
<anudev> kaa
<anudev> we will write kaa as kA in wx
<anudev> in unicode there is a separate place for both A and the marker aa
<anudev> we need a same code for both of them,
<avinesh_> sry still not getting ur point why should we use wx instead of unicode?
<avinesh_> but people only follow one convention either the A or aa 
<avinesh_> not both
<avinesh_> i mean if u see a document 
<avinesh_> it will generally be consistent
<anudev> I mean we write A for both the vowel and matra
<avinesh_> oh..
<avinesh_> ok
<avinesh_> got it
<anudev> but unicode will write differently for A as a vowel and matra
<avinesh_> k got it
<anudev> so it creates unnecessary complication
<spectie> so the problem is that in unicode
<spectie> combining characters have a separate code point
<spectie> and in WX they are unified to one code point?
<spectie> = letter
<anudev> yes
<spectie> why not use unicode normalisation ?

Examples[edit]

  • राम = र्+आ+म्+अ (rAma)
  • कृष्ण = क्+ऋ+ष्+ण्+अ (kqRNa)

External links[edit]