Devanagari is interesting because it's a completely rationalized alphabet. The symbols are organized by phonetic type and place of articulation, in the same way a modern linguist would lay them out, and they have been for thousands of years. Speakers of languages written in devanagari usually become incredibly frustrated when learning a language written in the Roman alphabet, because of the totally irrational and apparently random order the letters. The only other alphabet I know of that has the same level of organization is the Korean hangul script.
The reason for this order is the genius of the early Sanskrit linguists, one of whom, Panini, has a legitimate claim to being considered the greatest linguist of all time.
a aa i ii u uu r. r.r l. e o ai au m. h. ka kha ga gha nga ca cha ja jha nya t.a t.ha d.a d.ha n.a ta tha da dha na pa pha ba bha ma ya ra la va s'a s.a sa ha
The vowel symbols in the alphabet are used only for word-initial vowels. Elsewhere, they are represented by a system of combining marks which follow, precede, or hang above or below the consonant in question. If a consonant has no vowel marked, -a is assumed; there is a mark known as virama which indicates a word-final consonant.
There are also other characters not used in Classical Sanskrit which appear in some devanagari variants. Urdu devanagari has `dotted forms' of many characters, which indicate sounds borrowed from non-Indic languages such as Dravidian and Persian. When devanagari is used to write particularly old Vedas (written in Vedic, a somewhat formalized version of the living language which later became the literary language Sanskrit), there is a long form of the vowel l.
Devanagari makes extensive use of ligatures. Whenever consonants occur without an intervening vowel, they are written with a ligature. Forms of ligature include: vertical (the first consonant appearing above the second), horizontal (with the main vertical stroke on all but the last consonant omitted), and special (where the combined form does not resemble the separate consonants; the two most common examples are ks.a and jnya, which are learned by children as separate letters). In addition, r is represented specially in combination with other consonants: r before a consonant cluster is indicated by a mark above the cluster (to the right of any vowel marker), while r after a cluster is indicated by a diagonal tick in the lower left. The presence of these ligatures makes computerization of the devanagari script nontrivial but not impossible; in particular, the free software package ITRANS handles most devanagari ligatures rather well.
Devanagari is assigned the Unicode range U+0900 through U+097f. This range includes the basic alphabet, as well as special characters such as OM, numerals, Urdu characters, vowel markers, anunasika, avagraha, stress accent marks, and punctuation.
Devanagari script has one serious problem as I see it. When used to write Sanskrit, it is actually too phonetically faithful, and represents the sounds so accurately that the morphemes can be obscured. The classic analogy is the /s/ which represents the plural, as in cats, but sometimes it is pronounced as a z, as in dogs. In either case, however, it is written as s, because this represents the phoneme, not the actual pronunciation.
In Sanskrit, however, these different sounds would be written out, as if in English we wrote cats and *dogz. Furthermore, it becomes more complicated than this, because of sandhi. Therefore if we apply devanagari principles to English, a sentence which we write as I have to go to the store might be written as *I hafta gotth' store, depending of course on accent and dialect.
All this makes Sanskrit easier to pronounce correctly, based on the written form, but it really obscures words and word relationships, at least in the written form.
All other Indic scripts, as well as the Sinhala script of Sri Lanka, the Tibetan script and the Southeast Asian scripts (Thai, Lao, Khmer and Myanmar), as historically connected with the Devanagari script as descendants of the ancient Brahmi script. The entire family of scripts shares a large number of structural features.
The Devanagari block of the Unicode standard is based on ISCII-1988 (Indian Standard Code for Information Interchange) which is an update of earlier ISCII standards from 1983 and 1986. In 1991, a new version of ISCII was published in IS 13194, which partially modified the layout and repertoire; thus, Unicode does not precisely follow the layout of the new standard. Unicode remains a superset of the ISCII-1991 repertoire, except for a number of new Vedic extensions defined in IS 13194:1991 Annex G Extended Character Set for Vedic.
Unicode encodes Devanagari characters in the same relative position as those coded in positions 0xA0 to 0xF4 in ISCII-1988. The same character code layout is followed for eight other Indic scripts in Unicode : Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada and Malayalam. This parallel code layout emphasizes the structural similarities of the Brahmi scripts and follows the stated intention of the Indian coding standards to enable one-to-one mappings between analogous coding positions in different scripts in the family. Sinhala, Thai, Lao, Khmer and Myanmar depart to a greater extent from the Devanagari structural pattern, so the Unicode standard does not attempt to provide any direct mapping for these scripts to the Devanagari order.
The writing systems that employ Devanagari and other Indic scripts constitute a cross between syllabic writing systems as phonemic writing systems (alphabets). The effective unit of these writing systems is the orthographic syllable, consisting of a consonant and vowel (CV) core and zero or more preceding consonants, with a canonical structure of ((C)C)CV. The orthographic syllable need not correspond exactly with a phonological syllable, especially when a consonant cluster is involved, but the writing system is built on phonological principles and tends to correspond quite closely to pronunciation.
The orthographic syllable is built up of alphabetic pieces, the actual letters of the Devanagari script. The pieces consist of three distinct character types : consonant letters, independent vowels and dependent vowel signs. In a text sequence, these characters are stored in logical (phonetic) order.
Devanagari characters can combine or change shape depending on their context, such as its ordering with respect to other characters. A few Devanagari characters cause a change in the order of the displayed characters. This reordering is not commonly seen in non-Indic scripts and occurs independently of and bidirectional character reordering that might be required.
Be advised that the rules for correctly rendering Devanagari text from a stream of characters are horrifically complex.
NKo <-- Devanagari --> Bengali
Number of characters added in each version of the Unicode standard : Unicode 1.1 : 104 Unicode 4.0 : 1 Unicode 4.1 : 1 Unicode 5.0 : 4 Unicode 5.1 : 2
Number of characters in each General Category :
Letter, Modifier Lm : 1 Letter, Other Lo : 72 Mark, Non-Spacing Mn : 18 Mark, Spacing Combining Mc : 8 Number, Decimal Digit Nd : 10 Punctuation, Other Po : 3
Number of characters in each Bidirectional Category :
Left To Right L : 94 Non Spacing Mark NSM : 18
The columns below should be interpreted as :
If the characters below show up poorly, or not at all, see Unicode Support for possible solutions.
Devanagari
Based on ISCII 1988
Independent vowels
Consonants
Various signs
Dependent vowel signs
Additional consonants
Additional vowels for Sanskrit
Generic punctuation for scripts of India These punctuation marks are for common use for the scripts of India despite being named "DEVANAGARI".
Digits
Devanagari-specific additions
Additional vowel for Marathi
Sindhi implosives These are added from Amendment 3 to 10646:2003.
Glottal stop
http://unicode.org Some prose may have been lifted verbatim from unicode.org, as is permitted by their terms of use at http://www.unicode.org/copyright.html
De`va*na"ga*ri (?), n. [Skr. dvanagari; dva god + nagara city, i. e., divine city.]
The character in which Sanskrit is written.
© Webster 1913.
printable version chaos
Everything2 Help