Dear Frank,

Yes the two "m's" are the same. It's anusvaara (nasalization of a previous vowel) and the placement of the dot is sometimes variable. I'm not sure I understand your second point,but if you encounter a word like nekkhamma, the conjunct is composed of two characters -k- and -kh- there is no -h- standing by itself,

Best, Bryan





--- On Tue, 2/15/11, frank <fcckuan@...> wrote:

From: frank <fcckuan@...>
Subject: [Pali] pali alphabetical order
To: Pali@yahoogroups.com
Received: Tuesday, February 15, 2011, 8:29 PM
















 











I found a piece of perl code for sorting unicode utf-8 format text. I

adapted it slightly for pali unicode.

I ran it on Ven. BKh's list, and it seems to be working as intended, but

I have a couple of questions.



I used the pali alphabetical order from wikipedia

http://en.wikipedia.org/wiki/Pali

The Pali alphabetical order is as follows:



* a ā i ī u ū e o ṁ k kh g gh ṅ c ch j jh ñ ṭ ṭh ḍ ḍh ṇ t th d dh n p ph

b bh m y r l ḷ v s h



1) I'm puzzled by "ṁ". Is that the same as "ṃ"?

2) For the code that does the alphabetizing, I removed all the pali

alphabet characters represented by double characters ending in "h". That

is I removed kh, gh, jh, etc. Since single "h" comes at the end of the

list, I believe things should sort correctly just by comparing one

character at a time.



-Frank





























[Non-text portions of this message have been removed]