Dear Frank,
Yes, the two "m"s are the same. It's anusvaara (nasalization of the preceding vowel), and the placement of the dot (above or below the m) varies. I'm not sure I understand your second point, but if you encounter a word like nekkhamma, the conjunct is composed of two characters, -k- and -kh-; there is no -h- standing by itself.
Best, Bryan
--- On Tue, 2/15/11, frank <fcckuan@...> wrote:
From: frank <fcckuan@...>
Subject: [Pali] pali alphabetical order
To: Pali@yahoogroups.com
Received: Tuesday, February 15, 2011, 8:29 PM
I found a piece of Perl code for sorting Unicode text in UTF-8 format. I
adapted it slightly for Pali Unicode.
I ran it on Ven. BKh's list, and it seems to be working as intended, but
I have a couple of questions.
I used the Pali alphabetical order from Wikipedia:
http://en.wikipedia.org/wiki/Pali
The Pali alphabetical order is as follows:
* a ā i ī u ū e o ṁ k kh g gh ṅ c ch j jh ñ ṭ ṭh ḍ ḍh ṇ t th d dh n p ph
b bh m y r l ḷ v s h
1) I'm puzzled by "ṁ". Is that the same as "ṃ"?
2) For the code that does the alphabetizing, I removed all the Pali
alphabet letters that are written as two characters ending in "h". That
is, I removed kh, gh, jh, etc. Since single "h" comes at the end of the
list, I believe things should sort correctly just by comparing one
character at a time.
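A minimal sketch of the approach in point 2, written in Python rather than Perl (the adapted script itself is not shown here). It assumes the Wikipedia order quoted above with the two-character letters removed, and it treats ṃ and ṁ as the same letter, which is an assumption tied to question 1. The names PALI_ORDER, RANK, and pali_key are made up for illustration, and any character outside the list is assumed to sort last.

```python
import unicodedata

# Order quoted above, minus kh, gh, ch, jh, etc., so every entry is one character.
# Normalized to NFC so each letter with a diacritic is a single code point.
PALI_ORDER = unicodedata.normalize(
    "NFC",
    "a ā i ī u ū e o ṁ k g ṅ c j ñ ṭ ḍ ṇ t d n p b m y r l ḷ v s h",
).split()

# Map each character to its position in the Pali order.
RANK = {ch: i for i, ch in enumerate(PALI_ORDER)}


def pali_key(word):
    """Return a sort key that compares one character at a time."""
    w = unicodedata.normalize("NFC", word.lower())
    # Assumption: dot-below and dot-above m are the same letter.
    w = w.replace("ṃ", "ṁ")
    # Assumption: characters not in the list sort after everything else.
    return [RANK.get(ch, len(PALI_ORDER)) for ch in w]


words = ["khanti", "gacchati", "kamma", "nekkhamma",
         "saṃgha", "saṅgha", "ākāsa", "attā"]
print(sorted(words, key=pali_key))
```

Because "h" has the highest rank, "kh" followed by anything sorts after "k" followed by any other letter, and still before "g", so in this sketch kamma comes before khanti, which comes before gacchati.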
-Frank