Re: Cree collation sequence

--- In qalam@yahoogroups.com, "Richard Wordingham"
<richard.wordingham@...> wrote:

As I said - thanks for resorting my list.

> Why do you call the Eastern Cree pre-aspiration symbol U+1426 a
> diacritic?

That was in the days before computers ruled! It used to be called a
diacritic and a little used one at that. Now it is a 'spacing'
character and can be called a diacritic no longer. So sorry!

>
> > How many possible encodings would there
> > be for the word "legend"?
>
> Is this a trick question? The two spellings, namely
>
> E: U+140B U+144E U+152B U+1426 U+1473 U+14D0 AA-TI-YUU-H-KAA-N

legend

> N: U+140A U+144E U+152A U+146D U+14D0 A-TI-YU-KI-N legend

KA/KI reflect a regional difference in pronunciation. Otherewise,
16+ different encodings as someone may dot the TI.

> Why did you call your message 'Cree collation sequence'?

Collation

> sequences are not defined by Unicode, and could not be, for they

are

> language-dependent.

The sequence in these two dictionaries represent different
decisions by the authors of the dictionary. They have little to do
with the language. I know that *ostensibly* they are for two
different languages but I don't want to argue with Peter Constable
over that issue.

It is quite conceivable that one person will keep data in pointed
text and another person will keep data in unpointed text for the
same language. In reality they both might use some points but not
always the same ones. Sometimes preaspiration will be represented
but more likely not.

Wouldn't it be easier to have the vowel length overdot represented
by one non-spacing codepoint and then that codepoint could be
excluded for the purposes of data sharing and sorting? Maybe tables
have already been created to match up the pointed and unpointed
syllabics. If so I would love to hear about it.

Suzanne