Does anyone here know where I can find some machine-readable texts in
the Yi syllabary? If there's a choice of forms, HTML would be
preferable to PDF.
I'm interested in the frequency and co-occurrence of syllables and how
much alien text there is, e.g. Chinese ideographs or Western
punctuation. I'm afraid the reason isn't particularly linguistic - I
am wondering how well simple Unicode compression algorithms could be
made to work on it.