Dear Yong Peng & others,

I updated the "word list generator". It's possible now to generate a list of unique words for any unicode text with additional word frequency output. The generated list will be ordered by the number of occurences of a word.

You find it here:

www.nibbanam.com/pali_language_tools.html

Simply set the "count" flag when starting the program, in this way:

pwlg "PATH/myPaliText.txt" roman count

output will be something like this (sample is from the Majjhima Nikaya, Volume One)

....
yathā Count: 199
iti Count: 203
aggivessana Count: 204
ayaṃ Count: 221
ahaṃ Count: 231
sāriputta Count: 233
bhante Count: 240
ye Count: 241
yaṃ Count: 250
atha Count: 258
bhikkhū Count: 259
dhammaṃ Count: 267
tassa Count: 281
cittaṃ Count: 292
pana Count: 296
bhagavā Count: 321
no Count: 331
viharati Count: 344
pe Count: 352
...


mettâya,

Lennart

PS: It's a quick "hack" if you find bugs, let me know :-)
----- Original Message -----
From: Ong Yong Peng
To: Pali@yahoogroups.com
Sent: Saturday, February 05, 2005 5:47 AM
Subject: [Pali] Re: Pali programming projects?



Dear Lennart, Clay and friends,

Lennart: I wonder if you would put in the word count (no. of
occurrence) for each word in a document? Thanks.


metta,
Yong Peng.


--- In Pali@yahoogroups.com, Lennart Lopin wrote:
You are right, you could use the word list generator for any unicode
based document. You could even parse the entire suttapitaka and get a
list with all vocabulary you will ever encounter - its quite handy if
you want to build a dictionary or :-) a word list...

> ----- Original Message -----
> From: Clay Collier
>
> Your projects look interesting, Lennart. Am I correct in guessing
that the Pali wordlist generator could be used to create a similar
list for any Unicode-encoded document?





- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
[Homepage] http://www.tipitaka.net
[Send Message] pali@yahoogroups.com
Paaliga.na - a community for Pali students
Yahoo! Groups members can set their delivery options to daily digest or web only.



------------------------------------------------------------------------------
Yahoo! Groups Links

a.. To visit your group on the web, go to:
http://groups.yahoo.com/group/Pali/

b.. To unsubscribe from this group, send an email to:
Pali-unsubscribe@yahoogroups.com

c.. Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.



[Non-text portions of this message have been removed]