Dear Jon and friends,
the wiki is here:
http://tipitaka.pbwiki.com/
Again, anyone keen in helping to complete the entire tipitaka, please
let me know.
Jon, as for your proposed work, statistics is an area which I like.
;-) The group has completed the whole of AN1 in trilinear format,
which you may find helpful. I have also done some foundation works on
Pali Scope,
http://www.tipitaka.net/pali/scope/. You are welcome to
use the data as required. If you are interested, you are also welcome
to provide assistance to the project (Pali Scope). The project is very
extensive, the first phase is to build up a substantial repository of
Pali words directly from the Tipitaka (probably only AN of Sutta
Pitaka). As a lexicordance, it will eventually have a lexicon and a
concordance built to utilise the database. A parallel translator can
also be incorporated, if you like to provide the technical
functionalities.
Unlike the wikis, Pali Scope is not open for free editing, but any
information on Pali Scope is free for use.
metta,
Yong Peng.
--- In Pali@yahoogroups.com, Jon Fernquest wrote:
Where is the Wiki?
One of these days I'm going to try my hand with statistical based
alignment of the Pali Jatakas. with their English translation. I have
all the necessary texts on my hard drive now.
It requires some creative programming but there� are papers� detailing
the techniques.
First, you'd need to lematise the words, eliminate inflection and
conjugation, map the nouns to third person singular and verbs to their
stem (or something like that). Then the search engine searches on
these lematised words.
http://en.wikipedia.org/wiki/Lemmatisation