Re: Automatic clustering of languages

From: Peter P
Message: 48205
Date: 2007-04-03

--- In cybalist@yahoogroups.com, "Richard Wordingham" <richard@...> wrote:
>
> --- In cybalist@yahoogroups.com, "Daniel J. Milton" <dmilt1896@> wrote:
>
> Persian doesn't actually cluster with 'Finno-Maori' in the other
> metrics, so I'll just point out the commonalities (in upper case):
>
> Maori Finnish distance (insertion/deletion)
> KAtoa KAikki 7
> kiNO huoNO 5
> hoopArA vAtsA 8
> hiwAhiwa mutsA 11
> iwi luu 6 - This is a good match!!
> mAeVAo pAiVA 5
> hemo varjata 11
> inU jUoda 6
> poKORAringa KORvA 8
> haupA syodA 8
> heeki muna 9
> kaIkA sIlmA 6
> paaparA isA 8
> iKA KAla 3
> rIma vIisi 7
> wAe jAlka 6
>
> I thought the key to the Finno-Maori cluster was that the words are
> vowel rich and there is only a limited range of frequent consonants.
> However, the above gives a distance of 114, whereas the distance
> between Persian and 'Sanskrit' is 100. This certainly merits
> investigation. I wonder if it is an effect of the clustering algorithm.
>
> Richard.
>

Taking Finnish into consideration some Finnish words are written
incorrectly. The Finnish vowels are a,e,i,o,u,y,ä,ö.

> hoopArA vAtsA 8 (This is correct)
> mAeVAo pÄiVÄ 5 (should be)
> haupA syodÄ 8 (should be)
> kaIkA sIlmÄ 6 (should be)
> iKA KAla 3 (This is correct)

In Finnish A and Ä are not the same vowel.

Peter P