[tied] Re: obscure languages... - Recovering Cyrillic

From: richardwordingham
Message: 14149
Date: 2002-07-25

--- In cybalist@..., "richardwordingham" <richard.wordingham@...>
wrote:
> --- In cybalist@..., erobert52@... wrote:
> > In a message dated 25/07/02 09:17:57 GMT Daylight Time,
> jdcroft@...
> > writes:
> >
> > > What appears to be an excellent source for unravelling this
> > > information is found at
> > >
> > > Tbilisi: Mematiane 2001
> > > 2001
> > > Giorgi Leon Kavtaradze
>
> The URL for it appears to be
> http://www.geocities.com/komblege/ethn.htm
>
> Reference 19 is in Cyrillic - it reads
>
> 19. Êàïàíöÿí, Ã. À. 1975. Î âçàèìîîòíîøåíèÿõ
> àðìÿíñêîãî è ëàçî-
> ìåãðåëüñêîãî ÿçûêîâ, â êí.: Êàïàíöÿí, Ã. À.
> Èñòîðèêî-ëèíãâèñòè÷åñêèå
> ðàáîòû, ò. II. Åðåâàí.
>
> In case the mangling is irretrievable (pasting and then previewing
> lost its Cyrillicity, but there may be a retrieval technique - see
> next posting),...

The following technique works if the data is accessed from the
archive using Netscape Communicator 4.5 or internet Explorer 5.50 or
from e-mail accessed via Micrososft Outlook.

1. Create a Word file with a couple of adjacent Cyrillic characters.
Use insert symbol. It works with the Arial font, I don't know about
others.

2. Cut and paste the strange characters (chiefly accented characters)
from the posting into a different part of the same file.

3. Save the file in RTF format.

4. Open the file as plain text. Word can be used for this - coerce
it to query file formats for 'non-Word' formats. RTF is a non-Word
format!

5. The obscure character should be enclosed by braces and code such
as {\f59 }. The number after the '\f' will probably be different.

6. Enclose the Cyrillic text, which will appear as escaped hex codes,
within these braces and the \f code.

7. Save the file as plain text. This would be the default if using
Word.

8. Open the file in Word as an RTF file. The Cyrillic should then be
displayed.

Happy hacking,

Richard.