Re: Cyrillic in AW documents


Subject: Re: Cyrillic in AW documents
From: Tomas Frydrych (tomas@frydrych.uklinux.net)
Date: Thu Sep 06 2001 - 08:59:49 CDT


Hi Andrew,

> Actually, my understanding is that .abw documents are
> *always* in UTF-8 encoding. At the top of any XML
> document the encoding must be declared if it is
> something other than UTF-8.

Things must have changed recently. AW used to write the abw document in
the encoding of the locale under which it was running, i.e., if
LANG was set to ru_RU.KOI8-R, the document was internally
coded using KOI8-R. Any characters not found in that encoding
were represented by XML entities &#... UTF-8 was only used if the
locale itself used UTF-8. The main reason for this was so that the
user could use external utilities, such as grep, on their documents.
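
To illustrate the old behaviour described above, here is a minimal Python sketch. It is not AW's actual code: the function name encode_for_locale is hypothetical, and it assumes the fallback for unrepresentable characters was a decimal numeric character reference. It only shows the general technique of writing text in the locale's encoding and escaping whatever that encoding cannot hold.

```python
def encode_for_locale(text: str, encoding: str) -> bytes:
    """Encode text in the given locale encoding (hypothetical helper).

    Characters that the encoding cannot represent are written as
    decimal XML character references (e.g. &#8364;) instead of
    raising an error.
    """
    return text.encode(encoding, errors="xmlcharrefreplace")


if __name__ == "__main__":
    # Cyrillic fits in KOI8-R; the euro sign does not, so it is escaped.
    raw = encode_for_locale("Привет €", "koi8_r")
    print(raw)
    # Under a UTF-8 locale nothing needs escaping.
    print(encode_for_locale("Привет €", "utf-8"))
```

With the KOI8-R result, the Cyrillic text stays as plain single-byte KOI8-R, which is what lets a grep running under the same locale match it directly.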

If it is true that AW now defaults to UTF-8, I would like to suggest
that this be changed back.

Tomas


