Re: Cyrillic in AW documents


Subject: Re: Cyrillic in AW documents
From: Andrew Dunbar (hippietrail@yahoo.com)
Date: Fri Sep 07 2001 - 08:01:21 CDT


 --- Tomas Frydrych <tomas@frydrych.uklinux.net>
wrote: > Hi Andrew,
>
> > Actually, my understanding is that .abw documents
> are
> > *always* in UTF-8 encoding. At the top of any XML
> > document the encoding must be declared if it is
> > something other than UTF-8.
>
> Things must have changed recently, AW used to use
> the encoding
> of the locale under which it is running in the abw
> document, i.e., if
> LANG was set to ru_RU.KOI8-R, the document was
> internally
> coded using KOI8-R. Any characters not found in the
> encoding set,
> were represented by xml entities &#... UTF-8 was
> only used if the
> locale itself used utf-8. The main reason for this
> was so that the
> user could use external utilities, such as grep, on
> their documents.
>
> If it is true that AW now defaults to utf-8, I would
> like to suggest
> that this should be changed back.

Hi Tomas. This makes good sense. However if I recall
correctly, XML documents, including .abw documents
should then declare in the header which encoding they
are using. In the sample document we declared
nothing.
Can somebody please look into this?

Andrew Dunbar.

=====
http://linguaphile.sourceforge.net

____________________________________________________________
Do You Yahoo!?
Get your free @yahoo.co.uk address at http://mail.yahoo.co.uk
or your free @yahoo.ie address at http://mail.yahoo.ie



This archive was generated by hypermail 2b25 : Fri Sep 07 2001 - 08:01:40 CDT