Re: wv patch for iso-8859-1 chars in LaTeX output


Subject: Re: wv patch for iso-8859-1 chars in LaTeX output
mv@liisa.pp.fi
Date: Tue Jul 04 2000 - 03:09:29 CDT


Hello,

here is a revision of the earlier patch to allow wv to output
iso-8859-1 characters. A big "thank you!" to Joaquim Cuenca
Abela for pointing out the problem and forcing me to read some
background documentation ;-)

I believe the current patch is correct, almost complete
and "politically correct". And it shrinks the code ;-)

1. The LaTeX output now uses the latin1 (iso-8859-1) encoding,
   which is the best choice until a Unicode/UTF-8 enabled
   LaTeX comes along. Supported are standard iso-8859-1
   ("Latin-1") 8-bit codes for chars 159-255; this is done
   by simple hand-through as these codes are already in the
   char16 variable. So sequences like \"a have vanished.
   (I assume this arrangement makes also implementing
   support for Latin-x easier.)

2. A number of Windows extended chars often found in
   Windows documents were added by hand ("Latin1-Win"),
   also on the Html side. A few had already been done by
   Caolan.

3. The Html output now uses consistently Unicode/UTF-8.
   The block that output German characters like a-umlaut as
   ä has been removed, as these are not necessary if
   the doc is marked as being UTF-8 encoded.
   About ß I'm not sure. Left it there.

Please let me know if this is the way to go. The patch is
against the current CVS.

Martin

--
Martin Vermeer                 mv@liisa.pp.fi  mv@fgi.fi
-- Linux means: never having to delete your love letters




This archive was generated by hypermail 2b25 : Tue Jul 04 2000 - 04:13:15 CDT