Re: XML_Char


Subject: Re: XML_Char
From: Vlad Harchev (hvv@hippo.ru)
Date: Mon Dec 18 2000 - 00:38:15 CST


On Mon, 18 Dec 2000, Mike Nordell wrote:

> In C++ we have three different byte-sized char types: "char", "signed char"
> and "unsugned char". These have been abused in C to hold UTF-8 (and other
> stuff also), but no matter how you look at it it's wrong. An UTF-8 string is
> not a, and can never be, C/C++ char string literal since they are by their
> very definition one-byte chars whereas UTF-8 are multibyte.

 Mike, it seems you are wrong. Whether each "symbol" of UTF-8 string takes one
or more bytes - it doesn't matter for C compiler. The type of UTF8 string is
char[] for C compiler since it doesn't care about what's stored in that
array.

> /Mike - please don't cc
>

 Best regards,
  -Vlad



This archive was generated by hypermail 2b25 : Mon Dec 18 2000 - 01:04:00 CST