From: Joaquin Cuenca Abela (cuenca@pacaterie.u-psud.fr)
Date: Sun Apr 21 2002 - 13:47:15 EDT
Tomas wrote:
>
> > Tomas, I think that you're confusing the intent of UTF-8 with UTF-7.
> > Can you explain where helps UTF-8 in the communication through 8-bit
> > channels? You can also send UTF-32 (or any other encoding) over 8-bit
> > channels, chopping each char in 4 bytes.
> UTF-8 strings can be passed through dumb processes based on
> the 8-bit C library; UCS-2 strings, for instance, cannot, because
> any leading nulls will be interpreted as string terminators; that's all I
> meant (I suspect this is responsible for the proliferation of utf-8 in
> the *nix world, because it allows to start using Unicode without
> having to rewrite the entire system).
ok, I understand now your point. Yes, that's an advantage of UTF-8, but I
would not say that it has been designed only to make strlen & al. work for
english users.
Cheers,
-- Joaquin Cuenca Abela cuenca@pacaterie.u-psud.fr
This archive was generated by hypermail 2.1.4 : Sun Apr 21 2002 - 13:44:51 EDT