Re: commit: abi: UTF8String class

From: Joaquin Cuenca Abela (cuenca@pacaterie.u-psud.fr)
Date: Sun Apr 21 2002 - 13:47:15 EDT

  • Next message: Leonard Rosenthol: "Re: Next Generation Containers."

    Tomas wrote:
    >
    > > Tomas, I think that you're confusing the intent of UTF-8 with UTF-7.
    > > Can you explain where helps UTF-8 in the communication through 8-bit
    > > channels? You can also send UTF-32 (or any other encoding) over 8-bit
    > > channels, chopping each char in 4 bytes.
    > UTF-8 strings can be passed through dumb processes based on
    > the 8-bit C library; UCS-2 strings, for instance, cannot, because
    > any leading nulls will be interpreted as string terminators; that's all I
    > meant (I suspect this is responsible for the proliferation of utf-8 in
    > the *nix world, because it allows to start using Unicode without
    > having to rewrite the entire system).

    ok, I understand now your point. Yes, that's an advantage of UTF-8, but I
    would not say that it has been designed only to make strlen & al. work for
    english users.

    Cheers,

    --
    Joaquin Cuenca Abela
    cuenca@pacaterie.u-psud.fr
    


    This archive was generated by hypermail 2.1.4 : Sun Apr 21 2002 - 13:44:51 EDT