From: F J Franklin (F.J.Franklin@sheffield.ac.uk)
Date: Sun Apr 21 2002 - 06:04:14 EDT
I think Java uses UTF-16.
UTF-32 is a subset of UCS-4, I believe. My impression is that UTF-8 and
UCS-4 have less to do with UNICODE than UTF-16 and UTF-32.
I was using:
http://www.cl.cam.ac.uk/~mgk25/ucs/ISO-10646-UTF-8.html
Some other links for the curious:
http://czyborra.com/utf/
http://www.tldp.org/HOWTO/Unicode-HOWTO-1.html
http://www.cl.cam.ac.uk/~mgk25/unicode.html
Andrew, thanks for the answer. Personally I see no problem using UTF-8
internally, but I don't do piecetable work so it's not really my call. I
wasn't trying to preempt the decision; the new class is just a utility.
Frank
Francis James Franklin
F.J.Franklin@shef.ac.uk
"No, she really likes me. She told me I look like Britney Spears, and why
would you say that to somebody you don't like?"
--- Elle Woods
This archive was generated by hypermail 2.1.4 : Sun Apr 21 2002 - 06:05:24 EDT