Subject: Re: commit: UTF-8 recognition patch (2nd attempt)
From: Andrew Dunbar (falconsquire@start.com.au)
Date: Tue Apr 10 2001 - 08:45:03 CDT
It looks like my email software trashed the formatting. Here it is
again as an attachment. By the way I just tested this patch with the
native
Windows 2000 code page plain text and UTF-8 with the
following locales:
English, Japanese, Greek, Hungarian, Turkish,
Chinese (China), Chinese (Hong Kong), Korean, Thai,
and Arabic.
All worked fine except for Korean which seemed to be
due to something else.
I've noticed these unrelated Windows i10ln problems:
* Korean locale is either being confused with Chinese
or the wrong Korean encoding as any Hangul text I load
is displayed as 100% Hanja. Does this happen on Unix?
* CJK files need to display with a font that supports
them. I have to change the font manually each time.
* Exotic locales such as Hindi and Georgian cause
asserts in libiconv though loading either as UTF-8
display fine. This seems to be due Windows supporting
as Unicode locales only.
* Complex writing systems like Hindi and Thai have a
lot of problems with editing. Some problems are
similar to those with Right to Left languages.
Andrew.
http://linguaphile.sourceforge.net
__________________________________________________________________
Get your free Australian email account at http://www.start.com.au
This archive was generated by hypermail 2b25 : Tue Apr 10 2001 - 08:52:00 CDT