Re: commit: UTF-8 recognition patch (2nd attempt)


Subject: Re: commit: UTF-8 recognition patch (2nd attempt)
From: Andrew Dunbar (falconsquire@start.com.au)
Date: Tue Apr 10 2001 - 08:45:03 CDT


It looks like my email software trashed the formatting. Here it is
again as an attachment. By the way I just tested this patch with the
native
Windows 2000 code page plain text and UTF-8 with the
following locales:
English, Japanese, Greek, Hungarian, Turkish,
Chinese (China), Chinese (Hong Kong), Korean, Thai,
and Arabic.

All worked fine except for Korean which seemed to be
due to something else.

I've noticed these unrelated Windows i10ln problems:

* Korean locale is either being confused with Chinese
or the wrong Korean encoding as any Hangul text I load
is displayed as 100% Hanja. Does this happen on Unix?

* CJK files need to display with a font that supports
them. I have to change the font manually each time.

* Exotic locales such as Hindi and Georgian cause
asserts in libiconv though loading either as UTF-8
display fine. This seems to be due Windows supporting
as Unicode locales only.

* Complex writing systems like Hindi and Thai have a
lot of problems with editing. Some problems are
similar to those with Right to Left languages.

Andrew.

http://linguaphile.sourceforge.net

__________________________________________________________________
Get your free Australian email account at http://www.start.com.au




This archive was generated by hypermail 2b25 : Tue Apr 10 2001 - 08:52:00 CDT