DragonFly BSD
DragonFly users List (threaded) for 2006-08
[Date Prev][Date Next]  [Thread Prev][Thread Next]  [Date Index][Thread Index]

Re: Confusion over encodings, utf-8, etc.


From: Bill Hacker <wbh@xxxxxxxxxxxxx>
Date: Wed, 30 Aug 2006 16:12:49 +0800

Jonathon McKitrick wrote:

I've entered the world of unicode with my mac, and I'd like to make things
consistent across my machines.

I naïvely thought utf-8 would solve everything.  But I find a lot of web pages
and people's names only show correctly in iso-8859-1.  I thought utf-8 covered
them all?

Jonathon McKitrick
--
My other computer is your Windows box.

Similar environment here. Couple of "gotcha's"


- Mac's own unicode has a couple of broken codes - legacy pre-OS X control-key compatibiity, IIRC. Best to use real UTF-8.

- Websites are *supposed to* call out their encoding, but not all do so correctly, consistently, or even at all. Nor are all available pages the same.

One that DOES ('coz I wrote it that way...) is here:

http://precisa.ch/Precisa/vert1_srch?Zone=as&SUBMIT=Submit+Query

Your page should show several dialects of Chinese, Korean, and Thai, as well as Western languages.

- Browsers have their own ways.

Taking Mozilla/Firefox as an example, rather than just setting UTF-8 as the default, you may need to list it (and others) in the 'auto-detect' options, then optionally leave ISO-8859-1 as the default.

I still commonly get alleged- UTF-8 pages that mis-convert '-' and quotes and such, but will clean-up if ISO-8859-1 is manually selected.

Bad page coding. I just live with it, as it doesn't affect much I can't auto-convert well enough with 'wetware'.

HTH,

Bill





[Date Prev][Date Next]  [Thread Prev][Thread Next]  [Date Index][Thread Index]