Re: Unicode programming

To: tech-userlevel%NetBSD.org@localhost
Subject: Re: Unicode programming
From: Ken Hornstein <kenh%pobox.com@localhost>
Date: Wed, 05 Oct 2011 21:19:42 -0400

>Yeah, they exist within the BMP, mostly within CJK/East Asian; see
>http://unicode.org/reports/tr11/#Ambiguous for some info.

Ah, okay ... as I read that, if I can't figure out the width in context I
should treat them as "narrow".  Which I'm willing to do in this case.  As
I read that whole thing, I'd have to know what font I'm using to really
make it work.

>As far as surrogates in UTF-16: yeah, they only exist in UTF-16; they're
>one of the primary differentiations between UTF-16 and UCS-2. One of the
>_other_ bugaboos with UTF-16 is that you need to keep track of the byte
>order and/or insert a BOM to deambiguate what kind of stream you're
>generating.

I think that's only an issue if I get UTF-16 from somewhere else, right?
As long as UTF-16 is completely internal to my application I could assume
that it's using the host native byte order.

Thanks again for the help!

--Ken

References:
- Re: Unicode programming
  - From: Tom Spindler

Prev by Date: Re: Unicode programming
Next by Date: Re: Unicode programming
Previous by Thread: Re: Unicode programming
Next by Thread: Re: Unicode programming
Indexes:

Home | Main Index | Thread Index | Old Index