tech-userlevel archive
Re: Unicode programming
>Yeah, they exist within the BMP, mostly within CJK/East Asian; see
>http://unicode.org/reports/tr11/#Ambiguous for some info.
Ah, okay ... as I read that, if I can't figure out the width in context I
should treat them as "narrow". Which I'm willing to do in this case. As
I read that whole thing, I'd have to know what font I'm using to really
make it work.
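
To make the "treat ambiguous as narrow" rule concrete, here is a rough sketch in Python (picked only because its standard unicodedata module exposes the East Asian Width property). The function name column_width and the ambiguous_wide flag are made up for illustration. It also assumes combining marks take no column and that everything else, control characters included, takes one, which is a simplification.

```python
import unicodedata

def column_width(text, ambiguous_wide=False):
    """Estimate display columns from East Asian Width classes (UAX #11)."""
    width = 0
    for ch in text:
        if unicodedata.combining(ch):
            # Combining marks attach to the previous character.
            continue
        eaw = unicodedata.east_asian_width(ch)
        if eaw in ("W", "F"):
            # Wide and Fullwidth always occupy two columns.
            width += 2
        elif eaw == "A" and ambiguous_wide:
            # Ambiguous is wide only when the caller knows the
            # context (e.g. a CJK font) calls for it.
            width += 2
        else:
            # Narrow, Halfwidth, Neutral, and Ambiguous by default.
            width += 1
    return width
```

With the default, column_width("\u00b1") counts the ambiguous PLUS-MINUS SIGN as one column; passing ambiguous_wide=True is the knob for the case where the font or context is actually known.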
>As far as surrogates in UTF-16: yeah, they only exist in UTF-16; they're
>one of the primary differences between UTF-16 and UCS-2. One of the
>_other_ bugaboos with UTF-16 is that you need to keep track of the byte
>order and/or insert a BOM to disambiguate what kind of stream you're
>generating.
I think that's only an issue if I get UTF-16 from somewhere else, right?
As long as UTF-16 is completely internal to my application I could assume
that it's using the host native byte order.
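
A small sketch of both points, again in Python and under my own assumptions: surrogate_pair, to_internal, and to_external are hypothetical helper names, not anything from a real library. It shows how a code point above the BMP splits into a surrogate pair, keeps the internal form in host byte order without a BOM, and only adds a BOM when the data leaves the application.

```python
import sys

def surrogate_pair(code_point):
    """Split a code point above the BMP into (high, low) UTF-16 surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary-plane code point")
    offset = code_point - 0x10000
    # Top 10 bits go in the high surrogate, bottom 10 in the low one.
    return 0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)

# Codec name matching the host byte order; no BOM is written.
NATIVE_UTF16 = "utf-16-le" if sys.byteorder == "little" else "utf-16-be"

def to_internal(text):
    """UTF-16 in host byte order, for use inside the application only."""
    return text.encode(NATIVE_UTF16)

def to_external(text):
    """UTF-16 with a leading BOM, for data handed to someone else."""
    # Python's plain "utf-16" codec prepends a BOM on encode.
    return text.encode("utf-16")
```

For example, surrogate_pair(0x1F600) gives (0xD83D, 0xDE00), and to_external(s) is just the BOM followed by to_internal(s). A reader that decodes with the plain "utf-16" codec uses the BOM to pick the byte order, which is the disambiguation step that internal-only data can skip.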
Thanks again for the help!
--Ken