> Yes. Go with this. There is an ISO character set standard (eclipsed by > unicode, thankfully) which uses 4-byte characters. > > I forget the ISO number, but something like 10617 feels close. It's ISO-10646. :-) There is also other ISO standard which supports 2/3/4/..-bytes characters. (ISO-2022) -- soda