tech-userlevel archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: UTF8 (Was: [PATCH] replace 0xA0 to whitespace in plain text files (part 2))
der Mouse wrote:-
> Even things like the C compiler don't really work with characters. The
> text "if" in a source file is not really "character i" "character f";
> rather, it's "octet 0x69" "octet 0x66" (well, typically - that's
> assuming ASCII was used when the compiler was built) - as it has to be,
> since there is no way to declare what character set the input uses.
This isn't true; the C standard says it is determined by the current
locale. If you have a UTF-8 locale, your C compiler should expect
all source to be UTF-8.
I've written a C front end that does this just fine.
Neil.
Home |
Main Index |
Thread Index |
Old Index