tech-userlevel archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: A draft for a multibyte and multi-codepoint C string interface
On Tue, 2 Apr 2013 12:21:03 -0400
Thor Lancelot Simon <tls%panix.com@localhost> wrote:
> On Tue, Apr 02, 2013 at 06:08:01PM +0200, tlaronde%polynum.com@localhost
> wrote:
> >
> > That UTF-8 is the answer, since this allows to use C "char" (at
> > least an octet, signed or unsigned) programs.
>
> Except it can't, really, quite be UTF-8 -- it has to be "Modified
> UTF-8", because C strings can't contain 0.
What are you referring to, exactly? UTF-8 and ASCII both represent NUL
with 0. The filename rule is that only '/' and NUL are prohibited. I
don't see how adopting the convention that filenames are UTF-8 encoded
conflicts with that.
--jkl
Home |
Main Index |
Thread Index |
Old Index