tech-userlevel archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: libcodecs(3), take 2



On Tue, 21 Sep 2010 15:57:15 +0900
SODA Noriyuki <soda%yuruyuru.net@localhost> wrote:

> >      to-unicode   [charset] translate to unicode-16 from UTF-8
> > 
> > 
> >      to-utf8      [charset] translate from unicode-16 to UTF-8
> 
> Those are bad names, since unicode is a concept which includes
> UTF-16+BOM, UTF-16BE, UTF-16LE, UTF-8, UTF-8+BOM, UCS-4 and others.

A also think that at least UCS-4/UTF-32BE/UTF-32LE should be supported.

> For code conversion, I think libcodec(3) shouldn't handle codeset names
> by itself.  Maybe it makes sense to provde a transformation
> "iconv(from_codeset,to_codeset)", though.  In that case libcodec(3)
> internally can call iconv(3) for the actual conversion, and
> ascii2ebcdic, ebcdic2ascii, to-unicode and to-utf8 are all unnecessary.

True.

In any case, this looks like a nice library+utility AGC, thanks for
working on this.
-- 
Matt


Home | Main Index | Thread Index | Old Index