tech-userlevel archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: libcodecs(3), take 2
On Tue, 21 Sep 2010 15:57:15 +0900
SODA Noriyuki <soda%yuruyuru.net@localhost> wrote:
> > to-unicode [charset] translate to unicode-16 from UTF-8
> >
> >
> > to-utf8 [charset] translate from unicode-16 to UTF-8
>
> Those are bad names, since unicode is a concept which includes
> UTF-16+BOM, UTF-16BE, UTF-16LE, UTF-8, UTF-8+BOM, UCS-4 and others.
A also think that at least UCS-4/UTF-32BE/UTF-32LE should be supported.
> For code conversion, I think libcodec(3) shouldn't handle codeset names
> by itself. Maybe it makes sense to provde a transformation
> "iconv(from_codeset,to_codeset)", though. In that case libcodec(3)
> internally can call iconv(3) for the actual conversion, and
> ascii2ebcdic, ebcdic2ascii, to-unicode and to-utf8 are all unnecessary.
True.
In any case, this looks like a nice library+utility AGC, thanks for
working on this.
--
Matt
Home |
Main Index |
Thread Index |
Old Index