Subject: lib/25795: iconv doesn't support all of CP932
To: None <gnats-bugs@gnats.NetBSD.org>
From: None <khym@azeotrope.org>
List: netbsd-bugs
Date: 06/02/2004 16:16:16
>Number: 25795
>Category: lib
>Synopsis: iconv doesn't support all of CP932
>Confidential: no
>Severity: non-critical
>Priority: medium
>Responsible: lib-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Wed Jun 02 21:42:00 UTC 2004
>Closed-Date:
>Last-Modified:
>Originator: Dave Huang
>Release: NetBSD 2.0_BETA
>Organization:
Name: Dave Huang | Mammal, mammal / their names are called /
INet: khym@azeotrope.org | they raise a paw / the bat, the cat /
FurryMUCK: Dahan | dolphin and dog / koala bear and hog -- TMBG
Dahan: Hani G Y+C 28 Y++ L+++ W- C++ T++ A+ E+ S++ V++ F- Q+++ P+ B+ PA+ PL++
>Environment:
System: NetBSD yerfable.azeotrope.org 2.0_BETA NetBSD 2.0_BETA (YERFABLE) #206: Mon May 31 05:53:47 CDT 2004 khym@yerfable.azeotrope.org:/usr2/obj.alpha/sys/arch/alpha/compile/YERFABLE alpha
Architecture: alpha
Machine: alpha
>Description:
iconv -l lists Microsoft code page 932 (cp932) as a supported
character set, but iconv cannot convert all of its characters. The
circled numbers and Roman numerals under lead byte 0x87 are rejected:
http://www.microsoft.com/globaldev/reference/dbcs/932/932_87.htm
It looks like NetBSD has a mapping table for those characters in
src/share/i18n/csmapper/JIS/JISX0208VDC@NEC%UCS.src, but it's unused.
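
For reference, the affected range can be listed with a short script. This is only a sketch: it uses Python's built-in cp932 codec as a stand-in for the expected mapping (not NetBSD's csmapper table), and the helper name nec_row13_sample is made up for illustration.

```python
import unicodedata


def nec_row13_sample(first=0x40, last=0x5D):
    """Decode CP932 bytes 0x87 0x40 .. 0x87 0x5D with Python's cp932 codec.

    Assumption: Python's codec is used here only as a reference for what a
    complete cp932 table maps these bytes to.
    """
    rows = []
    for trail in range(first, last + 1):
        raw = bytes([0x87, trail])
        ch = raw.decode("cp932")
        rows.append((raw.hex(), f"U+{ord(ch):04X}", unicodedata.name(ch)))
    return rows


if __name__ == "__main__":
    # One line per character: CP932 bytes, Unicode code point, character name.
    for raw_hex, codepoint, name in nec_row13_sample():
        print(raw_hex, codepoint, name)
```

The first twenty entries are the circled numbers and the last ten are the Roman numerals; these are the characters the unused table would need to supply.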
>How-To-Repeat:
% printf '\x87\x40' | iconv -f cp932 -t utf-16 | hexdump -C
iconv: iconv(): Illegal byte sequence
The same command on a Mac OS X system (with GNU libiconv) produces:
00000000  fe ff 24 60                                       |..$`|
00000004
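
The expected bytes can also be cross-checked without iconv. The sketch below assumes Python's cp932 codec as the reference and writes the big-endian byte order mark explicitly to match the output above; cp932_to_utf16_with_bom is a hypothetical helper name.

```python
import codecs


def cp932_to_utf16_with_bom(data: bytes) -> bytes:
    """Convert CP932 bytes to big-endian UTF-16 preceded by a BOM."""
    text = data.decode("cp932")
    # Python's plain "utf-16" codec uses native byte order, so the BOM and
    # byte order are spelled out to match the GNU libiconv output.
    return codecs.BOM_UTF16_BE + text.encode("utf-16-be")


result = cp932_to_utf16_with_bom(b"\x87\x40")
print(" ".join(f"{b:02x}" for b in result))  # prints: fe ff 24 60
```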
>Fix:
>Release-Note:
>Audit-Trail:
>Unformatted: