The following reply was made to PR bin/57616; it has been noted by GNATS.
From: mlelstv%serpens.de@localhost (Michael van Elst)
To: gnats-bugs%netbsd.org@localhost
Cc:
Subject: Re: bin/57616: sed(1) is unable to process multibyte unicode characters properly
Date: Mon, 11 Sep 2023 15:03:24 -0000 (UTC)
marc.fege%uni-bonn.de@localhost writes:
>NetBSD rpi 9.3 NetBSD 9.3 (RPI) #0: Thu Aug 4 15:30:37 UTC 2022
>mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/evbarm/compile/RPI evbarm
>sed(1) has a problem processing multibyte unicode characters properly.
> echo "abc???xyz" | sed 's/./& /g'
>I expect the following output format for further processing:
> "a b c ? ? ? x y z "
This is not really sed failing; it is a limit of what the underlying
regexp library can do.
Wide char support ("NLS") from FreeBSD was integrated in 2021 and
will be in NetBSD-10.
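
As a rough illustration of the difference (a Python sketch, not the NetBSD
regexp code; the sample string is made up because the characters in the
report did not survive), matching "." over raw UTF-8 bytes splits every
multibyte character, while matching over decoded characters keeps them whole:

```python
import re

# Hypothetical input: three ASCII letters, three two-byte UTF-8
# characters (a-umlaut, o-umlaut, u-umlaut), three ASCII letters.
text = "abc\u00e4\u00f6\u00fcxyz"
raw = text.encode("utf-8")

# Byte-oriented matching: "." matches one byte, so a space is inserted
# between the two bytes of each multibyte character.
bytewise = re.sub(rb".", rb"\g<0> ", raw)

# Character-oriented matching: "." matches one code point, so each
# character stays intact.
charwise = re.sub(r".", r"\g<0> ", text)

print(bytewise)
print(charwise)
```

The byte-wise result is no longer valid UTF-8, which would be consistent
with the broken output described in the report.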