Subject: Re: port-sparc/13298: sparc esp driver leaves processes stuck forever after a time out
To: NetBSD GNATS submissions and followups <gnats-bugs@gnats.netbsd.org>
From: Greg A. Woods <woods@weird.com>
List: netbsd-bugs
Date: 06/24/2001 22:46:14
[ On Sunday, June 24, 2001 at 14:54:36 (-0400), Greg A. Woods wrote: ]
> Subject: port-sparc/13298: sparc esp driver leaves processes stuck forever after a time out
>
> Jun 24 12:35:50 sometimes /netbsd: sd1(esp0:0:1:0): esp0: timed out [ecb 0xf09494e0 (flags 0x1, dleft 800, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 2000, msg(q 0,o 0) >
Well it happened again, but with a bit more info this time:
sd1: waiting for pack to spin up...
sd1(esp0:0:1:0): esp0: timed out [ecb 0xf0852c08 (flags 0x1, dleft 2000, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 0, msg(q 0,o 0) >
sd1(esp0:0:1:0): esp0: timed out [ecb 0xf0852ab8 (flags 0x1, dleft 2000, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 0, msg(q 0,o 0) >
This time the triggering process was stuck in 'getblk':
0 24532 24531 24 -5 0 192 192 getblk D p0 0:00.03 make -m /usr/
Hmmmm... I thought... if the disk takes itself offline momentarily then
that smells a lot like the disk is doing something bad, like maybe
encountering an error that 'esp's not reporting properly....
Well it turns out that ARRE wasn't enabled on the disk (damn I wish this
would happen automatically if the driver's not going to do reassignment!)
Using the old FreeBSD "scsi" tool (which still works just fine for
mode-page adjustments on NetBSD, including on sparc), I turned it on and
did a 'dd if=/dev/rsd1c of=/dev/null' with narry a problem. The
performance wasn't very stunning though:
4197405+0 records in
4197405+0 records out
2149071360 bytes transferred in 4885.573 secs (439881 bytes/sec)
So, one more reboot to restart the 'make build' again....
This time the disk synching failed but the reboot didn't hang....
I wonder if this problem is the same or similar to the one Jim Bernard
has been reporting on port-sparc under the heading "SCSI probs on spork 10."
In any case it seems to me that the 'esp' driver hasn't kept up to the
new scsipi times.....
--
Greg A. Woods
+1 416 218-0098 VE3TCP <gwoods@acm.org> <woods@robohack.ca>
Planix, Inc. <woods@planix.com>; Secrets of the Weird <woods@weird.com>