Subject: Re: wd0 intermittent disk errors (correctable soft-errors, DMA error:
To: Manuel Bouyer <bouyer@antioche.eu.org>
From: Dave <dgriffi@cs.csubak.edu>
List: tech-kern
Date: 08/23/2005 16:36:28
On Wed, 24 Aug 2005, Manuel Bouyer wrote:

> > wd0a: device timeout writing fsbn 8236512 of 8236512-8236527 (wd0 bn
> > 8236512; cn 8171 tn 2 sn 18), retrying
> > pciide0:1:0: not ready, st=0x80, err=0x00
> > wd0a: device timeout writing fsbn 8236512 of 8236512-8236527 (wd0 bn
> > 8236512; cn 8171 tn 2 sn 18), retrying
>
> This is more serious, this means the drive is stalled, it doens't
> even honnor the reset signal. I guess the drive doesn't recover from this ?
> Maybe it's a drive firmware issue, maybe it's just dying ...
>
> I've seen this on occasion on sparc64 system, I suspect it's a read/write
> reordering issue on this platform. But I've never seen it on PCs.

I've had this happen on a PC several times.  It seemed to be related to a
bad batch of hard drives.


-- 
David Griffith
dgriffi@cs.csubak.edu