Subject: Re: raidframe on a pmax?
To: Simon Burge <simonb@NetBSD.ORG>
From: Greg Oster <oster@cs.usask.ca>
List: current-users
Date: 08/02/1999 09:04:01
Simon Burge writes:
> Greg Oster wrote:
>
> > With whatever you decide, I'd be interested in hearing how it goes :)
>
> Well, here's some initial results. I've got a 5000/240, with 1 boot
> RZ28 (2GB) disk, and another 11 RZ28s spread across two controllers in
> a raid5 setup.
That's a lotta disk for one of those machines, no? :)
> For now, I've decided not to have a hot spare, but will
> when I find another disk. The initial "raidctl -i raid0" took 2h33m57s,
> and a "newfs -i 8192" took 19m26s.
>
> Here's some bonnie times. First, a ccd using 6 RZ28s on the same bus (under
> NetBSD 1.3.1):
>
> -------Sequential Output-------- ---Sequential Input-- --Random
> --
> -Per Char- --Block--- -Rewrite-- -Per Char- --Block--- --Seeks-
> --
> Machine MB K/sec %CPU K/sec %CPU K/sec %CPU K/sec %CPU K/sec %CPU /sec %C
> PU
> 6#rz28 128 1540 99.1 3224 27.9 1741 28.4 1377 99.6 3652 33.9 76.0 8
> .4
>
> and then some results on the new raid5 filesystem (NetBSD 1.4.1):
>
> 11#rz28 32 615 41.2 928 13.9 760 15.0 1200 97.7 3324 49.7 47.5 10
> .2
> 11#rz28 128 624 42.3 924 14.4 772 17.0 1175 97.6 3388 52.2 41.7 9
> .4
>
> These aren't apples vs. apples numbers, but (as has been mentioned
> before) write performance isn't too flash.
That's about right... You'll notice a performance improvement for writes
with a -current kernel..
> I'm not hugely worried about
> this, as the box is serving an FTP site and is only connected with
> 10mbit ethernet.
>
> My primary concern at this stage is the time it takes to rewrite the
> parity. Has any thought been given to backgrounding this process so
> that the raid set operates in degraded mode until parity is recomputed?
> (No I haven't read the RAIDframe papers yet :).
:-) Just run 'raidctl -i raid0' in the background. You can still fsck,
mount, use, etc. the filesystem while the parity is begin computed -- it's
just that it's somewhat riskier, of course. You actually don't want the raid
set to be operating in degraded mode before parity is computed, as degraded
mode implies that you may be using the parity bits to compute data bits -- but
at that point you don't know that the parity bits are good! (if you knew the
parity bits were good, you wouldn't be looking to recompute them :) ).
While the parity is begin recomputed, the real data blocks will be being read
on reads, and on writes, the real data blocks will be written, and the
associated parity blocks will be updated as well. The parity re-write can
operate independent of all of this, and can do it's job even if you're working
with the RAID set.
> In a nutshell - so far, so good :-)
Cool :)
Later...
Greg Oster