Broken 6.0.1 RAIDframe

To: port-i386%netbsd.org@localhost
Subject: Broken 6.0.1 RAIDframe
From: Ray Phillips <r.phillips%uq.edu.au@localhost>
Date: Wed, 24 Apr 2013 18:44:24 +1000

I've set up a NetBSD/i386 6.0.1 system with its root partition on aRAID-1 RAIDframe volume and somehow managed to get it into a brokenstate. This is only a test system containing no important data, soit doesn't matter if it can't be fixed, but I'd be interested to knowif it can be for reference, please.


The RAID-1 array is composed of wd0a and wd1a.

# uname -mrs
NetBSD 6.0.1 i386
#
# raidctl -s raid0
Components:
           /dev/wd0a: optimal
          component1: failed
No spares.
Component label for /dev/wd0a:
   Row: 0, Column: 0, Num Rows: 1, Num Columns: 2
   Version: 2, Serial Number: 2013001, Mod Counter: 135
   Clean: No, Status: 0
   sectPerSU: 128, SUsPerPU: 1, SUsPerRU: 1
   Queue size: 100, blocksize: 512, numBlocks: 39100160
   RAID Level: 1
   Autoconfig: Yes
   Root partition: Yes
   Last configured as: raid0
component1 status is: failed.  Skipping label.
Parity status: clean
Reconstruction is 100% complete.
Parity Re-write is 100% complete.
Copyback is 100% complete.
#
# raidctl -g component1 raid0
Component label for component1:
   Row: 0, Column: 0, Num Rows: 0, Num Columns: 0
   Version: 0, Serial Number: 0, Mod Counter: 0
   Clean: No, Status: 0
   sectPerSU: 0, SUsPerPU: 0, SUsPerRU: 0
   Queue size: 0, blocksize: 0, numBlocks: 0
   RAID Level:
   Autoconfig: No
   Root partition: No
   Last configured as: raid0
#
# disklabel wd0 | tail -10
disklabel: partitions a and b overlap
headswitch: 0           # microseconds
track-to-track seek: 0  # microseconds
drivedata: 0

5 partitions:
#        size    offset     fstype [fsize bsize cpg/sgs]
 a:  39100288      2048       RAID                     # (Cyl.      2*-  38791)

b: 395095 38707177 swap # (Cyl.38399*- 38791*)

 c:  39100288      2048     unused      0     0        # (Cyl.      2*-  38791)

d: 40132503 0 unused 0 0 # (Cyl. 0- 39813*)

#
# disklabel wd1 | tail -10
disklabel: partitions a and b overlap
headswitch: 0           # microseconds
track-to-track seek: 0  # microseconds
drivedata: 0

16 partitions:
#        size    offset     fstype [fsize bsize cpg/sgs]
 a:  39100288      2048       RAID                     # (Cyl.      2*-  38791)

b: 395095 38707177 swap # (Cyl.38399*- 38791*)c: 40130455 2048 unused 0 0 # (Cyl.2*- 39813*)d: 40132503 0 unused 0 0 # (Cyl. 0- 39813*)

I don't believe wd1 is faulty so I tried to bring it back into thearray using raidctl's -R switch:


# raidctl -R component1 raid0
# tail -1 /var/log/messages

Apr 25 03:51:42 bs5t /netbsd: raid0: rebuilding: dk_lookup on device:component1 failed: 2!

Is it possible to remove wd1 from the array somehow, add it as a hotspare, then use -F to reconstruct onto it?

After failing to improve the situation, I tried starting the machinewith just wd0 attached and then with just wd1 attached. Does thatleave the RAID array in an inconsistent state when both disks areconnected again? Is there a record kept of which disk was used mostrecently so its contents can be considered to be correct and willoverwrite its partner's when a reconstruction occurs?

By the way, I noticed the MBR partition 6.0.1's sysinst creates has a2048-sector offset instead of the 63 sectors I'm used to.


# fdisk wd0
Disk: /dev/rwd0d
NetBSD disklabel disk geometry:
cylinders: 39813, heads: 16, sectors/track: 63 (1008 sectors/cylinder)
total sectors: 40132503

BIOS disk geometry:
cylinders: 1023, heads: 255, sectors/track: 63 (16065 sectors/cylinder)
total sectors: 40131504

Partitions aligned to 16065 sector boundaries, offset 63

Partition table:
0: NetBSD (sysid 169)
    start 2048, size 39100288 (19092 MB, Cyls 0/32/33-2434/1/63), Active
1: <UNUSED>
2: <UNUSED>
3: <UNUSED>
Bootselector disabled.
First active partition: 0
#

I guess that's to accomodate more information, but what's an example of that?


Ray

Follow-Ups:
- Re: Broken 6.0.1 RAIDframe
  - From: Manuel Bouyer
- Re: Broken 6.0.1 RAIDframe
  - From: Patrick Welche

Prev by Date: Confidential Notice
Next by Date: Re: Broken 6.0.1 RAIDframe
Previous by Thread: Confidential Notice
Next by Thread: Re: Broken 6.0.1 RAIDframe
Indexes:

Home | Main Index | Thread Index | Old Index