Subject: Repeated panics on a Proliant 800 biproc with 1.6D kernel and
To: None <tech-kern@netbsd.org, current-users@netbsd.org, tech-smp@netbsd.org>
From: Xavier HUMBERT <xavier.humbert@xavhome.fr.eu.org>
List: tech-smp
Date: 07/24/2002 14:41:08
[posted to tech-kern, current-users, tech-smp, please feel free to
restrict the followup to tech-kern]
Context :
---------
Compaq Proliant 800, bi PII/400 (dmesg later)
1- Installed the 1.5.2 release on this machine (the latest snapshot,
20020518-1.5ZC, panics when installing)
2- Imported -current source tree.
3- Copied over a GENERIC 1.6D kernel built on another machine running
up-to-date NetBSD-current, then rebooted.
4- Started to rebuild the userland (earlier risky experiments taught
me to do the compile in a single pass into /, to avoid lib and toolchain
conflicts)
5- Almost immediate panic. Actually, not a panic, but an NMI!
(copied by hand, no serial console on the machine)
| NMI ... going to Debugger
| Stopped in pid 1153 (cc1) at pmap_zero_page+0x3d repe stosl %es:(%edi)
|> trace
| pmap_zero_page(10002000,1,e3760e30,c036310c,e363f220) at pmap_zero_page+0x3d
| uvm_pagealloc_strat(0,0,0,e363f220,2) at uvm_pagealloc_strat+0x395
| uvm_fault(e32098d0,81b5000,0,2,fe8) at uvm_fault+0xd40
| trap() at trap+0x4d4
| --- trap (number 6) ---
| 0x80cb664
I don't know which device is responsible for the NMI. Perhaps the kernel
thinks the second (actually unused) drive is part of a RAIDframe set?
Here is the dmesg:
| NetBSD 1.6D (GENERIC) #0: Wed Jul 24 11:54:10 CEST 2002
| root@aragorn:/usr/src/sys/arch/i386/compile/GENERIC
| cpu0: Intel Pentium II/Celeron (Deschutes) (686-class), 398.80 MHz
| cpu0: I-cache 16 KB 32b/line 4-way, D-cache 16 KB 32b/line 2-way
| cpu0: L2 cache 512 KB 32b/line 4-way
| cpu0: features 183fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
| cpu0: features 183fbff<PGE,MCA,CMOV,FGPAT,PSE36,MMX>
| cpu0: features 183fbff<FXSR>
| total memory = 511 MB
| avail memory = 468 MB
| using 6144 buffers containing 26296 KB of memory
| BIOS32 rev. 0 found at 0xf0000
| mainbus0 (root)
| pci0 at mainbus0 bus 0: configuration mode 1
| pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
| pchb0 at pci0 dev 0 function 0
| pchb0: Intel 82443BX Host Bridge/Controller (AGP disabled) (rev. 0x02)
| pchb0: fixing Idle/Pipeline DRAM Leadoff Timing
| siop0 at pci0 dev 6 function 0: Symbios Logic 53c875 (ultra-wide scsi)
| siop0: using on-board RAM
| siop0: interrupting at irq 5
| scsibus0 at siop0: 16 targets, 8 luns per target
| siop1 at pci0 dev 6 function 1: Symbios Logic 53c875 (ultra-wide scsi)
| siop1: using on-board RAM
| siop1: interrupting at irq 9
| scsibus1 at siop1: 16 targets, 8 luns per target
| tl0 at pci0 dev 7 function 0
| tl0: Compaq ProLiant Integrated Netelligent 10/100 TX
| tl0: Ethernet address 00:50:8b:73:0c:2f
| tl0: interrupting at irq 10
| lxtphy0 at tl0 phy 1: LXT970 10/100 media interface, rev. 3
| lxtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
| tlphy0 at tl0 phy 31: ThunderLAN 10BASE-T media interface, rev. 5
| tlphy0: 10base2
| vga1 at pci0 dev 8 function 0: ATI Technologies 3D Rage IIC (rev. 0x7a)
| wsdisplay0 at vga1 kbdmux 1: console (80x25, vt100 emulation)
| wsmux1: connecting to wsdisplay0
| Compaq product 0xa0f0 (miscellaneous system) at pci0 dev 9 function 0 not configured
| ppb0 at pci0 dev 10 function 0: Digital Equipment DECchip 21152 PCI-PCI Bridge (rev. 0x03)
| pci1 at ppb0 bus 1
| pci1: i/o space, memory space enabled, rd/line, wr/inv ok
| pcib0 at pci0 dev 20 function 0
| pcib0: Intel 82371AB PCI-to-ISA Bridge (PIIX4) (rev. 0x02)
| pciide0 at pci0 dev 20 function 1: Intel 82371AB IDE controller (PIIX4) (rev. 0x01)
| pciide0: bus-master DMA support present
| pciide0: primary channel wired to compatibility mode
| atapibus0 at pciide0 channel 0: 2 targets
| cd0 at atapibus0 drive 0: <CD-ROM CDU701-Q, , 1.0r> type 5 cdrom removable
| cd0: 32-bit data port
| cd0: drive supports PIO mode 4, DMA mode 2
| pciide0: primary channel interrupting at irq 14
| cd0(pciide0:0:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
| pciide0: secondary channel wired to compatibility mode
| pciide0: secondary channel ignored (disabled)
| uhci0 at pci0 dev 20 function 2: Intel 82371AB USB Host Controller (PIIX4) (rev. 0x01)
| pci_intr_map: no mapping for pin D (line=ff)
| uhci0: couldn't map interrupt
| Intel 82371AB Power Management Controller (PIIX4) (miscellaneous bridge, revision 0x02) at pci0 dev 20 function 3 not configured
| isa0 at pcib0
| com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
| com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
| pckbc0 at isa0 port 0x60-0x64
| pckbd0 at pckbc0 (kbd slot)
| pckbc0: using irq 1 for kbd slot
| wskbd0 at pckbd0: console keyboard, using wsdisplay0
| pms0 at pckbc0 (aux slot)
| pckbc0: using irq 12 for aux slot
| wsmouse0 at pms0 mux 0
| lpt0 at isa0 port 0x378-0x37b irq 7
| pcppi0 at isa0 port 0x61
| midi0 at pcppi0: PC speaker
| sysbeep0 at pcppi0
| isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
| npx0 at isa0 port 0xf0-0xff: using exception 16
| fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
| fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
| isapnp0: no ISA Plug 'n Play devices found
| biomask eb65 netmask ef65 ttymask ffe7
| scsibus0: waiting 2 seconds for devices to settle...
| sd0 at scsibus0 target 0 lun 0: <COMPAQ, HD0094172B, 3208> SCSI2 0/direct fixed
| sd0: 8678 MB, 6962 cyl, 12 head, 212 sec, 512 bytes/sect x 17773500 sectors
| sd0: sync (50.0ns offset 15), 16-bit (40.000MB/s) transfers, tagged queueing
| sd1 at scsibus0 target 3 lun 0: <IBM, DNES-309170W, SA30> SCSI3 0/direct fixed
| sd1: 8748 MB, 11474 cyl, 5 head, 312 sec, 512 bytes/sect x 17916240 sectors
| sd1: sync (50.0ns offset 16), 16-bit (40.000MB/s) transfers, tagged queueing
| scsibus1: waiting 2 seconds for devices to settle...
| Kernelized RAIDframe activated
| boot device: sd0
| root on sd0a dumps on sd0b
| root file system type: ffs
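For anyone who wants to cross-check the cpu0 feature lines above, here is a
minimal sketch that decodes the 0x183fbff mask printed by the kernel. It
assumes the standard CPUID leaf 1 EDX bit layout; the names FEATURE_BITS and
decode_features are made up for this illustration and are not part of any
NetBSD tool. The label FGPAT is used for bit 16 only to match the dmesg
output.

```python
# Hypothetical helper: decode a CPUID(1).EDX feature mask into flag names.
# Bit positions assume the standard x86 layout; bit 10 is reserved.
FEATURE_BITS = {
    0: "FPU", 1: "VME", 2: "DE", 3: "PSE", 4: "TSC", 5: "MSR",
    6: "PAE", 7: "MCE", 8: "CX8", 9: "APIC", 11: "SEP", 12: "MTRR",
    13: "PGE", 14: "MCA", 15: "CMOV", 16: "FGPAT", 17: "PSE36",
    23: "MMX", 24: "FXSR", 25: "SSE",
}


def decode_features(mask):
    """Return the names of the feature bits set in mask, lowest bit first."""
    return [name for bit, name in sorted(FEATURE_BITS.items())
            if mask & (1 << bit)]


if __name__ == "__main__":
    # Value copied from the "cpu0: features 183fbff<...>" lines.
    print(",".join(decode_features(0x183FBFF)))
```

Run on the value from the dmesg, this lists the same flags the kernel
printed across the three "features" lines, and shows that SSE (bit 25) is
not set on this PII.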
Any help would be greatly appreciated!
Thanks,
--
Xavier HUMBERT - Systemes et Reseaux | labo-info@injep.fr
INJEP | humbert@injep.fr