Subject: "panic: extent_free: region not found" on AS4100, 1.6.x with mlx(4)
To: NetBSD port-alpha List <port-alpha@NetBSD.org>
From: Greg A. Woods <woods@weird.com>
List: port-alpha
Date: 10/24/2005 12:41:27
--Multipart_Mon_Oct_24_12:40:55_2005-2
Content-Type: multipart/signed;
boundary="pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1"; micalg=pgp-sha1;
protocol="application/pgp-signature"
Content-Transfer-Encoding: 7bit
--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1
Content-Type: text/plain; charset=US-ASCII
These two panics, which appear to be releated, have happened to my new
AS4100 running NetBSD 1.6.x, now with three CPUs, and of course a mlx(4)
(DCA960) RAID controller:
extent `mcpcia pci sgmap' (0x0 - 0x3fffffff), flags = 0x2
0x0 - 0x1fff
0x2000 - 0x3fff
0x4000 - 0x5fff
0x6000 - 0x7fff
0x8000 - 0x9fff
0xa000 - 0xbfff
0xc000 - 0xdfff
0xe000 - 0xffff
0x10000 - 0x11fff
0x12000 - 0x13fff
0x14000 - 0x15fff
0x16000 - 0x17fff
0x18000 - 0x1dfff
extent_free: start 0x18000, end 0x1bfff
panic: extent_free: region not found
Stopped at cpu_Debugger+0x4: ret zero,(ra)
db{0}> trace
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0x160
extent_free() at extent_free+0x320
pci_sgmap_pte64_unload() at pci_sgmap_pte64_unload+0x108
mcpcia_bus_dmamap_unload_sgmap() at mcpcia_bus_dmamap_unload_sgmap+0x38
mlx_ccb_unmap() at mlx_ccb_unmap+0xa8
ld_mlx_handler() at ld_mlx_handler+0xe4
mlx_intr() at mlx_intr+0xec
alpha_shared_intr_dispatch() at alpha_shared_intr_dispatch+0x6c
kn300_iointr() at kn300_iointr+0x54
interrupt() at interrupt+0x33c
XentInt() at XentInt+0x1c
--- interrupt (from ipl 0) ---
idle() at idle+0x78
idle() at idle+0x54
--- root of call graph ---
db{0}> reboot
syncing disks... tlp0: receive ring overrun
CPU 0: fatal kernel trap:
CPU 0 trap entry = 0x2 (memory management fault)
CPU 0 a0 = 0x1a4
CPU 0 a1 = 0x1
CPU 0 a2 = 0x0
CPU 0 pc = 0xfffffc00004a3bd4
CPU 0 ra = 0xfffffc00003fa598
CPU 0 pv = 0xfffffc00004a3860
CPU 0 curproc = 0x0
panic: trap
Stopped at cpu_Debugger+0x4: ret zero,(ra)
db{0}> reboot
cpu1: shutting down...
cpu2: shutting down...
rebooting...
[[ and still a hard reset was needed ]]
and the second one:
extent `mcpcia pci sgmap' (0x0 - 0x3fffffff), flags = 0x2
0x0 - 0x1fff
0x2000 - 0x3fff
0x4000 - 0x5fff
0x6000 - 0x7fff
0x8000 - 0x9fff
0xa000 - 0xdfff
extent_free: start 0xa000, end 0xbfff
panic: extent_free: region not found
Stopped at cpu_Debugger+0x4: ret zero,(ra)
db{0}> trace
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0x160
extent_free() at extent_free+0x320
pci_sgmap_pte64_unload() at pci_sgmap_pte64_unload+0x108
mcpcia_bus_dmamap_unload_sgmap() at mcpcia_bus_dmamap_unload_sgmap+0x38
mlx_ccb_unmap() at mlx_ccb_unmap+0xa8
ld_mlx_handler() at ld_mlx_handler+0xe4
mlx_intr() at mlx_intr+0xec
alpha_shared_intr_dispatch() at alpha_shared_intr_dispatch+0x6c
kn300_iointr() at kn300_iointr+0x54
interrupt() at interrupt+0x33c
XentInt() at XentInt+0x1c
--- interrupt (from ipl 0) ---
idle() at idle+0x78
idle() at idle+0x54
--- root of call graph ---
db{0}>
Is there any further information I can try to find if this happens
again?
Unfortunately I doubt I'll be able to get a kernel core dump, especially
since the default dump device is one of the logical devices attached to
the mlx(4) controller.
--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1
Content-Type: application/pgp-signature
Content-Transfer-Encoding: 7bit
-----BEGIN PGP SIGNATURE-----
Version: PGPfreeware 5.0i for non-commercial use
MessageID: KkBYolzulu6M1srWPxD0dCFAMfvWVf9A
iQA/AwUBQ10OtmJ7XxTCWceFEQLhAgCfXPjr+kL1rVAepKa63VKp2uWxWzQAn1A4
IwtUBMKmT2MAZkAsu06Ekzsj
=cq8z
-----END PGP SIGNATURE-----
--pgp-sign-Multipart_Mon_Oct_24_12:40:55_2005-1--
--Multipart_Mon_Oct_24_12:40:55_2005-2
Content-Type: text/plain; charset=US-ASCII
--
Greg A. Woods
H:+1 416 218-0098 W:+1 416 489-5852 x122 VE3TCP RoboHack <woods@robohack.ca>
Planix, Inc. <woods@planix.com> Secrets of the Weird <woods@weird.com>
--Multipart_Mon_Oct_24_12:40:55_2005-2--