Subject: kern/25073: current i386 DIAGNOSTIC kernels panic early multi-user
To: None <gnats-bugs@gnats.NetBSD.org>
From: Paul Shupak <paul@svcs.plectere.com>
List: netbsd-bugs
Date: 04/06/2004 07:16:19
>Number: 25073
>Category: kern
>Synopsis: something apparently in or related to sorecive() panics
>Confidential: yes
>Severity: critical
>Priority: high
>Responsible: kern-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Tue Apr 06 14:18:00 UTC 2004
>Closed-Date:
>Last-Modified:
>Originator: Paul Shupak
>Release: NetBSD 2.0x -- anything after the ip-filter 4.1.1 integration
>Organization:
>Environment:
NetBSD 2.0C (COBALT-$Revision: 1.4 $) #421: Mon Apr 5 18:28:24 PDT 2004
root@svcs:/sys/arch/i386/compile/COBALT
Architecture: i386
Machine: i386
>Description:
Below are two example panics occurred while /etc/rc was
still running. The tracebacks are:
...
wsdisplay0: screen 4 added (80x25, vt100 emulation)
panic: receive 1a
Begin traceback...
soreceive(c21b7190,cffa2e98,cffa2ea4,0,0) at netbsd:soreceive+0x88b
recvit(ce446b30,5,cffa2f14,bfbff7c0,cffa2f5c) at netbsd:recvit+0xfa
sys_recvfrom(ce38bdec,cffa2f64,cffa2f5c,0,cffa2fa4) at netbsd:sys_recvfrom+0x5e
syscall_plain() at netbsd:syscall_plain+0x7e
--- syscall (number 29) ---
0x480e4b6f:
End traceback...
syncing disks...
dumping to dev 0,1 offset 3911591
dump 1535 1534 1533 1532 1531 1530 1529 1528 1527 1526 1525 1524 1523 1522 1521
...
and:
...
wsdisplay0: screen 4 added (80x25, vt100 emulation)
panic: receive 1
Begin traceback...
soreceive(c2cfcbb8,d154ce98,d154cea4,0,0) at netbsd:soreceive+0x8d7
recvit(ce4c1b34,3,d154cf14,bfbff44c,d154cf5c) at netbsd:recvit+0xfa
sys_recvfrom(ce4c3424,d154cf64,d154cf5c,0,c02f3136) at netbsd:sys_recvfrom+0x5e
syscall_plain() at netbsd:syscall_plain+0x7e
--- syscall (number 29) ---
0x480f0b6f:
End traceback...
syncing disks... panic: ip_output no HDR
Begin traceback...
ip_output(c2194100,0,c2dd8390,0,0) at netbsd:ip_output+0xa27
udp_output(c2ddea00,c2dd8360,c2ddea00,c23b4000,c2d4d800) at netbsd:udp_output+0x
146
udp_usrreq(c2d4e0cc,9,c2ddea00,0,0) at netbsd:udp_usrreq+0x1fc
nfs_timer(0,d154cc14,0,ce4c1b34,d154cc40) at netbsd:nfs_timer+0x22c
softclock(0,d154cc34,c02e836e,d154cc40,d154cc44) at netbsd:softclock+0x259
softintr_dispatch(0,10,30,10,10) at netbsd:softintr_dispatch+0x73
Xsoftclock() at netbsd:Xsoftclock+0x25
--- interrupt ---
ltsleep(c2cfa210,11,c03f492c,0,0) at netbsd:ltsleep+0x22c
lfs_segunlock(c2cfa000,d2604000,ce4e8000,d2604000,0) at netbsd:lfs_segunlock+0x1
18
lfs_segwrite(c21fd000,5,c080b444,ce4c3424,c21ffc18) at netbsd:lfs_segwrite+0x348
lfs_sync(c21fd000,2,c2195e80,ce4c1b34,d154cd9c) at netbsd:lfs_sync+0x4f
sys_sync(ce4c3424,0,0,0,100) at netbsd:sys_sync+0xba
vfs_shutdown(c03f75db,0,d154cdfc,c028cd9c,100) at netbsd:vfs_shutdown+0x5f
cpu_reboot(100,0,d154cdfc,c02a00c5,c2cfcbb8) at netbsd:cpu_reboot+0x155
panic(c03f75db,3,ce37de38,d154ce5c,16) at netbsd:panic+0x108
soreceive(c2cfcbb8,d154ce98,d154cea4,0,0) at netbsd:soreceive+0x8d7
recvit(ce4c1b34,3,d154cf14,bfbff44c,d154cf5c) at netbsd:recvit+0xfa
sys_recvfrom(ce4c3424,d154cf64,d154cf5c,0,c02f3136) at netbsd:sys_recvfrom+0x5e
syscall_plain() at netbsd:syscall_plain+0x7e
--- syscall (number 29) ---
0x480f0b6f:
End traceback...
dumping to dev 0,1 offset 3911591
dump 1535 1534 1533 1532 1531 1530 1529 1528 1527 1526 1525 1524 1523 1522 1521
...
>How-To-Repeat:
It seem to be: Try to use any machine ( at least i386 )
with option IPFILTER_DEFAULT_BLOCK enabled - you can't/won't reach
a multi-user prompt. The process running when the system dies is
usually but not always network related ( route and ifconfig are
common points at which the machines die, but I've seen several
deaths where the current porcess was lfs_cleanerd and even once
was /bin/sh.
>Fix:
Don't know; but I can make available kernels and core dumps for the
two crashes above ( including a "-g" kernel for the second ). Warning: After
compression the kernels and core dumps are a 49M download, expanded they're
over 3G.
>Release-Note:
>Audit-Trail:
>Unformatted: