Port-vax archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Hung machine
There seems to be some serious problem in NetBSD/vax. On the 8650, which
I'm testing rather heavily at the moment, I've now had three hangs
within a week. Always the same symptoms.
The processes that are running keep working, but no new processes get
started.
On the console, I captured the following:
Stopped in pid 0.2 (system) at netbsd:kpreempt_enable+0x2: subl2
$, sp
db> bt
Process 0.2
PCB contents:
KSP = 0x8613ff10
ESP = 0x8613e064
SSP = 0x83bbfd20
USP = 0x0
R[00] = 0x00000001 R[06] = 0x83be109c
R[01] = 0x00000008 R[07] = 0x83bbfd20
R[02] = 0x8613e000 R[08] = 0x80376410
R[03] = 0x00000018 R[09] = 0x800a67ac
R[04] = 0x80376400 R[10] = 0x800a14be
R[05] = 0x00000055 R[11] = 0x800a1066
AP = 0x8613ff38
FP = 0x8613ff24
PC = 0x80000733
PSL = 0x1f0000
Trap frame pointer: 0x8613ffb4
Stack traceback :
0x8613ff24: sched_curcpu_runnable_p+0x28(void)
0x8613ff40: idle_loop+0xce(0x83bbfd20)
0x8613ff64: cpu_lwp_bootstrap+0x15(0)
0x8613ffb4: trap type=0x0 code=0x0 pc=0x0 psl=0x3c00000
0x8613ff98: 0(panic: Segv in kernel mode: pc 800159e8 addr aaaaaaae
Stopped in pid 0.2 (system) at netbsd:upcallret: function
"upcallret()", entry-mask 0x7c0
remqhi *0x4ac(r0), r6
db> ps
PID LID S CPU FLAGS STRUCT LWP * NAME WAIT
19390 1 3 0 0 8168e000 tcsh vm_map
25249 1 3 0 0 8168e2a0 cron vm_map
15807 1 3 0 0 8168e540 cron wait
19136 1 3 0 0 8168e7e0 cron vm_map
19106 1 3 0 0 8168ea80 cron wait
19452 1 3 0 0 8168ed20 cron vm_map
20407 1 3 0 0 81d9e020 cron vm_map
19591 1 3 0 0 81d9e2c0 cron wait
20239 1 3 0 0 81d9e560 cron wait
8420 1 3 0 0 81d9e800 sshd vm_map
26442 1 3 0 0 81d9eaa0 cron vm_map
8876 1 3 0 0 81d9ed40 cron wait
6882 1 3 0 0 80f0ba80 sshd vm_map
20393 1 3 0 0 80f0bd20 cron vm_map
19517 1 3 0 0 80f0b7e0 cron wait
7858 1 3 0 0 80f0b540 cron vm_map
14547 1 3 0 0 80f0b2a0 cron wait
18075 1 3 0 0 836f72a0 cron vm_map
15846 1 3 0 0 836f77e0 cron wait
20437 1 3 0 0 836f7000 cron vm_map
19350 1 3 0 0 836f7a80 cron wait
26405 1 3 0 0 836f7d20 cron vm_map
19666 1 3 0 0 81562540 cron vm_map
21219 1 3 0 0 815627e0 cron wait
7655 1 3 0 0 81562000 cron wait
18335 1 3 0 0 81039aa0 cron vm_map
7720 1 3 0 0 81039800 cron wait
19916 1 3 0 0 81039560 cron vm_map
19121 1 3 0 0 81039d40 cron wait
16238 1 3 0 0 8394aa80 cron vm_map
19448 1 3 0 0 82059540 cron wait
20219 1 3 0 0 820592a0 cron vm_map
19946 1 3 0 0 82059a80 cron wait
20171 1 3 0 0 82059000 cron vm_map
13369 1 3 0 0 808512c0 cron wait
20420 1 3 0 0 80851560 sh vm_map
20538 1 3 0 0 82323540 master vm_map
21001 1 3 0 0 80851d40 cron vm_map
19817 1 3 0 0 80851020 cron vm_map
25573 1 3 0 0 80851800 cron wait
19400 1 3 0 0 80851aa0 cron wait
14134 1 3 0 0 80f0b000 sh wait
18669 1 3 0 80 820597e0 postdrop netio
18931 1 3 0 80 8188f800 sendmail pipe_rd
20332 1 3 0 80 81039020 tee pipe_rd
19664 1 3 0 80 810392c0 sh wait
24161 1 3 0 80 82059d20 sh wait
20369 1 3 0 80 8188fd40 cron pipe_rd
12824 1 3 0 80 836f7540 comsat netio
731 1 3 0 80 81562d20 top select
440 1 3 0 80 815622a0 tcsh pause
1047 1 3 0 80 81562a80 sshd select
1199 1 3 0 80 8188f560 systat ttyraw
899 1 3 0 80 8188f2c0 tcsh pause
96 1 3 0 80 8188f020 sshd netio
1015 1 3 0 80 82323a80 sshd select
136 1 3 0 80 8188faa0 sshd netio
444 1 3 0 0 8394ad20 tcsh wait
682 1 3 0 80 83b612c0 tcsh pause
780 1 3 0 80 83b61020 sshd select
502 1 3 0 80 83b757e0 sshd netio
772 1 3 0 0 83b75540 getty vm_map
683 1 3 0 80 823237e0 qmgr kqueue
753 1 3 0 80 823232a0 cron nanoslp
699 1 3 0 80 82323000 inetd kqueue
693 1 3 0 80 82b9a560 master kqueue
410 1 3 0 80 82323d20 sshd select
377 1 3 0 80 82b9a020 rwhod select
351 1 3 0 80 82b9a2c0 ntpd pause
344 1 3 0 80 8394a000 lpd select
274 1 3 0 80 82b9a800 rpc.lockd select
275 1 3 0 80 82b9aaa0 rpc.statd select
254 5 3 0 80 82b9ad40 slave nfsd
254 4 3 0 80 8329e000 slave nfsd
254 3 3 0 80 8329e2a0 slave nfsd
254 2 3 0 80 8329e540 slave nfsd
254 1 3 0 80 8329e7e0 master select
244 1 3 0 80 839e2800 mountd select
197 1 3 0 80 839e2560 mount_mfs mfsidl
174 1 3 0 80 839e2d40 ypbind select
176 1 3 0 80 8394a7e0 rpcbind select
173 1 3 0 80 839e2aa0 syslogd kqueue
1 1 3 0 80 83b61aa0 init wait
0 36 3 0 200 8329ea80 nfsio nfsiod
0 35 3 0 200 8329ed20 nfsio nfsiod
0 34 3 0 200 839e2020 nfsio nfsiod
0 33 3 0 200 839e22c0 nfsio nfsiod
0 32 3 0 200 8394a540 ccd0 ccdthr
0 31 3 0 200 8394a2a0 physiod physiod
0 30 3 0 200 83b752a0 aiodoned aiodoned
0 29 3 0 200 83b61d40 ioflush syncer
0 28 3 0 200 83b75000 pgdaemon pgdaemon
0 25 3 0 200 83b61800 unpgc unpgc
0 24 3 0 200 83b61560 vmem_rehash vmem_rehash
0 15 3 0 200 83b75a80 mscp_wq mscp_wq
0 14 3 0 200 83b75d20 mscp_wq mscp_wq
0 13 3 0 200 83ba8020 pmfsuspend pmfsuspend
0 12 3 0 200 83ba82c0 pmfevent pmfevent
0 11 3 0 200 83ba8560 nfssilly nfssilly
0 10 3 0 200 83ba8800 cachegc cachegc
0 9 3 0 200 83ba8aa0 vrele vrele
0 8 3 0 200 83ba8d40 modunload mod_unld
0 7 3 0 200 83bbf000 xcall/0 xcall
0 6 1 0 200 83bbf2a0 softser/0
0 5 1 0 200 83bbf540 softclk/0
0 4 1 0 200 83bbf7e0 softbio/0
0 3 1 0 200 83bbfa80 softnet/0
0 > 2 7 0 201 83bbfd20 idle/0
0 1 3 0 200 80206500 swapper uvm
Note the number of cron processes, and that a lot of them are waiting
for vm_map.
The system is letting the idle process run, and there don't seem to be
anything funny about that part.
The header from top, at this point:
load averages: 23.3, 21.9, 20.2; up 1+09:06:25
10:23:0
79 processes: 24 runnable, 54 sleeping, 1 on CPU
CPU states: 9.2% user, 0.0% nice, 19.3% system, 0.8% interrupt, 70.7%
idle
Memory: 28M Act, 14M Inact, 4464K Wired, 5580K Exec, 27M File, 984K Free
Swap: 256M Total, 39M Used, 217M Free
Lots of idle time, there is free memory around, so I don't know what is
going on here... The only other thing I noticed is that a lot of those
cron processes have 0 memory resident, so it would appear they are
swapped out, and not just paged out. Anything broken in the swapping
part perhaps?
Johnny
--
Johnny Billquist || "I'm on a bus
|| on a psychedelic trip
email: bqt%softjar.se@localhost || Reading murder books
pdp is alive! || tryin' to stay hip" - B. Idol
Home |
Main Index |
Thread Index |
Old Index