Port-vax archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Hung machine



There seems to be some serious problem in NetBSD/vax. On the 8650, which I'm testing rather heavily at the moment, I've now had three hangs within a week. Always the same symptoms. The processes that are running keep working, but no new processes get started.

On the console, I captured the following:

Stopped in pid 0.2 (system) at netbsd:kpreempt_enable+0x2: subl2 $, sp
db> bt
Process 0.2
         PCB contents:
 KSP = 0x8613ff10
 ESP = 0x8613e064
 SSP = 0x83bbfd20
 USP = 0x0
 R[00] = 0x00000001       R[06] = 0x83be109c
 R[01] = 0x00000008       R[07] = 0x83bbfd20
 R[02] = 0x8613e000       R[08] = 0x80376410
 R[03] = 0x00000018       R[09] = 0x800a67ac
 R[04] = 0x80376400       R[10] = 0x800a14be
 R[05] = 0x00000055       R[11] = 0x800a1066
 AP = 0x8613ff38
 FP = 0x8613ff24
 PC = 0x80000733
 PSL = 0x1f0000
 Trap frame pointer: 0x8613ffb4
Stack traceback :
0x8613ff24: sched_curcpu_runnable_p+0x28(void)
0x8613ff40: idle_loop+0xce(0x83bbfd20)
0x8613ff64: cpu_lwp_bootstrap+0x15(0)
0x8613ffb4: trap type=0x0 code=0x0 pc=0x0 psl=0x3c00000
0x8613ff98: 0(panic: Segv in kernel mode: pc 800159e8 addr aaaaaaae
Stopped in pid 0.2 (system) at    netbsd:upcallret:       function
 "upcallret()", entry-mask 0x7c0
                remqhi  *0x4ac(r0), r6
db> ps
PID    LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
19390    1 3   0         0           8168e000               tcsh vm_map
25249    1 3   0         0           8168e2a0               cron vm_map
15807    1 3   0         0           8168e540               cron wait
19136    1 3   0         0           8168e7e0               cron vm_map
19106    1 3   0         0           8168ea80               cron wait
19452    1 3   0         0           8168ed20               cron vm_map
20407    1 3   0         0           81d9e020               cron vm_map
19591    1 3   0         0           81d9e2c0               cron wait
20239    1 3   0         0           81d9e560               cron wait
8420     1 3   0         0           81d9e800               sshd vm_map
26442    1 3   0         0           81d9eaa0               cron vm_map
8876     1 3   0         0           81d9ed40               cron wait
6882     1 3   0         0           80f0ba80               sshd vm_map
20393    1 3   0         0           80f0bd20               cron vm_map
19517    1 3   0         0           80f0b7e0               cron wait
7858     1 3   0         0           80f0b540               cron vm_map
14547    1 3   0         0           80f0b2a0               cron wait
18075    1 3   0         0           836f72a0               cron vm_map
15846    1 3   0         0           836f77e0               cron wait
20437    1 3   0         0           836f7000               cron vm_map
19350    1 3   0         0           836f7a80               cron wait
26405    1 3   0         0           836f7d20               cron vm_map
19666    1 3   0         0           81562540               cron vm_map
21219    1 3   0         0           815627e0               cron wait
7655     1 3   0         0           81562000               cron wait
18335    1 3   0         0           81039aa0               cron vm_map
7720     1 3   0         0           81039800               cron wait
19916    1 3   0         0           81039560               cron vm_map
19121    1 3   0         0           81039d40               cron wait
16238    1 3   0         0           8394aa80               cron vm_map
19448    1 3   0         0           82059540               cron wait
20219    1 3   0         0           820592a0               cron vm_map
19946    1 3   0         0           82059a80               cron wait
20171    1 3   0         0           82059000               cron vm_map
13369    1 3   0         0           808512c0               cron wait
20420    1 3   0         0           80851560                 sh vm_map
20538    1 3   0         0           82323540             master vm_map
21001    1 3   0         0           80851d40               cron vm_map
19817    1 3   0         0           80851020               cron vm_map
25573    1 3   0         0           80851800               cron wait
19400    1 3   0         0           80851aa0               cron wait
14134    1 3   0         0           80f0b000                 sh wait
18669    1 3   0        80           820597e0           postdrop netio
18931    1 3   0        80           8188f800           sendmail pipe_rd
20332    1 3   0        80           81039020                tee pipe_rd
19664    1 3   0        80           810392c0                 sh wait
24161    1 3   0        80           82059d20                 sh wait
20369    1 3   0        80           8188fd40               cron pipe_rd
12824    1 3   0        80           836f7540             comsat netio
731      1 3   0        80           81562d20                top select
440      1 3   0        80           815622a0               tcsh pause
1047     1 3   0        80           81562a80               sshd select
1199     1 3   0        80           8188f560             systat ttyraw
899      1 3   0        80           8188f2c0               tcsh pause
96       1 3   0        80           8188f020               sshd netio
1015     1 3   0        80           82323a80               sshd select
136      1 3   0        80           8188faa0               sshd netio
444      1 3   0         0           8394ad20               tcsh wait
682      1 3   0        80           83b612c0               tcsh pause
780      1 3   0        80           83b61020               sshd select
502      1 3   0        80           83b757e0               sshd netio
772      1 3   0         0           83b75540              getty vm_map
683      1 3   0        80           823237e0               qmgr kqueue
753      1 3   0        80           823232a0               cron nanoslp
699      1 3   0        80           82323000              inetd kqueue
693      1 3   0        80           82b9a560             master kqueue
410      1 3   0        80           82323d20               sshd select
377      1 3   0        80           82b9a020              rwhod select
351      1 3   0        80           82b9a2c0               ntpd pause
344      1 3   0        80           8394a000                lpd select
274      1 3   0        80           82b9a800          rpc.lockd select
275      1 3   0        80           82b9aaa0          rpc.statd select
254      5 3   0        80           82b9ad40              slave nfsd
254      4 3   0        80           8329e000              slave nfsd
254      3 3   0        80           8329e2a0              slave nfsd
254      2 3   0        80           8329e540              slave nfsd
254      1 3   0        80           8329e7e0             master select
244      1 3   0        80           839e2800             mountd select
197      1 3   0        80           839e2560          mount_mfs mfsidl
174      1 3   0        80           839e2d40             ypbind select
176      1 3   0        80           8394a7e0            rpcbind select
173      1 3   0        80           839e2aa0            syslogd kqueue
1        1 3   0        80           83b61aa0               init wait
0       36 3   0       200           8329ea80              nfsio nfsiod
0       35 3   0       200           8329ed20              nfsio nfsiod
0       34 3   0       200           839e2020              nfsio nfsiod
0       33 3   0       200           839e22c0              nfsio nfsiod
0       32 3   0       200           8394a540               ccd0 ccdthr
0       31 3   0       200           8394a2a0            physiod physiod
0       30 3   0       200           83b752a0           aiodoned aiodoned
0       29 3   0       200           83b61d40            ioflush syncer
0       28 3   0       200           83b75000           pgdaemon pgdaemon
0       25 3   0       200           83b61800              unpgc unpgc
0       24 3   0       200           83b61560        vmem_rehash vmem_rehash
0       15 3   0       200           83b75a80            mscp_wq mscp_wq
0       14 3   0       200           83b75d20            mscp_wq mscp_wq
0       13 3   0       200           83ba8020         pmfsuspend pmfsuspend
0       12 3   0       200           83ba82c0           pmfevent pmfevent
0       11 3   0       200           83ba8560           nfssilly nfssilly
0       10 3   0       200           83ba8800            cachegc cachegc
0        9 3   0       200           83ba8aa0              vrele vrele
0        8 3   0       200           83ba8d40          modunload mod_unld
0        7 3   0       200           83bbf000            xcall/0 xcall
0        6 1   0       200           83bbf2a0          softser/0
0        5 1   0       200           83bbf540          softclk/0
0        4 1   0       200           83bbf7e0          softbio/0
0        3 1   0       200           83bbfa80          softnet/0
0    >   2 7   0       201           83bbfd20             idle/0
0        1 3   0       200           80206500            swapper uvm


Note the number of cron processes, and that a lot of them are waiting for vm_map. The system is letting the idle process run, and there don't seem to be anything funny about that part.

The header from top, at this point:

load averages: 23.3, 21.9, 20.2; up 1+09:06:25 10:23:0
79 processes: 24 runnable, 54 sleeping, 1 on CPU
CPU states: 9.2% user, 0.0% nice, 19.3% system, 0.8% interrupt, 70.7% idle
Memory: 28M Act, 14M Inact, 4464K Wired, 5580K Exec, 27M File, 984K Free
Swap: 256M Total, 39M Used, 217M Free


Lots of idle time, there is free memory around, so I don't know what is going on here... The only other thing I noticed is that a lot of those cron processes have 0 memory resident, so it would appear they are swapped out, and not just paged out. Anything broken in the swapping part perhaps?

        Johnny

--
Johnny Billquist                  || "I'm on a bus
                                  ||  on a psychedelic trip
email: bqt%softjar.se@localhost             ||  Reading murder books
pdp is alive!                     ||  tryin' to stay hip" - B. Idol


Home | Main Index | Thread Index | Old Index