NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

kern/59339: heartbeat watchdog fires since 10.99.14



>Number:         59339
>Category:       kern
>Synopsis:       heartbeat watchdog fires since 10.99.14
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Apr 21 19:40:00 +0000 2025
>Originator:     Patrick Welche
>Release:        NetBSD-10.99.14/amd64
>Organization:
>Environment:
src of 8:05 UTC 17 April 2025
>Description:
Possibly related to other hangs in the thread
https://mail-index.netbsd.org/current-users/2025/04/17/msg046436.html

A formerly stable laptop has had two heartbeat panics

System panicked: cpu0: softints stuck for 16 seconds
crash> bt
end() at 0
kern_reboot() at kern_reboot+0x93
vpanic() at vpanic+0x16b
panic() at vprintf
heartbeat() at heartbeat+0x1f2
hardclock() at hardclock+0x9c
Xresume_lapic_ltimer() at Xresume_lapic_ltimer+0x1e
--- interrupt ---
mutex_spin_exit() at mutex_spin_exit+0x5a
callout_softclock() at callout_softclock+0xad
softint_dispatch() at softint_dispatch+0x8f
crash> ps
PID     LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
2917 > 2917 7   0   8060000   ffff8052e4a14000                tar
0    >    5 7   0       200   ffff8055abee1c00          softclk/0



System panicked: cpu1[27 softclk/1]: heart stopped beating
crash> bt
end() at 0
kern_reboot() at kern_reboot+0x93
vpanic() at vpanic+0x16b
panic() at vprintf
defibrillator() at defibrillator+0x6b
ipi_msg_cpu_handler() at ipi_msg_cpu_handler+0x50
ipi_cpu_handler() at ipi_cpu_handler+0x73
x86_ipi_handler() at x86_ipi_handler+0x6b
Xresume_lapic_ipi() at Xresume_lapic_ipi+0x18
--- interrupt ---
getnanotime() at getnanotime+0x53
tshzto() at tshzto+0x1b
itimer_callout() at itimer_callout+0xb1
callout_softclock() at callout_softclock+0xad
softint_dispatch() at softint_dispatch+0x8f
crash: _kvm_kvatop(ffff9704a471b0b8)
crash: kvm_read(0xffff9704a471b0b8, 8): invalid translation (invalid PTE)
crash> ps
PID     LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
1266 > 1266 7   0   8020100   ffff81f02e1a7800               cron
992  >  992 7   1   8020000   ffff81f026788c00                  X
857  >  857 7   2   8020100   ffff81f027510400                xdm
0    >   27 7   1       200   ffff81f020f56400          softclk/1


I can't think of any similarities between the two situations.
>How-To-Repeat:
Not obviously reproducible, e.g., the untar during which the first
hang happened succeeded on reboot. firefox was running the first, not
the second, in fact not much was going on in the second.
>Fix:



Home | Main Index | Thread Index | Old Index