Port-sparc archive
Re: SS20 - Upgrade to 9.2 crashes with memory errors
Hi,
David Brownlee wrote:
> You mention it was working fine with 9.0 - do you still have the
> /onetbsd kernel you can boot?
Yes, by luck I have a "netbsd.mp" kernel lying around which appears
to be 9.0 MP. I will test it, and also use it to download the 9.1 kernel you suggested.
However I am very confused: I see this:
-rwxr-xr-x 1 root wheel 5241684 May 12 15:15 netbsd
-rw-r--r-- 1 multix users 5212232 May 13 01:42 netbsd-install
netbsd is the one installed by the system upgrade, the second is the
netbsd-install kernel I downloaded.
What is a netbsd-install kernel actually? It boots to login (I hoped it
was like bsd.rd of OpenBSD).
It booted fine. I attach dmesg below!
However, while I was tinkering with dmesg to copy some data,
netbsd-install died too!
[ 712.7409470] memory error:
[ 712.7409470] EFSR: 0x12601<CE,DW=0x0,SYNDROME=0x26,ME>
[ 712.7409470] MBus transaction: 0xafc74d50<VAH=0x0,TYPE=0x5,SIZE=0x5,C,VA=0x1d,S,MID=0xa>
[ 712.7409470] address: 0x0ab5fd48
[ 712.7409470] module location: J0202
[ 712.7409470] Async registers (mid 10): afsr=0xf0<AFA=0xf>; afva=0xf0
[ 712.7409470] Async registers (mid 8): afsr=0xf0<AFA=0xf>; afva=0xf0
[ 712.7409470] cpu0: NMI: system interrupts: 0x10088000<VME=0x0,SBUS=0x0,S,T,M>
[ 712.7409470] SX STATUS: 00005400
[ 712.7409470] SX ERROR : 00000000
[ 712.7409470] SX DIAG : 00000000
[ 712.7409470] memory error:
and a reboot:
db{0}> reboot
syncing disks... xcall(cpu0,0xf0008d54) from 0xf000b87c: couldn't ping cpus: cpu1
xcall(cpu0,0xf0008c78) from 0xf00362ec: couldn't ping cpus: cpu1
xcall(cpu0,0xf0008c3c) from 0xf0034414: couldn't ping cpus: cpu1
xcall(cpu0,0xf0008c3c) from 0xf0034414: couldn't ping cpus: cpu1
xcall(cpu0,0xf0008c78) from 0xf00362ec: couldn't ping cpus: cpu1
xcall(cpu0,0xf0008c3c) from 0xf0034414: couldn't ping cpus: cpu1
I lost one :) argh!
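For reading the EFSR line in the memory error report above, here is a minimal Python sketch that unpacks the raw value 0x12601 into the fields the kernel printed. The bit positions are an assumption inferred only from that one printed line (CE at bit 0, DW in bits 4-7, SYNDROME in bits 8-15, ME at bit 16); they are not taken from the ECC controller documentation, and the names `EFSR_FLAGS`, `EFSR_FIELDS` and `decode_efsr` are made up for illustration.

```python
# Assumed layout, inferred from "0x12601<CE,DW=0x0,SYNDROME=0x26,ME>".
# Bits not shown in that line are simply ignored here.
EFSR_FLAGS = {0: "CE", 16: "ME"}                  # single-bit flags: bit -> name
EFSR_FIELDS = {"DW": (4, 4), "SYNDROME": (8, 8)}  # name -> (shift, width)


def decode_efsr(value):
    """Split a raw EFSR value into the flags and fields assumed above."""
    decoded = {}
    for bit, name in EFSR_FLAGS.items():
        decoded[name] = bool((value >> bit) & 1)
    for name, (shift, width) in EFSR_FIELDS.items():
        decoded[name] = (value >> shift) & ((1 << width) - 1)
    return decoded


if __name__ == "__main__":
    for name, field in decode_efsr(0x12601).items():
        print(name, hex(field) if not isinstance(field, bool) else field)
```

If the assumed layout is right, the syndrome (0x26) together with the reported module location (J0202) is what one would compare across repeated reports to see whether the same memory module is always involved.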
>
> Alternatively, I'd be curious to see how a current or netbsd-9.1
> perform (once you have a running system just copy the various kernels
> to some /netbsdXXX name and then test boot. The latest current (HEAD)
> and branch kernels are always available at http://nyftp.netbsd.org/
Hmm... I couldn't install it. I booted the 9.0 kernel and it too enters
that memory error loop after boot.
Did my hardware go bad? I hope not!
I went crazy getting two replacement HyperSparc modules!!! And they
worked like a charm some months ago, even under high load. Then the box
sat unused all summer... and now that I have turned it on again, it is not
even hot here.
Or memory?
Or, indeed, something in the kernel. Could it be that the "new" OS causes
issues even with the old kernel?
PS: I actually found out that while spitting out these errors, the
machine usually continues to work slowly, although it might drop into the
debugger (where one can "continue").
Thus, when using this computer headless over the network, as I usually do,
one does not notice. Only with a console attached (or by checking dmesg).
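Since the errors go unnoticed on a headless machine, one way to check for them is to scan saved dmesg output for the start of each report. A minimal Python sketch, assuming each report begins with a timestamped, unwrapped "memory error:" line as in the log above; the function name `memory_error_times` and the sample text are illustrative only.

```python
import re

# Matches lines such as "[ 712.7409470] memory error:" and captures the timestamp.
MEMORY_ERROR = re.compile(r"^\[\s*(\d+\.\d+)\]\s+memory error:")


def memory_error_times(dmesg_text):
    """Return the kernel timestamps (seconds since boot) of each memory error report."""
    times = []
    for line in dmesg_text.splitlines():
        match = MEMORY_ERROR.match(line)
        if match:
            times.append(float(match.group(1)))
    return times


if __name__ == "__main__":
    sample = (
        "[ 5.6287420] root on sd0a dumps on sd0b\n"
        "[ 712.7409470] memory error:\n"
        "[ 712.7409470] address: 0x0ab5fd48\n"
    )
    times = memory_error_times(sample)
    print(len(times), "memory error report(s) at", times)
```

The same function could be fed the text of a saved dmesg capture, so that a non-empty result flags the problem without a console attached.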
Booting netbsd-install
4440060+133068+139324 [296752+281752]=0x51c168
OBP version 3, revision 2.25 (plugin rev 2)
[ 1.0000000] Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002,
2003, 2004, 2005,
[ 1.0000000] 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014,
2015, 2016, 2017,
[ 1.0000000] 2018, 2019, 2020 The NetBSD Foundation, Inc. All
rights reserved.
[ 1.0000000] Copyright (c) 1982, 1986, 1989, 1991, 1993
[ 1.0000000] The Regents of the University of California. All
rights reserved.
[ 1.0000000] NetBSD 9.2 (GENERIC.MP) #0: Wed May 12 13:15:55 UTC 2021
[ 1.0000000]
mkrepro%mkrepro.NetBSD.org@localhost:/usr/src/sys/arch/sparc/compile/GENERIC.MP
[ 1.0000000] total memory = 223 MB
[ 1.0000000] avail memory = 213 MB
[ 1.0000000] bootpath:
/iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
[ 1.0000000] mainbus0 (root): SUNW,SPARCstation-20: hostid 724c434b
[ 1.0000000] cpu0 at mainbus0: mid 8: Ross,RT625 @ 125 MHz, on-chip FPU
[ 1.0000000] cpu0: 256K byte write-back, 64 bytes/line, sw flush:
cache enabled
[ 1.0000000] cpu1 at mainbus0: mid 10: Ross,RT625 @ 125 MHz, on-chip FPU
[ 1.0000000] cpu1: 256K byte write-back, 64 bytes/line, sw flush:
cache enabled
[ 1.0000000] sx0 at mainbus0 ioaddr 0x80000000
[ 1.0000000] sx0: architecture rev. 3 chip rev. 3
[ 1.0000000] obio0 at mainbus0
[ 1.0000000] clock0 at obio0 slot 0 offset 0x200000: mk48t08
[ 1.0000000] timer0 at obio0 slot 0 offset 0x300000: delay constant
40, frequency = 2000000 Hz
[ 1.0000060] zs0 at obio0 slot 0 offset 0x100000 level 12 softpri 6
[ 1.0000060] zstty0 at zs0 channel 0 (console i/o)
[ 1.0000060] zstty1 at zs0 channel 1
[ 1.0000060] zs1 at obio0 slot 0 offset 0x0 level 12 softpri 6
[ 1.0000060] zstty4 at zs1 channel 0
[ 1.0000060] kbd0 at zstty4
[ 1.0000060] zstty5 at zs1 channel 1
[ 1.0000060] ms0 at zstty5
[ 1.0000060] wsmouse0 at ms0 mux 0
[ 1.0000060] fdc0 at obio0 slot 0 offset 0x700000 level 11 softpri 4:
chip 82077
[ 1.0000060] fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
[ 1.0000060] auxreg0 at obio0 slot 0 offset 0x800000
[ 1.0000060] power0 at obio0 slot 0 offset 0xa01000 level 2
[ 1.0000060] iommu0 at mainbus0 ioaddr 0xe0000000: version 0x3/0x1,
page-size 4096, range 64MB
[ 1.0000060] sbus0 at iommu0: clock = 25 MHz
[ 1.0000060] dma0 at sbus0 slot 15 offset 0x400000: DMA rev 2
[ 1.0000060] esp0 at dma0 slot 15 offset 0x800000 level 4: ESP200,
40MHz, SCSI ID 7
[ 1.0000060] scsibus0 at esp0: 8 targets, 8 luns per target
[ 1.0000060] ledma0 at sbus0 slot 15 offset 0x400010: DMA rev 2
[ 1.0000060] le0 at ledma0 slot 15 offset 0xc00000 level 6: address
08:00:20:4c:43:4b
[ 1.0000060] le0: 8 receive buffers, 2 transmit buffers
[ 1.0000060] bpp0 at sbus0 slot 15 offset 0x4800000 level 2 (ipl 3):
DMA rev 2
[ 1.0000060] dbri0 at sbus0 slot 14 offset 0x10000 level 9: rev e
[ 1.0000060] cgsix0 at sbus0 slot 3 offset 0x0 level 9: SUNW,501-2325,
1152 x 900, rev 11
[ 1.0000060] cgsix0: attached to /dev/fb0
[ 1.0000060] cgsix0: framebuffer size: 1 MB
[ 1.0000060] wsdisplay1 at cgsix0 kbdmux 1
[ 1.0000060] eccmemctl0 at mainbus0 ioaddr 0x0: version 0x0/0x2
[ 1.0000060] cpu0: booting secondary processors: cpu1
[ 1.4585455] scsibus0: waiting 2 seconds for devices to settle...
[ 1.4688470] wskbd0 at kbd0 mux 1
[0 target 1 lun 0: <WDIGTL, ENTERPRISE, 1.91> disk fixed
[ 3.8586420] sd0: 4157 MB, 5720 cyl, 8 head, 186 sec, 512 bytes/sect x
8515173 sectors
[ 3.9586575] sd0: sync (100.00ns offset 15), 8-bit (10.000MB/s)
transfers, tagged queueing
[ 4.2086580] sd1 at scsibus0 target 3 lun 0: <WDIGTL, ENTERPRISE,
1.91> disk fixed
[ 4.3986675] sd1: 4157 MB, 5720 cyl, 8 head, 186 sec, 512 bytes/sect x
8515173 sectors
[ 4.4892465] sd1: sync (100.00ns offset 15), 8-bit (10.000MB/s)
transfers, tagged queueing
[ 4.5986770] kbd0: reset failed
[ 4.9686985] cd0 at scsibus0 target 6 lun 0: <TOSHIBA,
XM-4101TASUNSLCD, 3424> cdrom removable
[ 5.0688925] cd0: async, 8-bit transfers
[
[ 5.3668120] dbri0: cs4215 rev E found at offset 8
[ 5.3691930] audio0 at dbri0: playback, capture, full duplex
[ 5.4492530] audio0: slinear_be:16 2ch 48000Hz, blk 7680 bytes (40ms)
for playback
[ 5.5290695] audio0: slinear_be:16 2ch 48000Hz, blk 7680 bytes (40ms)
for recording
[ 5.6287420] root on sd0a dumps on sd0b
[ 5.6887415] root file system type: ffs
[ 5.7387500] kern.module.path=/stand/sparc/9.2/modules
Tue Oct 19 17:35:37 GMT 2021
Starting root file system check:
/dev/rsd0a: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
/dev/rsd0a: 7182 files, 111175 used, 1163425 free (457 frags, 145371
blocks, 0.0% fragmentation)
/dev/rsd0a: MARKING FILE SYSTEM CLEAN
swapctl: setting dump device to /dev/sd0b
swapctl: adding /dev/sd0b as swap device at priority 0
Starting file system checks:
/dev/rsd0d: 50292 files, 712379 used, 43677 free (4885 frags, 4849
blocks, 0.6% fragmentation)
/dev/rsd0d: MARKING FILE SYSTEM CLEAN