Port-sgimips archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Cache error on R5000 Challenge S
Hi,
I'm using a R5000/180 Challenge S as a mail/name/web server on my home
network. It's running 4.0 and after being up for 61 days it crashed with
a cache error. I've included the dmesg, stack backtrace and process list
below. I tried rebooting from the db> prompt but it threw another cache
error while running the rc scripts, details also included below. I then
did a hardware reset and it's now been running for 3 days with no
further errors. Could this be a hardware problem or is it more likely to
be software? If software, is there any additional information I should
collect the next time it happens?
Regards,
George
=== console output showing initial cache error then a second error
=== on reboot
db> dmesg
tlp0: receive error: dribbling bit
tlp0: receive error: CRC error
# lots of tlp0 errors deleted to save space
tlp0: receive error: CRC error
panic: cache error @ EPC 0x880696e0 ErrCtl 0x0 CacheErr 0xa42c77e3
db> bt
cpu_Debugger+4 (97fff000,d,0,bfbd9830) ra 8823cdbc sz 0
panic+190 (97fff000,880696e0,0,a42c77e3) ra 882c3278 sz 48
pmap_copy_page+b4 (97fff000,880696e0,0,a42c77e3) ra 881cb070 sz 40
uvmfault_promote+94 (97fff000,880696e0,ffffffff,a42c77e3) ra 881cc358 sz
64 uvm_fault_internal+d28 (96ceb2d8,1000,2,0) ra 882c56b8 sz 296
trap+684 (2000ff13,1000,2,7dea1ea0) ra 882beb44 sz 112
mips3_UserGenException+cc (2000ff13,1000,2,7dea1ea0) ra 0 sz 0
User-level: pid 831.1
db> ps
PID PPID PGRP UID S FLAGS LWPS COMMAND WAIT
16735 10000 3901 1006 2 0x4000 1 spamc netio
10000 3901 3901 1006 2 0x100 1 procmail wait
3901 26259 3901 1006 2 0x4100 1 procmail piperd
26259 361 361 0 2 0x4108 1 local poll
1229 361 361 12 2 0x4108 1 cleanup kqread
1960 361 361 12 2 0x4108 1 trivial-rewrite kqread
18082 361 361 12 2 0x4108 1 proxymap kqread
5078 361 361 12 2 0x4108 1 smtpd kqread
20014 361 361 12 2 0x4108 1 pickup kqread
>831 18753 18753 0 2 0x100 1 perl
15942 24126 15942 1006 2 0x4002 1 wish8.4 select
24126 21598 24126 1006 2 0x4002 1 bash wait
21598 1 7549 1006 2 0x4002 1 xterm select
3655 1 3655 0 2 0 1 rarpd select
8678 16722 8678 0 2 0x4002 1 bash ttyin
16722 21360 16722 0 2 0x4002 1 sh wait
21360 5542 21360 1006 2 0x4002 1 bash wait
5542 1 23082 1006 2 0x4002 1 xterm select
26901 3985 26901 1006 2 0x4002 1 bash ttyin
3985 5022 3985 0 2 0x4102 1 login wait
5022 669 669 0 2 0x4000 1 telnetd poll
28628 18753 18753 0 2 0x100 1 perl select
27387 25797 25797 1000 2 0x100 1 httpd netcon
8951 25797 25797 1000 2 0x100 1 httpd netcon
8458 25797 25797 1000 2 0x100 1 httpd netcon
19596 1 19596 0 2 0 1 named select
3326 25797 25797 1000 2 0x100 1 httpd netcon
26162 25797 25797 1000 2 0x100 1 httpd netcon
10353 25797 25797 1000 2 0x100 1 httpd netcon
12748 25797 25797 1000 2 0x100 1 httpd netcon
2313 25797 25797 1000 2 0x100 1 httpd netcon
16064 25797 25797 1000 2 0x100 1 httpd netcon
29410 25797 25797 1000 2 0x100 1 httpd netcon
25797 1 25797 0 2 0 1 httpd select
18753 1 18753 0 2 0x4002 1 perl select
4616 1 9385 1006 2 0x4002 1 tclsh8.4 select
2883 1 24611 0 2 0 1 snmpd select
8118 1 8118 0 2 0x4002 1 getty ttyin
755 0 0 0 2 0x20200 1 nfsio nfsidl
352 0 0 0 2 0x20200 1 nfsio nfsidl
356 0 0 0 2 0x20200 1 nfsio nfsidl
647 0 0 0 2 0x20200 1 nfsio nfsidl
227 1 227 0 2 0 1 cron nanosle
225 361 361 12 2 0x4108 1 qmgr kqread
669 1 669 0 2 0 1 inetd kqread
361 1 361 0 2 0x4108 1 master kqread
502 1 502 0 2 0 1 sshd select
467 1 467 0 2 0 1 mopd poll
428 1 428 0 2 0 1 dhcpd select
404 1 404 0 2 0 1 rpc.bootparamd select
366 359 359 0 2 0 1 nfsd nfsd
363 359 359 0 2 0 1 nfsd nfsd
348 359 359 0 2 0 1 nfsd nfsd
346 359 359 0 2 0 1 nfsd nfsd
359 1 359 0 2 0 1 nfsd poll
328 1 328 0 2 0 1 mountd select
271 1 271 0 2 0 1 rpcbind poll
242 1 242 0 2 0 1 syslogd kqread
201 1 201 0 2 0 1 routed select
27 0 0 0 2 0x20200 1 physiod physiod
5 0 0 0 2 0x20200 1 aiodoned aiodone
4 0 0 0 2 0x20200 1 ioflush syncer
3 0 0 0 2 0x20200 1 pagedaemon pgdaemo
2 0 0 0 2 0x20200 1 scsibus0 sccomp
1 0 1 0 2 0x4001 1 init wait
0 -1 0 0 2 0x20200 1 swapper schedul
db> reboot
syncing disks... 12 tlp0: receive ring overrun
12 12 11 10 9 8 7 6 6 5 5 5 4 3 2 1 done
rebooting...
Starting up the system...
NetBSD/sgimips 4.0_BETA2 Bootstrap, Revision 1.2
(root%indy3.ceiridos.co.uk@localhost, Mon Mar 19 22:13:55 GMT 2007)
devopen: scsi(0)disk(2)rdisk(0)partition(0) type scsi file netbsd
3314416+192508 [187408+179406]=0x3b1f98
Found bootinfo at 0x8800d710
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004,
2005, 2006, 2007
The NetBSD Foundation, Inc. All rights reservd.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of he University of California. All rights reservd.
NetBSD 4.0 (GENERIC32_IP2x) #0: Sun Dec 1 02:11:52 PST 2007
builds@wb41:/home/builds/ab/netbsd-4-0-RELEASE/sgimips/200712160005Z-ob
j/home/builds/ab/netbsd-4-0-RELEASE/src/sys/arch/sgimips/compile/GENERI
C32_IP2x total memory = 256 MB
(768 KB reserved for ARCS)
avail memory = 245 MB
timeconter: Timecounters tick every 10.000 msec
mainbus0 (root): SGI-IP22 [SGI, 690ac9fb], 1 processor
cpu0 at mainbus0: MIPS R5000 CPU (0x2310) Rev. 1.0 with built-in FPU
Rev. 1.0 cpu0: 32KB/32B 2-way set-associative L1 Instruction cache, 48
TLB etries cpu0: 32KB/32B 2-way set-associative wrie-back L1 Data cach
cpu0: 512KB/32B direct-mapped write-through L2 Data cache
ioc0 at mainbus0 addr 0x1fbd9800: rev 0, machine Indy (Guiness), board
rev 0 int0 at mainbus0 addr 0x1fbd9880: bus 90MHz, CPU 180MHz
imc0 at mainbus0 addr 0x1fa00000: revision 3
gio0 at imc0
giopci0 at gio0 slot 1 addr 0x1f400000: Phobos G130 10/100 Ethernet
pci0 at giopci0 bus 0
pci0: memory space enabled
tlp0 at pci0 dev 0 function 0: DECchip 21143 Ethernet, pass 4.1
tlp0: interrupting at slot EXP0
tlp0: Ethernet address 00:60:f5:08:23:07
lxtphy0 at tlp0 phy 1: LXT970 10/100 media interface, rev. 3
lxtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
Synchronous ISDN (product 0x04 revision 0x00) at gio0 slot 0 addr
0x1f000000 not configured hpc0 at gio0 addr 0x1fb80000: SGI HPC3
zsc0 at hpc0 offset 0x59830
zstty0 at zsc0 channel 1 (console i/o)
zstty1 at zsc0 channel 0
pckbc0 at hpc0 offset 0x59840
sq0 at hpc0 offset 0x54000: SGI Seeq 80c03
sq0: Ethernet address 08:00:69:0a:c9:fb
wdsc0 at hpc0 offset 0x44000: WD33C93B revision 0, 10.0 MHz, SCSI ID 0
scsibus0 at wdsc0: 8 targets, 8 luns per target
dsclock0 at hpc0 offset 0x60000
pi1ppc0 at hpc0 offset 0x58000
pi1ppc0: capabilities=8<PS2>
ppbus0 at pi1ppc0
ppbus0: No IEEE1284 device found.
pi1ppc at hpc0 offset 0x59800 not configured
hpc1 at gio0 addr 0x1fb00000: SGI HPC3
sq at hpc1 offset 0x100 not configured
biomask 07 netmask 07 ttymask 0f clockmask bf
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "mips3_cp0_counter" frequency 90000000 Hz
quality 100 scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 2 lun 0: <IBM, DNES-309170, SAH0> disk fixed
sd0: 8748 MB, 11474 cyl, 5 head, 312 sec, 512 bytes/sect x 17916240
sectors sd0: sync (200.00ns offset 12), 8-bit (5.000MB/s) transfers,
tagged queueing boot device: sd0
root on sd0a dumps on sd0b
root file system type: ffs
WARNING: clock gained 3 days
WARNING: CHECK AND RESET THE DATE!
Wed Mar 26 08:10:22 GMT 2008
swapctl: adding /dev/sd0b as swap device at priority 0
Checking for botched superblock upgrades:bus error: cpu_stat 00000380
addr 17ac77e0, gio_stat 00000000 addr 1fbc4003 panic: cache error @ EPC
0x88069740 ErrCtl 0x0 CacheErr 0xa42c77e3 Stopped in pid 26.1 (cat) at
netbsd:cpu_Debugger+0x4: jr ra bdslot: nop
db> bt
cpu_Debugger+4 (97fff000,d,0,bfbd9830) ra 8823cdbc sz 0
panic+190 (97fff000,88069740,0,a42c77e3) ra 882c3ed4 sz 48
pmap_zero_page+bc (97fff000,88069740,0,a42c77e3) ra 881dabf4 sz 40
uvm_pagealloc_strat+1d4 (97fff000,88069740,0,0) ra 881cb058 sz 64
uvmfault_promote+7c (97fff000,88069740,ffffffff,0) ra 881cc5c8 sz 64
uvm_fault_internal+f98 (97dc5780,1000,1,0) ra 882c56b8 sz 296
trap+684 (ff13,1000,1,7df2fd2c) ra 882beb44 sz 112
mips3_UserGenException+cc (ff13,1000,1,7df2fd2c) ra 0 sz 0
User-level: pid 26.1
db> ps
PID PPID PGRP UID S FLAGS LWPS COMMAND WAIT
27 0 0 0 2 0x20200 1 physiod physiod
>26 23 6 0 2 0x4002 1 cat
25 23 6 0 2 0x4002 1 dd
24 23 6 0 2 0x4002 1 dd pipdwt
23 22 6 0 2 0x2 1 sh wait
22 16 6 0 2 0x2 1 sh wait
16 6 6 0 2 0x2 1 sh piperd
6 1 6 0 2 0x4002 1 sh wait
5 0 0 0 2 0x20200 1 aiodoned aiodone
4 0 0 0 2 0x20200 1 ioflush syncer
3 0 0 0 2 0x20200 1 pagedaemon pgdaemo
2 0 0 0 2 0x20200 1 scsibus0 sccomp
1 0 1 0 2 0x4000 1 init wait
0 -1 0 0 2 0x20200 1 swapper schedul
db>
Home |
Main Index |
Thread Index |
Old Index