Subject: Re: kern/32162: [netbsd-3.0] kernel dead-lock in MP system
To: None <kern-bug-people@netbsd.org, gnats-admin@netbsd.org,>
From: Andreas Wrede <andreas@planix.com>
List: netbsd-bugs
Date: 11/27/2005 00:40:02
The following reply was made to PR kern/32162; it has been noted by GNATS.

From: Andreas Wrede <andreas@planix.com>
To: Manuel Bouyer <bouyer@antioche.eu.org>
Cc: gnats-bugs@NetBSD.org, kern-bug-people@NetBSD.org,
	gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: kern/32162: [netbsd-3.0] kernel dead-lock in MP system
Date: Sat, 26 Nov 2005 19:39:25 -0500

 
 
 On Nov 26, 2005, at 18:08 , Manuel Bouyer wrote:
 
 > On Sat, Nov 26, 2005 at 05:18:40PM -0500, Andreas Wrede wrote:
 >>
 >> On Nov 26, 2005, at 15:29 , Manuel Bouyer wrote:
 >>
 >>> On Fri, Nov 25, 2005 at 03:13:00AM +0000, Andreas Wrede wrote:
 >>>>> Environment:
 >>>> 	
 >>>> 	
 >>>> System: NetBSD whome.planix.com 3.0_RC3 NetBSD 3.0_RC3
 >>>> (PLANIX.MPACPI) #0: Thu Nov 24 20:57:09 EST 2005
 >>>> root@whome.planix.com:/u1/netbsd-3.0/src/sys/arch/i386/compile/
 >>>> obj.i386/PLANIX.MPACPI i386
 >>>> Architecture: i386
 >>>> Machine: i386
 >>>>> Description:
 >>>> 	Over the last week I have experienced 3 kernel dead-locks on a
 >>>> NetBSD 3.0_RC1/2/3 system.
 >>>> The motherboard is a Tyan K8S Pro S2882G3NR with 2 AMD Opteron
 >>>> 244 CPUs installed. The kernel
 >>>> differs from GENERIC.MPACPI in the value for some SYSVSEM
 >>>> variables, maxusers and some
 >>>> other variables.
 >>>
 >>> Can you try a kernel with DIAGNOSTIC, DEBUG and LOCKDEBUG ?
 >>
 >> Right now, I am running with LOCKDEBUG. I will add DIAGNOSTIC and  
 >> DEBUG.
 >
 > Yes, if you have the problem I'm thinking about, it will only be
 > detected if you have DIAGNOSTIC. But LOCKDEBUG and DEBUG can't hurt,
 > maybe these will catch something else.
 >
 >>
 >> Not knowing much about kernel debugging, and since creating a core
 >> dump is not possible,
 >
 > Why ? Have you tried reboot(0x104) ?
 
 Yes - it locks the machine completely so that I cannot re-enter the  
 debugger.
 >
 >> what commands should I run the next time the
 >> dead-lock occurs?
 >
 > I can't see anything more than what you have provided for now ...
 
 OK.
 
 -- 
      aew
 
 