Port-alpha archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: Some alpha problems in current



In article <20080208.091951.116646721.jarle%uninett.no@localhost>,
Jarle Greipsland  <jarle%uninett.no@localhost> wrote:
>"Michael L. Hitch" <mhitch%NetBSD.org@localhost> writes:
>> On Wed, 6 Feb 2008, Jarle Greipsland wrote:
>> >>    The machine survived 16 hours (until 04:55), at which point it paniced
>> >> with "fpsave ipi didn't".  CPU 1 was hung and I couldn't get a backtrace
>> >> from it.  This would also explain the panic - CPU 0 is waiting for the
>> >> other CPU to process the fpusave ipi, but it's probably spinning somewhere
>> >> and unable to process the ipi.  This type of panic used to be fairly
>> >> frequent before, but I was able to track down the deadlock at the time and
>> >> that particular problem was fixed.  It looks like it may be back again.
>> > This is all very similar to the problems I reported in
>> > port-alpha/37712.
>> 
>>    I'd say it's the same problem.
>> 
>>    I know what's happening, but not why or how to fix it.
>Probably unrelated, but the timekeeping on my CS20 is also quite
>wacky.  Shortly after booting a 4.99.53 -current kernel, the NTP
>time synchronization can no longer keep the time in sync with the
>NTP servers used.  Also, I am quite puzzled by the following
>output:
># while true; do date; sleep 1; done
>Thu Feb  7 13:09:54 CET 2008
>Thu Feb  7 13:09:55 CET 2008
>Thu Feb  7 13:09:55 CET 2008
>Thu Feb  7 13:09:56 CET 2008
>Thu Feb  7 13:09:56 CET 2008
>Thu Feb  7 13:09:57 CET 2008
>Thu Feb  7 13:09:57 CET 2008
>Thu Feb  7 13:09:58 CET 2008
>Thu Feb  7 13:09:58 CET 2008
>Thu Feb  7 13:09:59 CET 2008
>Thu Feb  7 13:09:59 CET 2008
>Thu Feb  7 13:10:00 CET 2008
>Thu Feb  7 13:10:00 CET 2008
>Thu Feb  7 13:10:01 CET 2008
>Thu Feb  7 13:10:01 CET 2008
>
>The wall time seems to advance slower by a factor of 2 compared
>to whatever timer the sleep call uses.
>
>This was all with the time settings the kernel arrived on by
>itself:
>kern.timecounter.choice = PCC(q=1000, f=1666149600 Hz)
>clockinterrupt(q=0, f=1024 Hz) dummy(q=-1000000, f=1000000 Hz)
>kern.timecounter.hardware = PCC
>kern.timecounter.timestepwarnings = 0
>
>I then switched the timecounter source to clockinterrupt, and
>immediately the time keeping started behaving sanely again:
># while true; do date; sleep 1; done
>Thu Feb  7 13:10:42 CET 2008
>Thu Feb  7 13:10:43 CET 2008
>Thu Feb  7 13:10:44 CET 2008
>Thu Feb  7 13:10:45 CET 2008
>
>I notice that the kernel thinks that the frequency of the PCC is
>1666149600, while I seem to recall that the marketing blurb for
>the CS20 claimed that the 21264 CPUs were clocked at 833MHz.  Of
>course there may be some clock doubling going on somewhere, but
>the factor of 2 made me a bit curious.

Yes, time is completely broken even with 4.99.48 :-)

christos




Home | Main Index | Thread Index | Old Index