NetBSD-Bugs archive
Re: kern/39242: NetBSD 4.0 will start to busy-loop and hang on machines with more than 4 GB memory
Hi - once again ...
I've now removed 4 GB of memory and the SCSI controller seems to work, but ...
kernel: protection fault trap, code=0
Stopped in pid 28.1 (aiodoned) at netbsd:uvm_tree_RB_REMOVE+0x50:
movq %r14,0x10(%r15)
db{0}> trace
uvm_tree_RB_REMOVE() at netbsd:uvm_tree_RB_REMOVE+0x50
uvm_rb_remove() at netbsd:uvm_rb_remove+0x1c
uvm_unmap_remove() at netbsd:uvm_unmap_remove+0x179
uvm_pagermapout() at netbsd:uvm_pagermapout+0x110
uvm_aio_aiodone() at netbsd:uvm_aio_aiodone+0xc4
uvm_aiodone_daemon() at netbsd:uvm_aiodone_daemon+0xd2
db{0}>
At the time of the crash, top reports:
load averages: 1.03, 0.47, 0.19    up 0 days, 0:38    12:27:08
67 processes: 1 runnable, 64 sleeping, 2 on processor
CPU0 states: 0.0% user, 0.0% nice, 11.9% system, 2.5% interrupt, 85.6% idle
CPU1 states: 0.0% user, 0.0% nice, 3.0% system, 0.0% interrupt, 97.0% idle
Memory: 2565M Act, 1253M Inact, 8924K Wired, 6296K Exec, 1316M File, 228K Free
Swap: 24G Total, 9435M Used, 15G Free
PID USERNAME PRI NICE SIZE RES STATE TIME WCPU CPU COMMAND
10722 root -5 0 868K 1792K biowai/0 0:13 78.93% 7.52% tar
23 root -6 0 0K 28M RUN/1 0:10 5.37% 5.37% [raidio0]
26 root -18 0 0K 28M pgdaem/0 0:04 1.71% 1.71% [pagedaemon]
28 root -18 0 0K 28M aiodon/0 0:04 1.17% 1.17% [aiodoned]
21 root -6 0 0K 28M raidio/1 0:05 1.07% 1.07% [raidio1]
9957 root 28 0 120K 820K CPU/1 0:00 0.00% 0.98% cp
22 root -6 0 0K 28M rfwcon/0 0:01 0.68% 0.68% [raid0]
20 root -6 0 0K 28M rfwcon/0 0:00 0.05% 0.05% [raid1]
9 root -6 0 0K 28M sccomp/1 0:02 0.00% 0.00% [scsibus0]
198 ncadmin 28 0 572K 1588K CPU/0 0:00 0.00% 0.00% top
27 root 18 0 0K 28M syncer/0 0:00 0.00% 0.00% [ioflush]
816 root 18 0 988K 4188K pause/0 0:00 0.00% 0.00% ntpd
9303 wgstuken 18 0 260K 1168K pause/1 0:00 0.00% 0.00% <csh>
10024 root 18 0 236K 1144K pause/1 0:00 0.00% 0.00% <csh>
1515 root 18 0 240K 1088K pause/1 0:00 0.00% 0.00% <csh>
18 root 14 0 0K 28M crypto/0 0:00 0.00% 0.00% [cryptoret]
13 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb7]
12 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb6]
11 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb5]
10 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb4]
571 root 10 0 0K 28M nfsidl/0 0:00 0.00% 0.00% [nfsio]
570 root 10 0 0K 28M nfsidl/0 0:00 0.00% 0.00% [nfsio]
545 root 10 0 0K 28M nfsidl/0 0:00 0.00% 0.00% [nfsio]
499 root 10 0 0K 28M nfsidl/0 0:00 0.00% 0.00% [nfsio]
3 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb0]
4 root 10 0 0K 28M usbtsk/0 0:00 0.00% 0.00% [usbtask-hc]
5 root 10 0 0K 28M usbtsk/0 0:00 0.00% 0.00% [usbtask-dr]
6 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb1]
7 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb2]
8 root 10 0 0K 28M usbevt/0 0:00 0.00% 0.00% [usb3]
10303 root 10 0 484K 2768K wait/0 0:00 0.00% 0.00% <login>
195 root 10 0 620K 1944K wait/1 0:00 0.00% 0.00% <login>
1566 root 10 0 476K 1936K wait/0 0:00 0.00% 0.00% <login>
196 ncadmin 10 0 284K 1048K wait/0 0:00 0.00% 0.00% <sh>
1443 root 10 0 268K 880K nanosl/1 0:00 0.00% 0.00% <cron>
1 root 10 0 100K 812K wait/0 0:00 0.00% 0.00% <init>
557 root 2 0 1060K 3988K select/0 0:00 0.00% 0.00% <amd>
10740 root 2 0 492K 1536K poll/0 0:00 0.00% 0.00% <rlogind>
872 root 2 0 344K 1456K select/0 0:00 0.00% 0.00% <sshd>
1385 postfix 2 0 628K 1216K kqread/0 0:00 0.00% 0.00% <qmgr>
There seems to be a bigger problem in UVM, as I suspected before!
db{0}> show uvmexp
Current UVM status:
pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12
1018418 VM pages: 656891 active, 320807 inactive, 2230 wired, 5 free
pages 634067 anon, 344367 file, 1574 exec
freemin=64, free-target=85, wired-max=339472
faults=3609406, traps=3722187, intrs=7567575, ctxswitch=9890365
softint=380503, syscalls=14268860, swapins=245, swapouts=270
fault counts:
noram=24, noanon=0, pgwait=24, pgrele=0
ok relocks(total)=1148(1150), anget(retrys)=114762(394), amapcopy=55274
neighbor anon/obj pg=115362/837538, gets(lock/unlock)=198035/756
cases: anon=77487, anoncow=37248, obj=164994, prcopy=33039, przero=98761
daemon and swap counts:
woke=15991, revs=10979, scans=2430998, obscans=2420929, anscans=2524
busy=0, freed=2423039, reactivate=730, deactivate=2791838
pageouts=156825, pending=2266288, nswget=297605
nswapdev=1, swpgavail=6291455
swpages=6291455, swpginuse=2422999, swpgonly=2125395, paging=78
db{0}> ps
PID PPID PGRP UID S FLAGS LWPS COMMAND WAIT
9957 1515 9957 0 2 0x4002 1 cp uvn_fp1
10722 10024 10722 0 2 0x4002 1 tar
10024 9303 10024 0 2 0x4002 1 csh pause
9303 10303 9303 1002 2 0x4002 1 csh pause
10303 10740 10303 0 2 0x4103 1 login wait
10740 1262 1262 0 2 0x4100 1 rlogind poll
198 196 198 500 2 0x4002 1 top poll
196 195 196 500 2 0x4002 1 sh wait
195 194 195 0 2 0x4103 1 login wait
194 1262 1262 0 2 0x4100 1 rlogind poll
1515 1566 1515 0 2 0x4002 1 csh pause
1566 1 1566 0 2 0x4102 1 login wait
1443 1 1443 0 2 0 1 cron nanosle
1262 1 1262 0 2 0 1 inetd kqread
1385 1263 1263 12 2 0x4108 1 qmgr kqread
950 1263 1263 12 2 0x4108 1 pickup kqread
1263 1 1263 0 2 0x4108 1 master kqread
872 1 872 0 2 0 1 sshd select
816 1 816 0 2 0 1 ntpd pause
757 1 757 0 2 0 1 lpd poll
631 1 631 0 2 0 1 rpc.lockd select
672 1 672 0 2 0xa0008 1 rpc.statd select
682 614 614 0 2 0 1 nfsd nfsd
677 614 614 0 2 0 1 nfsd nfsd
637 614 614 0 2 0 1 nfsd nfsd
671 614 614 0 2 0 1 nfsd nfsd
614 1 614 0 2 0 1 nfsd poll
615 1 615 0 2 0 1 mountd select
571 0 0 0 2 0x20200 1 nfsio nfsidl
570 0 0 0 2 0x20200 1 nfsio nfsidl
545 0 0 0 2 0x20200 1 nfsio nfsidl
499 0 0 0 2 0x20200 1 nfsio nfsidl
557 1 557 0 2 0 1 amd select
505 1 505 0 2 0 1 ypbind select
497 1 497 0 2 0 1 rpcbind poll
574 1 574 0 2 0 1 syslogd kqread
248 1 248 0 2 0 1 routed select
108 0 0 0 2 0x20200 1 physiod physiod
>28 0 0 0 2 0x20200 1 aiodoned
27 0 0 0 2 0x20200 1 ioflush syncer
26 0 0 0 2 0x20200 1 pagedaemon pgdaemo
25 0 0 0 2 0x20200 1 raidio2 raidiow
24 0 0 0 2 0x20200 1 raid2 rfwcond
23 0 0 0 2 0x20200 1 raidio0
22 0 0 0 2 0x20200 1 raid0
21 0 0 0 2 0x20200 1 raidio1
20 0 0 0 2 0x20200 1 raid1 rfwcond
19 0 0 0 2 0x20200 1 atapibus0 sccomp
18 0 0 0 2 0x20200 1 cryptoret crypto_
17 0 0 0 2 0x20200 1 atabus3 atath
16 0 0 0 2 0x20200 1 atabus2 atath
15 0 0 0 2 0x20200 1 atabus1 atath
14 0 0 0 2 0x20200 1 atabus0 atath
13 0 0 0 2 0x20200 1 usb7 usbevt
12 0 0 0 2 0x20200 1 usb6 usbevt
11 0 0 0 2 0x20200 1 usb5 usbevt
10 0 0 0 2 0x20200 1 usb4 usbevt
9 0 0 0 2 0x20200 1 scsibus0 sccomp
8 0 0 0 2 0x20200 1 usb3 usbevt
7 0 0 0 2 0x20200 1 usb2 usbevt
6 0 0 0 2 0x20200 1 usb1 usbevt
5 0 0 0 2 0x20200 1 usbtask-dr usbtsk
4 0 0 0 2 0x20200 1 usbtask-hc usbtsk
3 0 0 0 2 0x20200 1 usb0 usbevt
2 0 0 0 2 0x20200 1 sysmon smtaskq
1 0 1 0 2 0x4001 1 init wait
0 -1 0 0 2 0x20200 1 swapper schedul
db{0}>
At the time of the crash "tar" is extracting an archive into one
filesystem and "cp" is copying a large file into another filesystem -
both on the same raid-device (raid1 on SATA) - but that has caused no
problem in the past.
The source of both commands is in /tmp (tmpfs) - ca. 11 GB resides in /tmp.
Is tmpfs known to be stable? Or may the problem be there?
Any Idea how to debug further on?
I will see if I'm able to replace tmpfs by a real filesystem and try
again ...
By the way: I've noticed another problem in uvm_pglistalloc_simple()
while thinking about the strategy used there.
If several processes together try to allocate more memory than the
system has in total, the strategy used there may deadlock the whole system.
If, for example, 30% of memory has already been granted to each of three
processes that each request 50% of the whole memory, it will be impossible
for any of them to complete, because the required memory is held by the
other waiting processes and cannot be stolen back from them (there is no
way to manipulate the local variable "num" from outside ...). It would be
possible to satisfy the request of the first one, and after it has freed
its memory again, the next one, and so on. Therefore a way is needed to
"steal" the memory back from waiting processes.
This would be a very rare and strange situation, but it is not
detected by the system at the moment. I'm not sure whether there is an
inexpensive and easy way to detect such a situation, but this "possible"
problem should at least be documented in a comment in the source file.
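To make the scenario concrete, here is a minimal userland simulation of
the allocation strategy as I understand it (my own sketch with made-up
names such as TOTAL_PAGES and held[] - not the actual
uvm_pglistalloc_simple() code): each requester keeps the pages it has
already been granted while sleeping for the rest, so once the free pool
is exhausted no waiter can ever finish.

/*
 * Simulated deadlock: three requesters each want 50% of memory and
 * have each already been granted 30%, as in the example above.
 * This is an illustrative sketch, not kernel code.
 */
#include <stdio.h>

#define TOTAL_PAGES 100
#define NPROC 3

int
main(void)
{
	int want = TOTAL_PAGES / 2;		/* each requests 50% */
	int held[NPROC] = { 30, 30, 30 };	/* 30% already granted each */
	int free_pages = TOTAL_PAGES - 3 * 30;	/* only 10% still free */
	int progress = 1;

	while (progress) {
		progress = 0;
		for (int i = 0; i < NPROC; i++) {
			/* grant whatever is free, up to what is still missing */
			int grant = want - held[i];
			if (grant > free_pages)
				grant = free_pages;
			if (grant > 0) {
				held[i] += grant;
				free_pages -= grant;
				progress = 1;
			}
			if (held[i] >= want) {
				printf("process %d completed\n", i);
				return 0;
			}
		}
	}

	/*
	 * No process can ever complete: all memory is held by sleeping
	 * requesters and there is no way to take it back from them.
	 */
	printf("deadlock: free=%d, held=%d/%d/%d, nobody reaches %d pages\n",
	    free_pages, held[0], held[1], held[2], want);
	return 1;
}

Serving one request completely before starting the next (or being able to
take pages back from sleeping waiters) would let the first process finish,
free its pages again, and unblock the others.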
best regards
W. Stukenbrock
PS. I've removed the previous mail content - it is in the GNATS system
and can be reviewed there.