System freeze on i386 (Mail from sendbug was rejected)
From: Manfred Wierich <wierich(at)trustcenter.de>
Date: Mon Jun 30 2003 - 13:54:06 EDT
>Submitter-Id: net
System : OpenBSD 3.3
Architecture: OpenBSD.i386
Machine : i386
several weeks ago under OpenBSD. When I had sent and fetched several hundreds
of thousands of mails I took it into production at our company. Embarrassingly
enough, it froze the next morning at 6 a.m. Then we switched back to our old
mailserver.
After some time I got the idea to stress-test the failed server and got a
program called "stress" from Freshmeat.net:

http://weather.ou.edu/[..]apw/projects/stress/stress-0.17.2.tar.gz

When I ran "stress -v --cpu 4 --hdd 4 --io 4 --vm 4" on the box, it crashed
after maybe 10 minutes. I thought that this was maybe due to raidconf, which
is not in GENERIC. I tried the same command on two other machines which are
the same hardware as the mailserver: a Suse Linux 7.3 and an OpenBSD 3.2.
The Linux system of course became very slow, but survived a whole weekend
without crashing; the OpenBSD box also became unresponsive after a short time.

To be sure, I did a default install from CD, this time 3.3, on different
hardware. It had the same problem.

Also, if you run "stress --cpu 4 --hdd 4" as a normal user you can easily use
up all system resources. "top" won't work anymore, "stress" can't be
interrupted by pressing ^C, ssh connections are dropped, and root can't log in
on another console. If I wait for some 5 minutes, the system crashes:

_Debugger(0.d08db000.d094e06c.d029b1bc,3f006ed5) at _Debugger+0x4
_panic(d029a2b8.595c98.2000.d02ad0e0.0) at _panic+0x81
_ffs_reallocblks(db38ad1c.a80b.2000.ffffffff.d094e068) at _ffs_reallocblks+0xc3
_VOP_REALLOCBLKS(db2d4cac,094e040.db38ad88.d01f31d0.db2d4cac.db2d592c.d38f9388.14) at _VOP_REALLOCBLKS+0x2e
_cluster_write(d38f9388.db2d592c.1501a000.0.db2d4cac) at _cluster_write+0x130
_ffs_write(db38ae44,3000042,db383280,d095c420,fffff) at _ffs_write+0x3df
_VOP_WRITE(db2d4cac,db38aed8,da8db0f0,db38af88) at _VOP_WRITE+0x3b
_vn_write(db2a4960,db2a497c,db38aed8,da8db0f0,db38af88) at _vn_write+0x94
_dofilewrite(db383280,db38af88,db2a4960,cfafd894,fffff) at _dofilewrite+0x81
_sys_write(db383280,db38af88,db38af7c,d03028db,0) at _sys_write+0x47
_syscall() at _syscall+0x26d
--- syscall (number 4) ---
0x400a39f3

ddb> ps
  PID   PPID   PGRP   UID  S  FLAGS  WAIT       COMMAND
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress
31063  17612  19612  1000  3  0x6    getnewbuf  stress

As I was playing around a bit further

dmesg of this box (running Version 3.3):

OpenBSD 3.3 (GENERIC) #44: Sat Mar 29 13:22:05 MST 2003
deraadt@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel Pentium III (Coppermine) ("GenuineIntel" 686-class) 665 MHz
cpu0:
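
For readers who want to summarize a longer ddb "ps" listing than the one above, here is a minimal Python sketch that tallies processes by command and wait channel. It is not part of the original report: the function name wait_channels and the variable DDB_PS are made up for illustration, and the rows are copied from the listing above. It assumes every row has all eight columns; rows with an empty WAIT column are skipped. As far as I know, "getnewbuf" is the wait message of processes sleeping for a free buffer in the buffer cache, but treat that reading as an assumption.

```python
from collections import Counter

# Rows copied from the ddb "ps" listing in this report.
DDB_PS = """\
PID PPID PGRP UID S FLAGS WAIT COMMAND
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
31063 17612 19612 1000 3 0x6 getnewbuf stress
"""


def wait_channels(listing):
    """Count (command, wait channel) pairs in a ddb ps listing."""
    counts = Counter()
    lines = listing.strip().splitlines()
    header = lines[0].split()
    for line in lines[1:]:
        fields = line.split()
        if len(fields) != len(header):
            # Row with an empty WAIT column or wrapped text: skip it.
            continue
        row = dict(zip(header, fields))
        counts[(row["COMMAND"], row["WAIT"])] += 1
    return counts


if __name__ == "__main__":
    for (command, wchan), n in wait_channels(DDB_PS).most_common():
        print(f"{n:3d} {command} waiting on {wchan}")
```

On the listing above, every row falls into the same bucket (stress, getnewbuf), which matches the impression that all the workers were blocked in the same place when the panic hit.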

dmesg for the other boxens, the mailserver, the 3.2 system and the SuSe Linux system (they're all the same hardware, maybe except NICs):

OpenBSD 3.2-current (GENERIC) #1: Fri Oct 11 12:51:22 CEST 2002
root@warthog:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel Pentium III (Coppermine) ("GenuineIntel" 686-class) 930 MHz
cpu0:
pcibios0: PCI IRQ Routing Table rev. 1.0 @ 0xf2c40/224 (12 entries)
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82371FB PCI-ISA" rev 0x00)
pcibios0: PCI bus #2 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xc8000/0x800 0xc8800/0x800
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 82815 Hub" rev 0x02
ppb0 at pci0 dev 1 function 0 "Intel 82815 AGP" rev 0x02
pci1 at ppb0 bus 1
vga1 at pci1 dev 0 function 0 "ATI Rage Fury" rev 0x00
wsdisplay0 at vga1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
ppb1 at pci0 dev 30 function 0 "Intel 82801BA AGP" rev 0x02
pci2 at ppb1 bus 2
xl0 at pci2 dev 9 function 0 "3Com 3c905C 100Base-TX" rev 0x78: irq 3xl0: command never completed!
xl0: command never completed!
xl0: command never completed!
address 00:04:76:35:f4:73
exphy0 at xl0 phy 24: 3Com internal media interface
xl0: command never completed!
xl0: command never completed!
xl0: command never completed!
xl1 at pci2 dev 11 function 0 "3Com 3c905C 100Base-TX" rev 0x78: irq 11xl1: command never completed!
xl1: command never completed!
xl1: command never completed!
address 00:04:76:36:29:d1
exphy1 at xl1 phy 24: 3Com internal media interface
xl1: command never completed!
xl1: command never completed!
xl1: command never completed!
pcib0 at pci0 dev 31 function 0 "Intel 82801BA LPC" rev 0x02
pciide0 at pci0 dev 31 function 1 "Intel 82801BA IDE" rev 0x02: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility
wd0 at pciide0 channel 0 drive 0: <Maxtor 5T020H2>
wd0: 16-sector PIO, LBA, 19073MB, 16383 cyl, 16 head, 63 sec, 39062500 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
atapiscsi0 at pciide0 channel 1 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <Lite-On, LTN486 48x Max, YD01> SCSI0 5/cdrom removable
cd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
uhci0 at pci0 dev 31 function 2 "Intel 82801BA USB" rev 0x02: irq 10
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: vendor 0x0000 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
"Intel 82801BA SMBus" rev 0x02 at pci0 dev 31 function 3 not configured
isa0 at pcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
sysbeep0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask c440 netmask cc48 ttymask ccca
pctr: 686-class user-level performance counters enabled
mtrr: Pentium Pro MTRR support
dkcsum: wd0 matched BIOS disk 80
root on wd0a
rootdev=0x0 rrootdev=0x300 rawdev=0x302
WARNING: / was not properly unmounted
free inode //3607730 had 163840 blocks
dev = 0x0, block = 2664, fs = /
panic: ffs_blkfree: freeing free block
OpenBSD 3.2-current (GENERIC) #1: Fri Oct 11 12:51:22 CEST 2002
root@warthog:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel Pentium III (Coppermine) ("GenuineIntel" 686-class) 930 MHz
cpu0:
pcibios0: PCI IRQ Routing Table rev. 1.0 @ 0xf2c40/224 (12 entries)
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82371FB PCI-ISA" rev 0x00)
pcibios0: PCI bus #2 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xc8000/0x800 0xc8800/0x800
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 82815 Hub" rev 0x02
ppb0 at pci0 dev 1 function 0 "Intel 82815 AGP" rev 0x02
pci1 at ppb0 bus 1
vga1 at pci1 dev 0 function 0 "ATI Rage Fury" rev 0x00
wsdisplay0 at vga1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
ppb1 at pci0 dev 30 function 0 "Intel 82801BA AGP" rev 0x02
pci2 at ppb1 bus 2
xl0 at pci2 dev 9 function 0 "3Com 3c905C 100Base-TX" rev 0x78: irq 3xl0: command never completed!
xl0: command never completed!
xl0: command never completed!
address 00:04:76:35:f4:73
exphy0 at xl0 phy 24: 3Com internal media interface
xl0: command never completed!
xl0: command never completed!
xl0: command never completed!
xl1 at pci2 dev 11 function 0 "3Com 3c905C 100Base-TX" rev 0x78: irq 11xl1: command never completed!
xl1: command never completed!
xl1: command never completed!
address 00:04:76:36:29:d1
exphy1 at xl1 phy 24: 3Com internal media interface
xl1: command never completed!
xl1: command never completed!
xl1: command never completed!
pcib0 at pci0 dev 31 function 0 "Intel 82801BA LPC" rev 0x02
pciide0 at pci0 dev 31 function 1 "Intel 82801BA IDE" rev 0x02: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility
wd0 at pciide0 channel 0 drive 0: <Maxtor 5T020H2>
wd0: 16-sector PIO, LBA, 19073MB, 16383 cyl, 16 head, 63 sec, 39062500 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
atapiscsi0 at pciide0 channel 1 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <Lite-On, LTN486 48x Max, YD01> SCSI0 5/cdrom removable
cd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
uhci0 at pci0 dev 31 function 2 "Intel 82801BA USB" rev 0x02: irq 10
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: vendor 0x0000 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
"Intel 82801BA SMBus" rev 0x02 at pci0 dev 31 function 3 not configured
isa0 at pcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
sysbeep0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask c440 netmask cc48 ttymask ccca
pctr: 686-class user-level performance counters enabled
mtrr: Pentium Pro MTRR support
dkcsum: wd0 matched BIOS disk 80
root on wd0a
rootdev=0x0 rrootdev=0x300 rawdev=0x302

Received on Mon Jun 30 14:09:18 2003
This archive was generated by hypermail 2.1.8 : Wed Aug 23 2006 - 13:29:59 EDT