Re: system hangup - I'm lost



On Tue, Sep 30, 2008 at 12:39:27PM +0200, Bartosz Stec wrote:
Oliver Lehmann wrote:
Hi,

My fileserver hangs sporadically while running 6.3:

FreeBSD 6.3-STABLE #0: Thu Jun 19 00:21:00 CEST 2008
olivleh1@xxxxxxxxxxxxxxxxxxxxxxxx:/usr/obj/i386-pentium3-6.3/usr/src/sys/NUDEL

The exact release doesn't matter, since it has happened before as well.
It always happens after the system has been under load for some time
(I'm building ports with tinderbox, and during the build process it
just hangs).

The system writes nothing to the console, neither on the CRT nor on
the serial console.

The system itself is:

CPU: Intel Pentium III (845.64-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x683 Stepping = 3
Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory = 805240832 (767 MB)
avail memory = 778481664 (742 MB)
ACPI APIC Table: <Intel N440BX >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
cpu0 (BSP): APIC ID: 1
cpu1 (AP): APIC ID: 0
ioapic0 <Version 1.1> irqs 0-23 on motherboard

while the disk space is provided by a 3ware RAID:

twa0: <3ware 9000 series Storage Controller> port 0x2400-0x24ff mem 0xf4101000-0xf41010ff,0xf4800000-0xf4ffffff irq 18 at device 11.0 on pci0
twa0: INFO: (0x04: 0x0053): Battery capacity test is overdue:
twa0: INFO: (0x15: 0x1300): Controller details:: Model 9500S-4LP, 4 ports, Firmware FE9X 2.08.00.009, BIOS BE9X 2.03.01.052

da0 at twa0 bus 0 target 0 lun 0
da0: <AMCC 9500S-4LP DISK 2.08> Fixed Direct Access SCSI-3 device
da0: 100.000MB/s transfers
da0: 715224MB (1464778752 512 byte sectors: 255H 63S/T 91178C)

In the past I sometimes found messages left behind which indicated,
if I recall correctly, that the system was not able to allocate swap
space fast enough (_not_ out of swap space!), but the RAID is kinda
fast imho.

Any idea what I could do to shed some more light on this behaviour?
Why is it happening and what is really causing it?
Would enabling the kernel debugger really help here? I mean the system
is really hanging - apart from answering pings, it does not respond to
anything except the reset switch ;)

Greetings, Oliver



Personally I'd rather bet on some hardware problem (overheating?). Try
installing mbmon from ports. I also had similar problems with old
motherboards with swollen capacitors.

Be careful with mbmon and healthd -- just because they compile and run
does not mean they're working properly (the values shown may be
completely unreliable/incorrect).

It's best to check such things in the system BIOS, unless you have
absolute certainty that your motherboard is supported by mbmon/healthd.
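
As a rough illustration of the caution above, here is a minimal Python sketch of a plausibility check for sensor readings. It does not call mbmon or healthd and does not parse their output. The reading names, the sample values, and the ranges are all hypothetical assumptions chosen for illustration, not values from any board's documentation.

```python
# Plausibility ranges are illustrative assumptions, not vendor specifications.
# The reading names are hypothetical and not taken from mbmon or healthd.
PLAUSIBLE_RANGES = {
    "cpu_temp_c": (10.0, 100.0),   # degrees Celsius
    "fan_rpm": (500.0, 10000.0),   # revolutions per minute
    "vcore_v": (0.8, 2.5),         # volts
}


def implausible_readings(readings, ranges=PLAUSIBLE_RANGES):
    """Return the sorted names of readings outside their assumed range.

    Readings without a configured range are skipped, because nothing can
    be said about them either way.
    """
    suspect = []
    for name, value in readings.items():
        bounds = ranges.get(name)
        if bounds is None:
            continue
        low, high = bounds
        if not (low <= value <= high):
            suspect.append(name)
    return sorted(suspect)


if __name__ == "__main__":
    # Made-up sample: a CPU temperature of 0.0 C on a running system
    # suggests the sensor is not actually being read correctly.
    sample = {"cpu_temp_c": 0.0, "fan_rpm": 4500.0, "vcore_v": 1.7}
    print(implausible_readings(sample))
```

A reading that passes such a check is still not proven correct. Comparing against the values shown in the system BIOS remains the more reliable test.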

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

_______________________________________________
freebsd-stable@xxxxxxxxxxx mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@xxxxxxxxxxx"


