RE: hang with raid, postgresql

From: Don Bowman (don_at_sandvine.com)
Date: 05/30/04

    To: 'Doug White' <dwhite@gumbysoft.com>, Don Bowman <don@sandvine.com>
    Date: Sun, 30 May 2004 16:19:38 -0400
    
    

    From: Doug White [mailto:dwhite@gumbysoft.com]
    > On Sun, 30 May 2004, Don Bowman wrote:
    >
    > >
    > > I have a system with 2x 2.8GHz XEON (P4), intel e7501 chipset,
    > > 4GB of ram, aac [adaptec 2200s] raid with 4 scsi
    > > disks. I have also tried asr (adaptec 2015).
    > > I have tried two different motherboards.
    > > The only application the machine runs is postgresql,
    > > with ~30 databases and ~250GB of data.
    > >
    > > I'm finding the machine locks up solid once a day
    > > or so (sometimes more, sometimes less, no pattern
    > > of time of day). I know it's not a hardware issue, it
    > > is reliable with FreeBSD 4.7. I've run through memory
    > > test, disk test, etc.
    > >
    > > There appears to be a correlation between
    > > disk activity (postgresql vacuum) and the lockup,
    > > but I can't be sure.
    >
    > Temperature?
    >
    > What motherboard is it exactly?

    lmmon shows the mobo temperature @ 28C. It is in
    an AC-controlled environment (~20C ambient). The system
    has 6 blower fans, ducted over the CPUs, with the
    copper heat sinks designed for the 3.2GHz XEON.
    It has 3 power supplies, each with separate AC
    inlet, fed from a UPS with filtered power.
    It should have ~150% airflow redundancy, and
    ~200% power redundancy.
    This is a Supermicro X5DPE motherboard.
    http://www.supermicro.com/products/chassis/3U/933/SC933S2-R760.cfm
    shows the system.
    It was tested for ~1 week with FreeBSD 4.7
    at temperature in an environmental chamber,
    including cycling into memtest86 every 2 hours.

    I've been battling this hang for ~6 weeks; this is
    a swap-out of all the hardware (new system).

    --don
    _______________________________________________
    freebsd-current@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-current
    To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"

