RE: kern/59719 Re: 4.9 Stable Crashes on SuperMicro with SMP

From: Don Bowman (don_at_sandvine.com)
Date: 11/29/03

  • Next message: Don Bowman: "RE: Time jumping on both 4.x and 5.x ..."
    To: 'Uwe Doering' <gemini@geminix.org>, freebsd-gnats-submit@FreeBSD.org
    Date: Sat, 29 Nov 2003 11:33:58 -0500
    
    

    From: Uwe Doering [mailto:gemini@geminix.org]
    > Jonathan Gilpin wrote:
    > > I've run memtest (memtest86.com) kindly provided by Don and
    > it passed all
    > > the tests. I've installed installed a kernel module to test
    > for memory
    > > errors and found that again no memory errors are found...
    > So this means it's
    > > either a problem with the CPU's or a geniune bug in the
    > kernel. (bugger!)
    >
    > No, that's unfortunately not what it means. If a memory test
    > fails you
    > can draw the conclusion that you have bad memory, but this
    > doesn't work
    > the other way round. If a memory test passes there is still a
    > possibility that a memory chip is the culprit since memory
    > test software
    > cannot find all errors.
    >
    > Also, there is the chip set on the mainboard that coordinates
    > bus access
    > etc. for the two CPUs. Mainboard and chip set developers are
    > known to
    > make errors, too. In this case you would have to swap the entire
    > mainboard, possible with one from a different manufacturer.
    > I can tell
    > you from my own experience that it is really hard to find reliable PC
    > hardware these days, in light of ever shorter and faster
    > product release
    > cycles.

    I have several hundred of the motherboard the poster is using,
    and it works reliably with MP operation with 4.X.
    The memtest86 that i sent him understands the ECC registers
    on the e7501 MCH, it should find all correctable and uncorrectable
    errors.

    --don
    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  • Next message: Don Bowman: "RE: Time jumping on both 4.x and 5.x ..."

    Relevant Pages

    • Re: kern/59719 Re: 4.9 Stable Crashes on SuperMicro with SMP
      ... I've installed installed a kernel module to test for memory ... there is the chip set on the mainboard that coordinates bus access ...
      (freebsd-stable)
    • Re: Large array in KVM
      ... I'm working on a kernel module that needs to maintain a large ... Could using a memory-mapped file help? ... a process that maps a large file into memory, ... which means a dynamically-allocated syscall via ...
      (freebsd-hackers)
    • Re: Memory testing in a kernel thread
      ... > Here is my first attempt at a kernel module for testing memory: ... > offline for hours or even days running memtest86. ... I wrote `burnBX` and later `burnMMX` as userland pgms to very ...
      (comp.os.linux.development.system)
    • Re: copy-on-write anonymous memory?
      ... is it possible to create a memory object that represents ... Then I pass the addr to the user space, ... I have created a kernel module that stores references to ... that memory area should create a shadow object. ...
      (freebsd-hackers)
    • How to use vmalloc form kernel module
      ... I read that source from Kasper Dupont, showing how to read files from within a ... kernel module. ... I used vmalloc to get my memory and this was ... Asuming this is correct, do I need to specify mem to ioremap, or can I use an ...
      (comp.os.linux.development.system)