Re: Kernel panic on PowerEdge 1950 under certain stress load



You are right, they may not be the same. From first look it seems like they
are similar based on the description of the problems -- system stable, then
under load related to network, get panic after different time intervals. I
just assumed that kernel is typically stable enough that this kind of panic
are rare (been using FBSD for 7 or 8 years now and in heavy loads as well,
never had kernel panics to deal with).

Upon closer look at the trace and the problem, they may not be the same,
since one on those web pages was about the route code and my breaks only in
one place - waiting for a lock. Again, I will see if I could get a dump when
I return to the office.

I did reboot the system and set mpsafenet to 0 and I have not had a crash
since then (almost a day) running the same load, so that's positive: at
least it may be that that's the workaround, and I don't need Dell to send me
new memory modules to try...

Kris or Ivan: I was wondering if you could briefly explain what your guess
the problem might be. I am curious what the cause of the problem is. E.g. it
seems like a race condition, but I am curious to know more of the details...


Thanks,
Benjie



On 9/25/07, Kris Kennaway <kris@xxxxxxxxxxx> wrote:

Benjie Chen wrote:
Ivan and Kris,

I will try to get a kernel trace -- it may not happen for awhile since I
am
not in the office and working remotely for awhile so it may not be easy
to
get a trace... but I will check.

It looks like the problem reported by that link, and some of the links
from
there though...

Does it really? i.e. did you compare the function names in detail and
find that they match precisely, or do you just mean "they are both
panics of some description and I dunno what it all means"? :) I ask
because the linked trace does not involve a spinlock, which means it
cannot be precisely the same trace.

Kris




--
Benjie Chen, Ph.D.
Addgene, a better way to share plasmids
www.addgene.org
_______________________________________________
freebsd-hackers@xxxxxxxxxxx mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "freebsd-hackers-unsubscribe@xxxxxxxxxxx"



Relevant Pages

  • Re: Kernel panic on PowerEdge 1950 under certain stress load
    ... I increased the load even more and it crashed again. ... I will try kernel trace next. ... never had kernel panics to deal with). ... I will try to get a kernel trace -- it may not happen for awhile since ...
    (freebsd-hackers)
  • Re: ULE status, invalid load, buildkernel times.
    ... i've just updated my kernel and it paniced right after ... KDB_TRACE shows a trace during the panic. ... The problem is that ddb is ... activated before the machine actually panics, ...
    (freebsd-current)
  • Re: panic in propagate_priority w/ postgresql under heavy load
    ... > the kernel down with debug code seems to avoid the panic. ... result of a corrupted mutex, and when the mutex code goes to perform ... you can trace them to find one that is in panic. ... overhead), and when the system panics, you'll get a db> prompt. ...
    (freebsd-hackers)
  • Re: Random panics with 5.3-REL, SMP
    ... "Random panics" is a little vague as a starting point, ... explicit call to panicby code that has detected a failure of a kernel ... A stack trace can be generated using the "trace" ... - In terms of debugging information, it would be very useful if you could ...
    (freebsd-current)
  • Re: if_dc panics with 3Com OfficeConnect 10/100B PCI
    ... panics after a port mapping failure. ... Booting with 'dc' compiled into the kernel doens't quite work, ... panic happens too early for dumpdev to be set. ... Using a module doesn't really help with obtaining a trace, ...
    (freebsd-current)