Re: mute crash / DDB
- From: "Steve at fivetrees" <steve@xxxxxxxxxxxxxxxxxxxxx>
- Date: Thu, 6 Jul 2006 13:20:29 +0100
"jpd" <read_the_sig@xxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:4h4b2rF1q2r04U1@xxxxxxxxxxxxxxxxx
Begin <IradnQyJctWRcDHZnZ2dnUVZ8qednZ2d@xxxxxxxxx>
On 2006-07-06, Steve at fivetrees <steve@xxxxxxxxxxxxxxxxxxxxx> wrote:
These machines, and their predecessors, and *their* predecessors, have
been very reliable - apart from the odd event such as this every few
months.
I don't know if your boxes are on a UPS, but if it really is that
sporadic and apparently fairly independent of the hardware, it even
might be anomalies in the power. If you have logs of previous incidents,
how regular are they, really?
They are indeed on a UPS. It's the machine that's taking most of the load
(active webserver) that goes mute every once in a long while (in the old
days, it'd reboot rather than die, as previously noted).
Re regularity: somewhere around 4-6 months. My guess (as I've said) is that
it's a resource issue. Note that I've seen this issue (assuming it's the
same issue, which seems likely) over several generations of both OpenBSD and
hardware [1]. I wasn't too worried about it when the symptom was a reboot;
I'm slightly more concerned now that it freezes, since it knocks out the
webserver etc until I notice and/or start getting phone calls. I have
monitoring enabled via my coloco provider, but since this works on the basis
of pings, it doesn't help :(.
I note recent discussion on the misc@ list re "3.9 freeze" - which exactly
describes what I'm seeing - i.e. completely dead but still responds to
pings.
[1] Except 2.6, on which I managed to get around 480 days of uptime. I'm a
bit more proactive on patches and controlled reboots these days ;). Back
then I ran a custom kernel; I've double-checked for significant differences,
but I'll repeat the exercise.
Steve
http://www.fivetrees.com
.
- References:
- mute crash / DDB
- From: Steve at fivetrees
- Re: mute crash / DDB
- From: Steve at fivetrees
- Re: mute crash / DDB
- From: jKILLSPAM . schipper
- Re: mute crash / DDB
- From: Steve at fivetrees
- Re: mute crash / DDB
- From: DoN. Nichols
- Re: mute crash / DDB
- From: Steve at fivetrees
- Re: mute crash / DDB
- From: jpd
- mute crash / DDB
- Prev by Date: Re: mute crash / DDB
- Next by Date: DNS zone transfers - which port?
- Previous by thread: Re: mute crash / DDB
- Next by thread: openBSD 3.7 tar command - tar: Invalid header, starting valid header search
- Index(es):
Relevant Pages
|