Re: Missing Files After Panic/Fsck?

From: Tim Bradshaw (tfb_at_cley.com)
Date: 11/06/03

    Date: Thu, 06 Nov 2003 09:32:57 +0000
    
    

    * Gavin Maltby wrote:
    > If this happens then it's a bug and you should complain to the
    > cluster vendor. It is possible to knobble some heartbeat designs
    > with device interrupts, but you can design around that.

    Yes, I'm not disputing it's a bug: I'm just wondering whether it can
    happen in current cluster systems. Complaining to the vendor may not
    get a fix to something like this very fast, if it happens, because I
    suspect it's not something a three-line change will fix.

    And just to play devil's advocate: *is* it a bug? We've all dealt
    with systems where some horrible thing has happened (runaway memory
    use, fork bomb or what have you), and while the system is in theory
    still up, it's not actually doing any useful work, nor likely to do
    any time soon, and the kindest thing is to put it out of its misery with
    the big red button. OK, *those* problems can, in theory, be worked
    around as well, but they still happen. Maybe the right thing in that
    case is for the cluster to just decide that machine is gone, and fail
    over? Of course it should probably make a conscious decision (`load
    average is 903, memory shortfall is 20GB, last user code ran 10mins
    ago, time to die') rather than just failing to respond...
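
The "conscious decision" described above could be sketched roughly as follows. This is only an illustration, not how any particular cluster product works: the names `Limits` and `fence_reason`, the threshold values, and the rule that at least two limits must be exceeded are all assumptions made up for the example. Gathering the three measurements from a real system is left out, since that is platform-specific.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Limits:
    """Hypothetical thresholds; real values would need tuning per site."""
    max_load_avg: float = 100.0
    max_mem_shortfall_bytes: int = 4 * 1024**3   # 4GB
    max_stall_seconds: float = 300.0             # 5 minutes


def fence_reason(load_avg: float,
                 mem_shortfall_bytes: int,
                 stall_seconds: float,
                 limits: Limits = Limits(),
                 min_breaches: int = 2) -> Optional[str]:
    """Return a human-readable reason to give up, or None to stay up.

    The node declares itself dead only when at least `min_breaches`
    limits are exceeded, so a single noisy metric does not cause a
    failover (an assumed policy, chosen for illustration).
    """
    breaches = []
    if load_avg > limits.max_load_avg:
        breaches.append(f"load average is {load_avg:g}")
    if mem_shortfall_bytes > limits.max_mem_shortfall_bytes:
        gb = mem_shortfall_bytes / 1024**3
        breaches.append(f"memory shortfall is {gb:.0f}GB")
    if stall_seconds > limits.max_stall_seconds:
        breaches.append(f"last user code ran {stall_seconds / 60:.0f}mins ago")

    if len(breaches) >= min_breaches:
        return ", ".join(breaches) + ", time to die"
    return None


if __name__ == "__main__":
    # The situation from the post: the node should announce why it is leaving.
    print(fence_reason(903, 20 * 1024**3, 600))
    # A healthy node: no reason to fail over.
    print(fence_reason(1.5, 0, 2))
```

The point of returning a reason string rather than a bare yes/no is that the node can log or announce why it is giving up before the cluster fails over, instead of simply going silent and leaving the other members to guess.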

    --tim

