cluster_lockd.scr(check) timed out - error polling `cluster_lockd`

From: David Knight (dknight_at_fitzandfloyd.com)
Date: 09/25/03

  • Next message: Daniel Clar: "SUMMARY : System crashes but why ?"
    Date: Thu, 25 Sep 2003 08:54:54 -0500
    To: tru64-unix-managers@ornl.gov
    
    

    Admins,
    Does anyone have any insight into the three errors below?

     

    ============================ Syslog event ============================

    EVM event name: sys.unix.syslog.daemon

    Syslog daemon events are posted by system daemons to alert the
    administrator to an unusual condition. The user name field usually
    indicates which daemon posted the event. The text of the message
    indicates the reason for the event.

    ======================================================================

    Formatted Message:

    CAAD[1573493]: RTD #0: Action Script /var/cluster/caa/script/cluster_lockd.scr(check) timed out! (timeout=60)

    Event Data Items:

    Event Name : sys.unix.syslog.daemon

    Priority : 600

    PID : 1573365

    PPID : 1572865

    Event Id : 20623

    Member Id : 3

    Timestamp : 25-Sep-2003 01:31:40

    Host IP address : 10.10.5.170

    Cluster IP address: 10.10.5.151

    Host Name : dalunix170.clubcorp.com

    Cluster Name : dalunixcl

    User Name : root

    Format : CAAD[1573493]: RTD #0: Action Script /var/cluster/caa/script/cluster_lockd.scr(check) timed out! (timeout=60)

    Reference : cat:evmexp.cat:200

    Variable Items:

    None

    ======================================================================

    ============================ Syslog event ============================

    EVM event name: sys.unix.syslog.daemon

    Syslog daemon events are posted by system daemons to alert the
    administrator to an unusual condition. The user name field usually
    indicates which daemon posted the event. The text of the message
    indicates the reason for the event.

    ======================================================================

    Formatted Message:

    CAAD[1573493]: An error was encountered while polling `cluster_lockd`

    Event Data Items:

    Event Name : sys.unix.syslog.daemon

    Priority : 600

    PID : 1573365

    PPID : 1572865

    Event Id : 20624

    Member Id : 3

    Timestamp : 25-Sep-2003 01:31:40

    Host IP address : 10.10.5.170

    Cluster IP address: 10.10.5.151

    Host Name : dalunix170.clubcorp.com

    Cluster Name : dalunixcl

    User Name : root

    Format : CAAD[1573493]: An error was encountered while polling `cluster_lockd`

    Reference : cat:evmexp.cat:200

    Variable Items:

    None

    ======================================================================

     

    ---------- Problem Found: Received Error on Data Packet at Sep 24, 2003 6:14:57 PM GMT-05:00 ----------

    Problem Report Times:

    Event Time: Sep 20, 2003 6:11:21 PM GMT-05:00

    Report Time: Sep 24, 2003 6:14:57 PM GMT-05:00

    Expiration Time: Sep 21, 2003 6:04:41 PM GMT-05:00

    Managed Entity:

    ------ Product Information ------

    Computer Name: dalunix170

     

    Service Obligation Data:

    Service Obligation: Valid

    Service Obligation Number: AY20500939

    System Serial Number: AY20500939

    Service Provider Company Name: Hewlett-Packard Company

     

    Brief Description:

    Received Error on Data Packet

     

    Callout ID:

    RPE01x0018x1011-08

    Severity:

    2

    Reporting Node:

    dalunix170

    Full Description:

    ----- Standard Hub Error Description and FRU Callout -----

    This reporting Adapter received a data error in a data packet during
    a transaction. The packet will be dis-missed and retried by software.
    This error is recoverable by software.

    -----------------------------------------------------------------

    The data path is from the transmitting Linecard, through the HUB,
    to the Linecard connected to this Adapter, and finally, this Adapter.

     

    FRU List:

    Standard Hub FRU List:

    Highest Probability: The Receiving CCMLB-AA Linecard

    Manufacturer: Compaq

    Description: Memory Channel HUB Linecard

    PartNumber: 54-24966-01

    Location:

    This Adapter is connected to the Linecard in slot 2 of the HUB.

    ------------------------------------------

    Next Highest Probability: This Reporting CCMAB-AA Adapter

    Manufacturer: Compaq

    Description: PCI Memory Channel Adapter

    Location: PCI Slot: x00000001

    Part Number: 54-24962-01

    ------------------------------------------

    Third Most Probability: The Transmitting CCMLB-AA Linecard

    Manufacturer: Compaq

    Description: Memory Channel HUB Linecard

    Part Number: 54-24966-01

    Location:

    The transmitting Linecard is in slot 4 of the HUB.

     

    Evidence:

    Local Time of Event: Sep 20, 2003 6:11:21 PM GMT-05:00

    Record Number: Prefix: x6210 Count: x000C

    Link Control and Status Register: x00A8807A

    Memory Channel Error Register: x02021002

     

    SEA Version:

    System Event Analyzer for Tru64 UNIX V4.2 (Build 113)

     

    WCC Version:

    Web-based Enterprise Service Common Components for Tru64 UNIX V4.2 (Build 113),
    member of WEB-based Enterprise Service Suite for Tru64 UNIX V4.2 (Build 113)

     

     

