Disk drive errors - is the drive dead?

From: Franz Fischer (Franz.Fischer_at_franz-fischer.de)
Date: 03/13/05

  • Next message: Franz Fischer: "DPW500au and RADEON 7500 PCI --- any experiences"
    Date: Sun, 13 Mar 2005 11:59:54 +0100 (MET)
    To: tru64-unix-managers@ornl.gov
    
    

    Hi all,

    I was trying to move a COMPAQ DGHS18Y 18GB drive from a retiered file server
    to my Alpha box, but I see repeated hard errors reported by uerf, sometimes
    the system gets hung for a while due to SCSI bus timeouts / resets.

    Does this indicate the drive is (almost) dead?

    Current setup is AlphaStation 255, Tru64 UNIX 4.0G, narrow internal SCSI
    bus, DGHS18Y connected via 50pin to 80pin SCA adapter.

    Uerf report below.

    Thanks in advance for your help

            \franz

    ----- EVENT INFORMATION -----

    EVENT CLASS ERROR EVENT
    OS EVENT TYPE 199. CAM SCSI
    SEQUENCE NUMBER 4.
    OPERATING SYSTEM DEC OSF/1
    OCCURRED/LOGGED ON Sun Mar 13 11:49:06 2005
    OCCURRED ON SYSTEM moco
    SYSTEM ID x0006000D CPU TYPE: DEC 7000
    SYSTYPE x00000000

    ----- UNIT INFORMATION -----

    CLASS x0000 DISK
    SUBSYSTEM x0000 DISK
    BUS # x0000
                                  x0010 LUN x0
                                            TARGET x2

    ----- CAM STRING -----

    ROUTINE NAME cdisk_check_sense

    ----- CAM STRING -----

                                            Device aborted command - parity error?

    ----- CAM STRING -----

    ERROR TYPE Hard Error Detected

    ----- CAM STRING -----

    DEVICE NAME COMPAQ DGHS18Y 01C0

    ----- CAM STRING -----

                                            Active CCB at time of error

    ----- CAM STRING -----

                                            CCB request completed with an error

    ----- ENT_CCB_SCSIIO -----

    *MY ADDR x09F9D580
    CCB LENGTH x00C0
    FUNC CODE x01
    CAM_STATUS x00C4 CAM_REQ_CMP_ERR
                                            SIM QFRZN
                                            AUTOSNS_VALID
    PATH ID 0.
    TARGET ID 2.
    TARGET LUN 0.
    CAM FLAGS x00000442
                                            CAM_QUEUE_ENABLE
                                            CAM_DIR_IN
                                            CAM_SIM_QFRZDIS
    *PDRV_PTR x09F9D228
    *NEXT_CCB x00000000
    *REQ_MAP x09F74200
    VOID (*CAM_CBFCNP)() x00465210
    *DATA_PTR x40039800
    DXFER_LEN x00002000
    *SENSE_PTR x09F9D250
    SENSE_LEN x40
    CDB_LEN x0A
    SGLIST_CNT x0000
    CAM_SCSI_STATUS x0002 SCSI_STAT_CHECK_CONDITION
    SENSE_RESID x20
    RESID x00000000
    CAM_CDB_IO x000000100000B02EF5010028
    CAM_TIMEOUT x0000003C
    MSGB_LEN x0000
    VU_FLAGS x4000
    TAG_ACTION x20

    ----- CAM STRING -----

                                            Error, exception, or abnormal
                                             _condition

    ----- CAM STRING -----

                                            ABORTED COMMAND - Target aborted
                                             _command

    ----- ENT_SENSE_DATA -----

    ERROR CODE x0070 CODE x70
    SEGMENT x00
    SENSE KEY x000B ABORTED CMD
    INFO BYTE 3 x00
    INFO BYTE 2 x00
    INFO BYTE 1 x00
    INFO BYTE 0 x00
    ADDITION LEN x18
    CMD SPECIFIC 3 x00
    CMD SPECIFIC 2 x00
    CMD SPECIFIC 1 x00
    CMD SPECIFIC 0 x00
    ASC x1B
    ASQ x00
    FRU x00
    SENSE SPECIFIC x000000
    ADDITIONAL SENSE
    0000: 05010000 00000000 00000000 00000000 *................*
    0010: 00000000 00000000 00000000 00000000 *................*
    0020: 00000000 00000000 00000000 00000000 *................*
    0030: 7E250000 00005E3C 00000000 00000000 *..%~<^..........*

    --
    Franz G. Fischer ------------ Franz dot Fischer at franz-fischer dot de
    

  • Next message: Franz Fischer: "DPW500au and RADEON 7500 PCI --- any experiences"