SCSI errors

From: Julian Grunnell (julian.grunnell_at_pipex.net)
Date: 11/23/04

  • Next message: LAI Yiu Fai: "serverless backup with EBS"
    To: <sunmanagers@sunmanagers.org>
    Date: Tue, 23 Nov 2004 09:25:10 -0000
    
    

    Hi - can anyone please help me with some errors that have just been seen
    on one of our web servers:
     
    Nov 18 01:16:04 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 18 01:16:04 g-web1 Cmd (0x2304828) dump for Target 2 Lun 0:
    Nov 18 01:16:04 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 18 01:16:04 g-web1 cdb=[ 0x0 0x0 0x0 0x0 0x0 0x0 ]
    Nov 18 01:16:04 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 18 01:16:04 g-web1 pkt_flags=0x14000 pkt_statistics=0x61
    pkt_state=0x7
    Nov 18 01:16:04 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 18 01:16:04 g-web1 pkt_scbp=0x0 cmd_flags=0xe1
    Nov 18 01:16:04 g-web1 scsi: [ID 107833 kern.warning] WARNING:
    /pci@6,4000/scsi@2 (glm2):
    Nov 18 01:16:04 g-web1 Disconnected tagged cmd(s) (1) timeout for
    Target 2.0
    Nov 18 01:16:04 g-web1 genunix: [ID 408822 kern.info] NOTICE: glm2:
    fault detected in device; service still available
    Nov 18 01:16:04 g-web1 genunix: [ID 611667 kern.info] NOTICE: glm2:
    Disconnected tagged cmd(s) (1) timeout for Target 2.0
    Nov 18 01:16:04 g-web1 glm: [ID 401478 kern.warning] WARNING:
    ID[SUNWpd.glm.cmd_timeout.6018]
    Nov 18 01:16:04 g-web1 scsi: [ID 107833 kern.warning] WARNING:
    /pci@6,4000/scsi@2 (glm2):
    Nov 18 01:16:04 g-web1 got SCSI bus reset
    Nov 18 01:16:04 g-web1 genunix: [ID 408822 kern.info] NOTICE: glm2:
    fault detected in device; service still available
    Nov 18 01:16:04 g-web1 genunix: [ID 611667 kern.info] NOTICE: glm2: got
    SCSI bus reset
    Nov 23 04:21:06 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 23 04:21:06 g-web1 Cmd (0x3c0c680) dump for Target 2 Lun 0:
    Nov 23 04:21:06 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 23 04:21:06 g-web1 cdb=[ 0x0 0x0 0x0 0x0 0x0 0x0 ]
    Nov 23 04:21:06 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 23 04:21:06 g-web1 pkt_flags=0x14000 pkt_statistics=0x61
    pkt_state=0x7
    Nov 23 04:21:06 g-web1 scsi: [ID 365881 kern.info] /pci@6,4000/scsi@2
    (glm2):
    Nov 23 04:21:06 g-web1 pkt_scbp=0x0 cmd_flags=0xe1
    Nov 23 04:21:06 g-web1 scsi: [ID 107833 kern.warning] WARNING:
    /pci@6,4000/scsi@2 (glm2):
    Nov 23 04:21:06 g-web1 Disconnected tagged cmd(s) (1) timeout for
    Target 2.0
    Nov 23 04:21:06 g-web1 genunix: [ID 408822 kern.info] NOTICE: glm2:
    fault detected in device; service still available
    Nov 23 04:21:06 g-web1 genunix: [ID 611667 kern.info] NOTICE: glm2:
    Disconnected tagged cmd(s) (1) timeout for Target 2.0
    Nov 23 04:21:06 g-web1 glm: [ID 401478 kern.warning] WARNING:
    ID[SUNWpd.glm.cmd_timeout.6018]
    Nov 23 04:21:06 g-web1 scsi: [ID 107833 kern.warning] WARNING:
    /pci@6,4000/scsi@2 (glm2):
    Nov 23 04:21:06 g-web1 got SCSI bus reset
    Nov 23 04:21:06 g-web1 genunix: [ID 408822 kern.info] NOTICE: glm2:
    fault detected in device; service still available
    Nov 23 04:21:06 g-web1 genunix: [ID 611667 kern.info] NOTICE: glm2: got
    SCSI bus reset
    Nov 23 05:22:14 g-web1 pseudo: [ID 129642 kern.info] pseudo-device:
    devinfo0
    Nov 23 05:22:14 g-web1 genunix: [ID 936769 kern.info] devinfo0 is
    /pseudo/devinfo@0

     
    ** Platform **
    System Configuration: Sun Microsystems sun4u
    Memory size: 2048 Megabytes
    System Peripherals (Software Nodes):
     
    SUNW,Ultra-4
        options, instance #0
        pci, instance #0
            ebus, instance #0
                se, instance #0
                su, instance #0
                su, instance #1
                fdthree, instance #0
                SUNW,envctrl, instance #0
            network, instance #0
            scsi, instance #0
                sd, instance #0
                sd, instance #1
                sd, instance #2
                sd, instance #3
            scsi, instance #1
                sd, instance #21
            TSI,gfxp, instance #0
        pci, instance #1
        pci, instance #2
        pci, instance #3
        pci, instance #4
            scsi, instance #2
                sd, instance #30
                sd, instance #31
                sd, instance #32
                sd, instance #33
            scsi, instance #3
        pci, instance #5
        pseudo, instance #0

    SunOS g-web1 5.8 Generic_108528-19 sun4u sparc SUNW,Ultra-4
     
    Searching for disks...done
     

    AVAILABLE DISK SELECTIONS:
           0. c0t0d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@1f,4000/scsi@3/sd@0,0
           1. c0t1d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@1f,4000/scsi@3/sd@1,0
           2. c0t2d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@1f,4000/scsi@3/sd@2,0
           3. c0t3d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@1f,4000/scsi@3/sd@3,0
           4. c2t0d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@6,4000/scsi@2/sd@0,0
           5. c2t1d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@6,4000/scsi@2/sd@1,0
           6. c2t2d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@6,4000/scsi@2/sd@2,0
           7. c2t3d0 <SUN18G cyl 7506 alt 2 hd 19 sec 248>
              /pci@6,4000/scsi@2/sd@3,0

    bash-2.03# iostat -En
    c0t0d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
    Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No:
    02500395
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c0t1d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
    Vendor: SEAGATE Product: ST318203LSUN18G Revision: 034A Serial No:
    LRC0752900001030
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c0t2d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
    Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No:
    01598932
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c0t3d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
    Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No:
    01599095
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c1t6d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
    Vendor: TOSHIBA Product: XM6201TASUN32XCD Revision: 1103 Serial No:
    12/12/97
    Size: 0.54GB <538818560 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c2t0d0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 0
    Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No:
    01598837
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 2 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c2t1d0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 0
    Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No:
    01599115
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 2 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c2t2d0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 2
    Vendor: SEAGATE Product: ST318203LSUN18G Revision: 034A Serial No:
    LRB9463200001030
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 2 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0
    c2t3d0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 0
    Vendor: SEAGATE Product: ST318203LSUN18G Revision: 034A Serial No:
    LRB0739700007027
    Size: 18.11GB <18110967808 bytes>
    Media Error: 0 Device Not Ready: 0 No Device: 2 Recoverable: 0
    Illegal Request: 0 Predictive Failure Analysis: 0

     
    The disks are setup with Disksuite, but the disk which I think the
    errors are referring to (c2t2d0) is not part of any disksuite set. I am
    just running a format / analyse on the disk at the moment to see if it
    brings anything up. Looking at the iostat output does show that
    Controller 2 has "2" hard errors for each disk. Maybe the Controller
    itself is at fault.
     
    Thanks in advance - Julian.
     

    Julian Grunnell
    3rd Line Technical Support
    PIPEX

    Telephone: 0113 302 1005
    Mobile: 07803 649593
    Website: http://www.pipex.net <http://www.pipex.net/>
    _______________________________________________
    sunmanagers mailing list
    sunmanagers@sunmanagers.org
    http://www.sunmanagers.org/mailman/listinfo/sunmanagers


  • Next message: LAI Yiu Fai: "serverless backup with EBS"