Re: A1000: Determining bad disk

From: Vikas Agnihotri (fornewsgroups_at_vikas.mailshell.com)
Date: 09/27/03


Date: Sat, 27 Sep 2003 00:24:17 -0400

On Thu, 25 Sep 2003 18:33:09 GMT, Darren Dunham <ddunham@redwood.taos.com>
wrote:

>>> I am seeing some SCSI transport failures in /var/adm/messages on one of
>>> my LUNs. The A1000 has all RAID5 luns.
>>>
>>> I suspect the disk is going bad.
>
> Why? If you do, you should run rm6 and run a healthcheck.

I dont like the rm6 GUI, the CLI equivalent is 'healthck', right? I did a
'healthck -a' and got 'Optimal'. I didnt expect anything else.

I dont know how thorough 'healthck' is anyway. Say the disk was going bad,
and I knew about it proactively, I could, on-demand, mark the drive failed
using 'drivutil' and take the reconstruction hit when I want to instead of
waiting for it to happen anytime!

How about 'parityck', is that a more exhaustive disk check?

Anyway, in this particular case, as it turned out, my SCSI errors were due
to the "disconnected tagged commands", for which Sun support suggested that
I consider reducing 'set sd:sd_max_throttle' (in /etc/system) to something
like 10 (default is 256) or so.

Is this common practice to throttle down the 'sd' driver with the RAID
A1000? Is this because the disks are too fast for the sd driver? [Or is it
the other way around?]

Thanks



Relevant Pages

  • Re: A1000: Determining bad disk
    ... I dont like the rm6 GUI, the CLI equivalent is 'healthck', right? ... is that a more exhaustive disk check? ...
    (comp.unix.solaris)
  • Re: A1000: Determining bad disk
    ... I dont like the rm6 GUI, the CLI equivalent is 'healthck', right? ... is that a more exhaustive disk check? ...
    (comp.sys.sun.admin)
  • Re: A1000: Determining bad disk
    ... I dont like the rm6 GUI, the CLI equivalent is 'healthck', right? ... is that a more exhaustive disk check? ...
    (comp.unix.solaris)
  • Re: How to show disk size?
    ... AVAILABLE DISK SELECTIONS: ... things wrongly configured there could cause delays on certain commands, ... I have an A3500 but the commands are the same... ... healthck succeeded! ...
    (comp.unix.solaris)
  • Raid Manager problem
    ... All commands hangs and should ... Truss output for "healthck c1t0d0" ... (c1t0d0 is disk from A1000) ...
    (SunManagers)