Re: errpt not clear to me
- From: hercom <hercom@xxxxxxxxxx>
- Date: Tue, 31 Jan 2006 16:59:04 -0500
This error shows up when the controller, while rebuilding or scrubbing a
data strip, detects that the data and its parity do not match. It usually
appears after a damaged disk unit has been replaced, and can be caused by a
partial (surface) failure of one of the remaining disks. While the array can
still be used, the part resident on the damaged surface is not recoverable,
even by means of the parity. Think of it as a double disk failure. The array
must be recreated after certifying all the disks in it (and replacing the bad
ones). No utility provided by AIX can recover all the data from such an
array, but there's a chance that the data on the damaged surface belongs to
lost+found or another directory without vital data.
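
To make the "double disk failure" comparison concrete, here is a minimal Python sketch of XOR parity as RAID-5 generally uses it. It is not the SSA adapter's actual logic; the strip contents and the helper names xor_strips and stripe_is_consistent are made up for illustration.

```python
from functools import reduce


def xor_strips(strips):
    """XOR equal-length byte strings together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*strips))


def stripe_is_consistent(strips, parity):
    """Scrub-style check: data XOR parity must come out as all zeros."""
    return not any(xor_strips(list(strips) + [parity]))


# Three hypothetical data strips and the parity strip computed from them.
data = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_strips(data)

# Single failure: strip 1 is gone, but the survivors plus parity restore it.
rebuilt = xor_strips([data[0], data[2], parity])
assert rebuilt == data[1]

# Double failure: strip 1 is on the replaced disk and strip 2 sits on a bad
# surface. The survivors only yield the XOR of the two missing strips,
# which is not enough to tell either one apart.
combined = xor_strips([data[0], parity])
assert combined == xor_strips([data[1], data[2]])
```

With one strip missing the parity equation has one unknown; with two missing it has two, which is why parity cannot bring back the damaged region.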
Hcamargo
-----Original Message-----
From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Juan
Camilo Yanquen
Sent: Wednesday, January 25, 2006 3:19 PM
To: aix-l@xxxxxxxxxxxxx
Subject: Re: errpt not clear to me
SRN    FRU List  Problem
47500  None      Description: Part of the array data might have been lost.
                 Action: Go to "MAP 2324: SSA RAID" on page 260.
=================
Attention: Part of the data that is on the array has been damaged and cannot be
recovered. Before any other action is taken, the user must recover all the data
that is not damaged, and create a backup of that data.
a. Type smitty ssaraid and press Enter.
b. Select List Status Of All Defined SSA RAID Arrays.
===============================================
Taken from "SSA Adapters User and Maintenance Information"
>
> A PROBLEM WAS DETECTED ON Wed Jan 25 13:26:21 CST 2006
>
> 801014
>
> The Service Request Number(s)/Probable Cause(s)
> (causes are listed in descending order of probability):
>
> 47500: Use the Service Guide for your SSA Adapter or SSA Subsystem.
> SSA-SUBSYSTEM P2-I4/Q1
>
> Use Enter to continue.
>
> From the SSA adapter guide:
> 47500: Description: Part of the array data might have been lost.
>
> I'm following along in the guide now so we'll see what happens.
> Thanks for the suggestion.
>
>>>>Lamar.Saxon@xxxxxxxxxxxxxxx 01/25/06 12:07 pm >>>
>
> Just as easy, go to diags and problem determination and see what SRN you
> get.
>
> My best guess is you have lost hot spare capability...
>
> Lamar
>
> From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of
> Patrick B. O'Brien
> Sent: Wednesday, January 25, 2006 12:21 PM
> To: aix-l@xxxxxxxxxxxxx
> Subject: Re: errpt not clear to me
>
>
> You can certify each disk without harming the data, within SSA task.
>
>
>
> From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of
> Mark Schlechte
> Sent: Wednesday, January 25, 2006 9:52 AM
> To: aix-l@xxxxxxxxxxxxx
> Subject: Re: errpt not clear to me
>
> Yep, been there done that. Status is good for all.
> I think I've been through all those sections and everything seems to
> verify as ok.
>
>>>>pobrien@xxxxxxxxxxx 01/25/06 10:36 am >>>
>
> How about running:
> diag, go to tasks, ssa-service aids, link verification, choose your card.
> Look in the right column for "Good" or a clue.
>
>
>
> From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of
> Mark Schlechte
> Sent: Wednesday, January 25, 2006 9:20 AM
> To: aix-l@xxxxxxxxxxxxx
> Subject: errpt not clear to me
>
> I'm getting this in my errpt on an AIX 4.3.3 server.
>
> IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
> B4C00618   0125050006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0125040006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0125030006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0125020006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0125010006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0125000006 P H ssa0           RESOURCE UNAVAILABLE
> 1581762B   0124232506 T H hdisk0         DISK OPERATION ERROR
> B4C00618   0124230006 P H ssa0           RESOURCE UNAVAILABLE
> B4C00618   0124220006 P H ssa0           RESOURCE UNAVAILABLE
> 1581762B   0124211606 T H hdisk0         DISK OPERATION ERROR
> B4C00618   0124210006 P H ssa0           RESOURCE UNAVAILABLE
> 1581762B   0124205906 T H hdisk0         DISK OPERATION ERROR
> 1581762B   0124204606 T H hdisk0         DISK OPERATION ERROR
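
A summary like the one above is easier to read once it is grouped by resource. The following is a rough Python sketch, assuming the usual errpt summary layout (timestamp as MMDDhhmmYY, T column P for permanent or T for temporary, C column H for hardware). The function name parse_errpt and the shortened SAMPLE text are hypothetical and only meant for illustration.

```python
from collections import Counter
from datetime import datetime

# A few lines copied from the listing above (shortened sample).
SAMPLE = """\
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
B4C00618 0125050006 P H ssa0 RESOURCE UNAVAILABLE
B4C00618 0125040006 P H ssa0 RESOURCE UNAVAILABLE
1581762B 0124232506 T H hdisk0 DISK OPERATION ERROR
1581762B 0124211606 T H hdisk0 DISK OPERATION ERROR
"""


def parse_errpt(text):
    """Turn errpt summary lines into dicts, skipping the header row."""
    entries = []
    for line in text.splitlines():
        # Drop any mail quote prefix, then split into the six columns.
        fields = line.lstrip("> ").split(None, 5)
        if len(fields) < 6 or fields[0] == "IDENTIFIER":
            continue
        ident, stamp, etype, eclass, resource, description = fields
        entries.append({
            "id": ident,
            "when": datetime.strptime(stamp, "%m%d%H%M%y"),
            "type": etype,
            "class": eclass,
            "resource": resource,
            "description": description,
        })
    return entries


entries = parse_errpt(SAMPLE)
counts = Counter((e["resource"], e["description"]) for e in entries)
for (resource, description), n in counts.items():
    print(f"{resource:8} {description:24} {n}")
```

Grouped this way, the ssa0 entries all fall exactly on the hour, which fits an hourly health check, while the hdisk0 entries are temporary errors at irregular times.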
>
> I know the ssa errors are being generated from the run_ssa_healthcheck
> script.
> I've checked via smitty ssaraid and the status is good and not
> rebuilding.
> I've also checked and I have a hot spare.
> I've now just checked all the ssa disks via certify and nothing.
>
> hdisk0 happens to be a scsi rootvg disk so I plan on taking down the
> server to certify it as well.
> hdisk2 is my ssa raid5 disk over 16 pdisks in a d40.
>
> >lsdev -Cc disk
> hdisk0 Available 10-60-00-9,0  16 Bit LVD SCSI Disk Drive
> hdisk1 Available 10-60-00-11,0 16 Bit LVD SCSI Disk Drive
> hdisk2 Available 10-70-L       SSA Logical Disk Drive
> >ssaxlate -l hdisk2
> pdisk0 pdisk1 pdisk2 pdisk3 pdisk4 pdisk5 pdisk6 pdisk7 pdisk8 pdisk9
> pdisk10 pdisk11 pdisk12 pdisk13 pdisk14
>
> So far I can't really find anything wrong but as I said I'll take the
> server down later to run diags on.
>
> Anything anyone else thinks I should look at?
>
> Mark
> DISCLAIMER: The information transmitted is intended only
> for the addressee and may contain confidential,
> proprietary and/or privileged material. Any
> unauthorized review, distribution or other use
> of or the taking of any action in reliance upon
> this information is prohibited. If you received
> this in error, please contact the sender and
> delete or destroy this message and any copies.
>
> As of January 20, any e-mails addressed
> to cityregina.com will not be delivered.
>