Re: errpt not clear to me
- From: Lamar Saxon <Lamar.Saxon@xxxxxxxxxxxxxxx>
- Date: Wed, 25 Jan 2006 16:27:07 -0600
That invalid data strip is bad to see. You are correct: the solution is to back up the array, destroy it, rebuild it, and restore it.

I know of others who have had this problem before, but it is a rare one for me. It might be a code issue with the SSA device drivers or something.

The reason diags did not see it is that it is a problem with the logical drive, not the physical drive. ssa_health_check picked up on a LUN issue.
Lamar

From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Mark Schlechte
Sent: Wednesday, January 25, 2006 4:11 PM
To: aix-l@xxxxxxxxxxxxx
Subject: Re: errpt not clear to me

Command: OK            stdout: yes           stderr: no

Before command completion, additional instructions may appear below.

          Unsynced Parity Strips   Unbuilt Data Strips
hdisk2    0                        0                     Invalid data strip

It looks to me like, even though the certify process didn't find an error, I may have to back up my data, delete the array, and try formatting the disks to find the problem.

>>>Lamar.Saxon@xxxxxxxxxxxxxxx 01/25/06 1:06 pm >>>
Under smitty ssaraid, what do you see when you select:

List Status Of All Defined SSA RAID Arrays

and then select RAID5 and the adapter?

Thanks,
Lamar

From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Mark Schlechte
Sent: Wednesday, January 25, 2006 1:38 PM
To: aix-l@xxxxxxxxxxxxx
Subject: Re: errpt not clear to me

A PROBLEM WAS DETECTED ON Wed Jan 25 13:26:21 CST 2006                801014
The Service Request Number(s)/Probable Cause(s)
(causes are listed in descending order of probability):
47500: Use the Service Guide for your SSA
Adapter or SSA Subsystem.
SSA-SUBSYSTEM P2-I4/Q1
Use Enter to continue.
From the SSA adapter guide:
47500: Description: Part of the array data might have been lost.
I'm following along in the guide now, so we'll see what happens.
Thanks for the suggestion.
>>>Lamar.Saxon@xxxxxxxxxxxxxxx 01/25/06 12:07 pm >>>
Just as easy: go to diags and problem determination and see what SRN you get.

My best guess is that you have lost hot spare capability...

Lamar

From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Patrick B. O'Brien
Sent: Wednesday, January 25, 2006 12:21 PM
To: aix-l@xxxxxxxxxxxxx
Subject: Re: errpt not clear to me

You can certify each disk without harming the data, within the SSA task.

From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Mark Schlechte

Yep, been there, done that. Status is good for all. I think I've been through all those sections and everything seems to verify as OK.

>>>pobrien@xxxxxxxxxxx 01/25/06 10:36 am >>>
How about running diag: go to tasks, SSA service aids, link verification, and choose your card. Look in the right column for “Good” or a clue.

From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Mark Schlechte
Sent: Wednesday, January 25, 2006 9:20 AM
To: aix-l@xxxxxxxxxxxxx
Subject: errpt not clear to me

I'm getting this in my errpt on an AIX 4.3.3 server.

IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
B4C00618   0125050006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0125040006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0125030006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0125020006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0125010006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0125000006 P H ssa0           RESOURCE UNAVAILABLE
1581762B   0124232506 T H hdisk0         DISK OPERATION ERROR
B4C00618   0124230006 P H ssa0           RESOURCE UNAVAILABLE
B4C00618   0124220006 P H ssa0           RESOURCE UNAVAILABLE
1581762B   0124211606 T H hdisk0         DISK OPERATION ERROR
B4C00618   0124210006 P H ssa0           RESOURCE UNAVAILABLE
1581762B   0124205906 T H hdisk0         DISK OPERATION ERROR
1581762B   0124204606 T H hdisk0         DISK OPERATION ERROR

I know the SSA errors are being generated by the run_ssa_healthcheck script. I've checked via smitty ssaraid, and the status is good and not rebuilding. I've also checked, and I have a hot spare. I've now just checked all the SSA disks via certify and found nothing. hdisk0 happens to be a SCSI rootvg disk, so I plan on taking down the server to certify it as well. hdisk2 is my SSA RAID5 disk over 16 pdisks in a D40.

>lsdev -Cc disk
hdisk0 Available 10-60-00-9,0  16 Bit LVD SCSI Disk Drive
hdisk1 Available 10-60-00-11,0 16 Bit LVD SCSI Disk Drive
hdisk2 Available 10-70-L       SSA Logical Disk Drive

>ssaxlate -l hdisk2
pdisk0 pdisk1 pdisk2 pdisk3 pdisk4 pdisk5 pdisk6 pdisk7 pdisk8 pdisk9 pdisk10 pdisk11 pdisk12 pdisk13 pdisk14

So far I can't really find anything wrong, but as I said, I'll take the server down later to run diags on it. Is there anything else anyone thinks I should look at?

Mark
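
For anyone who wants to tally an errpt summary like the one in the original post, here is a minimal sketch in Python. It is not something anyone in the thread ran. It assumes the summary has been saved as text (for example by redirecting errpt output to a file), that each row has the six columns shown in the listing, and that TIMESTAMP is in errpt's MMDDhhmmYY form. The names parse_errpt and SAMPLE are made up for illustration; the sample rows are copied from the listing.

```python
from collections import Counter
from datetime import datetime

# A few rows copied from the errpt listing in the original post.
SAMPLE = """\
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
B4C00618   0125050006 P H ssa0           RESOURCE UNAVAILABLE
1581762B   0124232506 T H hdisk0         DISK OPERATION ERROR
B4C00618   0124230006 P H ssa0           RESOURCE UNAVAILABLE
"""


def parse_errpt(text):
    """Parse errpt summary rows into dicts (hypothetical helper)."""
    entries = []
    for line in text.splitlines():
        # Split into at most six fields so the description keeps its spaces.
        fields = line.split(None, 5)
        if len(fields) < 6 or fields[0] == "IDENTIFIER":
            continue  # skip the header row and anything malformed
        ident, stamp, etype, eclass, resource, desc = fields
        entries.append({
            "id": ident,
            # Assumed format: MMDDhhmmYY, e.g. 0125050006 = Jan 25 05:00 2006
            "when": datetime.strptime(stamp, "%m%d%H%M%y"),
            "type": etype,
            "class": eclass,
            "resource": resource,
            "description": desc.strip(),
        })
    return entries


entries = parse_errpt(SAMPLE)
# Count how often each (identifier, resource) pair shows up.
tally = Counter((e["id"], e["resource"]) for e in entries)
for (ident, resource), count in tally.most_common():
    print(ident, resource, count)
```

Grouping by identifier and resource makes the pattern in the listing easier to see: the B4C00618 entries against ssa0 land on the hour, while the 1581762B entries against hdisk0 are irregular.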