Re: Fibre Channel Array Troubles.
- From: Hans-Dieter Kutz <hdkutz@xxxxxxxxx>
- Date: Wed, 16 Aug 2006 16:31:15 +0200
On Wed, Aug 16, 2006 at 09:53:28AM -0400, Brian Wheeler wrote:
Hey all, first a little background:
Brian,
A few weeks ago we had a cache battery go out on our fastt 200
(controller b) so I set the preferred path to controller a, but it kept
switching back to controller b. After running diagnostics, the
external wrap plug test failed on the pci card which was connected to
controller a, so in order to get everything functioning, I swapped the
cables on the PCI cards and I was able to switch to controller a. No
problems...
Original config:
slot 4 -> controller a
slot 5 -> controller b
After futzing around:
slot 5 -> controller a
slot 4 -> (no connection)
(no connection) -> controller b
After I replaced the battery and got a replacement FC card, the disk
array was acting funny. Sometimes it won't come up after a reboot and
failover isn't working. The current configuration is:
slot 5 -> controller a
slot 4 -> (no card)
slot 2 -> controller b
To get AIX less confused I had to put the card into slot 2 because for
some reason it was terribly flaky in slot 4. The replacement was a
different revision. The firmware is (now) the latest on both cards.
I have a bunch of defined devices that represent the various
configurations I had and I can't get them to go away:
dac0 Defined 1Z-08-01 3542 (200) Disk Array Controller
dac1 Defined 1Z-08-01 3542 (200) Disk Array Controller
dac2 Defined 1D-08-01 3542 (200) Disk Array Controller
dac4 Defined 1D-08-01 fcparray Disk Array Controller
fcnet0 Defined 1Z-08-02 Fibre Channel Network Protocol Device
fcnet1 Defined 1D-08-02 Fibre Channel Network Protocol Device
fcnet2 Defined 1H-08-02 Fibre Channel Network Protocol Device
fcs0 Defined 1Z-08 FC Adapter
fscsi0 Defined 1Z-08-01 FC SCSI I/O Controller Protocol Device
The 'working' devices are Available:
dac3 Available 1D-08-01 3542 (200) Disk Array Controller
dac5 Available 1H-08-01 3542 (200) Disk Array Controller
dar0 Available 3542 (200) Disk Array Router
fcs1 Available 1D-08 FC Adapter
fcs2 Available 1H-08 FC Adapter
fscsi1 Available 1D-08-01 FC SCSI I/O Controller Protocol Device
fscsi2 Available 1H-08-01 FC SCSI I/O Controller Protocol Device
hdisk4 Available 1D-08-01 3542 (200) Disk Array Device
When I try running rmdev (or using the smit equiv) on the Defined
devices that aren't there anymore, I get an error telling me they're
not in the right state.
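One way to see which entries are stale is to filter the device listing by its state column. This is only a minimal sketch, not a tested AIX procedure: `list_defined` is a hypothetical helper name, and it assumes the device name is in column 1 and the state in column 2, as in the listing above. On the AIX host the input would come from `lsdev -C`; here it is fed a few lines copied from this post.

```shell
#!/bin/sh
# Hypothetical helper: print the names of devices whose state column is "Defined".
# Assumes input shaped like the lsdev listing quoted above (name, state, ...).
list_defined() {
    awk '$2 == "Defined" { print $1 }'
}

# Sample input copied from the listings in this post.
sample='dac0 Defined 1Z-08-01 3542 (200) Disk Array Controller
dac3 Available 1D-08-01 3542 (200) Disk Array Controller
dac4 Defined 1D-08-01 fcparray Disk Array Controller
fcs0 Defined 1Z-08 FC Adapter
fcs1 Available 1D-08 FC Adapter'

printf '%s\n' "$sample" | list_defined
```

On the real system this would be used as `lsdev -C | list_defined`.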
The dar0 device is using an Available device and a Defined device:
bash-2.04# fget_config -l dar0
dac3 ACTIVE dacNONE ACTIVE
hdisk4 dac3
bash-2.04# lsattr -El dar0
act_controller dac3 Active Controllers False
aen_freq 600 Polled AEN frequency in seconds True
all_controller dac3,dac4 Available Controllers False
autorecovery no Autorecover after failure is corrected True
balance_freq 600 Dynamic Load Balancing frequency in seconds True
cache_size 88 Cache size for both controllers False
fast_write_ok no Fast Write available False
held_in_reset none Held-in-reset controller True
hlthchk_freq 600 Health check frequency in seconds True
load_balancing no Dynamic Load Balancing True
switch_retries 5 Number of times to retry failed switches True
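The mismatch shows up in the `all_controller` attribute, which names dac3 and dac4 even though dac4 is only Defined. A minimal sketch for pulling that list out so it can be compared against the Available devices follows. `controllers_of` is a hypothetical helper name, and it assumes the attribute name is in column 1 and its value in column 2, as in the `lsattr -El dar0` output above.

```shell
#!/bin/sh
# Hypothetical helper: print one controller per line from the
# all_controller attribute of lsattr-style output read on stdin.
controllers_of() {
    awk '$1 == "all_controller" { print $2 }' | tr ',' '\n'
}

# Sample input copied from the lsattr output in this post.
sample='act_controller dac3 Active Controllers False
all_controller dac3,dac4 Available Controllers False'

printf '%s\n' "$sample" | controllers_of
```

On the real system this would be used as `lsattr -El dar0 | controllers_of`.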
HELP! I'm pretty confused. Luckily the array is working for now,
but it's flakier than I'd like it to be.
So, how do I:
* Get rid of 'old' devices and have the configuration accurately
represent the current state?
* Get dar0 to use fcs3 & fcs5?
* catch a clue on how all of this is supposed to work...
- plan a service outage
Then:
- unmount the filesystems (umount)
- varyoffvg VG
- rmdev -dl hdisks
- look with lsdev -Cc adapter at the location codes of your fcs adapters
- use lsdev -C | grep <location code> to find the devices at those location codes (for later removal)
- rmdev -Rdl (all dar devices)
- rmdev -Rdl (all fcs devices)
- cfgmgr (this will reconfigure your FAStT devices)
- look with fget_config -Av at your FAStT array (does it show what you want?)
- importvg -y VG hdisk?
- mount your filesystems
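The steps above can be sketched as a script that only prints the commands unless told otherwise. Treat it as a rough outline under stated assumptions, not a tested procedure: the volume group `datavg` and mount point `/data` are hypothetical placeholders, the device names are taken from the listings earlier in this thread, and `NEW_HDISK` has to be set by hand to whatever name cfgmgr assigns. The fget_config check is a manual inspection, so a real run should stop there and be continued by hand.

```shell
#!/bin/sh
# Dry-run sketch of the rediscovery steps listed above.
# With DRY_RUN=1 (the default) commands are printed, not executed.
DRY_RUN=${DRY_RUN:-1}
VG=datavg            # hypothetical volume group name
FS=/data             # hypothetical mount point
HDISKS="hdisk4"      # array disks, from the listing in this thread
DARS="dar0"          # disk array routers
FCS="fcs0 fcs1 fcs2" # FC adapters, stale and current
NEW_HDISK=hdisk4     # assumption: check the name cfgmgr actually assigns

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

rediscover() {
    run umount "$FS"
    run varyoffvg "$VG"
    for d in $HDISKS; do run rmdev -dl "$d"; done
    run lsdev -Cc adapter            # note the fcs location codes
    for d in $DARS; do run rmdev -Rdl "$d"; done
    for d in $FCS; do run rmdev -Rdl "$d"; done
    run cfgmgr                       # rediscover adapters and array
    run fget_config -Av              # inspect: does it show what you want?
    run importvg -y "$VG" "$NEW_HDISK"
    run mount "$FS"
}

rediscover
```

Printing the plan first makes it easy to review the device names before anything is removed during the outage window.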
We have moved FAStT storage from one machine to another several times. If you
follow the steps described above, no problems should occur.
Cheers,
ku
--
Han Solo:
Well Princess, it looks like you managed to keep me
here a while longer.
Princess Leia:
I had nothing to do with it. General Rieekan thinks
it's dangerous for anyone to leave the system until
they've activated the energy shield.
Han Solo:
That's a good story. I think you just can't bear to
let a gorgeous guy like me out of your sight.
Princess Leia:
I don't know where you get your delusions, laser brain!
Chewbacca laughs
Han Solo:
Laugh it up, fuzzball!