Fibre Channel Array Troubles.
- From: Brian Wheeler <bdwheele@xxxxxxxxxxx>
- Date: Wed, 16 Aug 2006 09:53:28 -0400
Hey all, first a little background:
A few weeks ago we had a cache battery go out on our fastt 200
(controller b) so I set the preferred path to controller a, but it kept
switching back to controller b. After running diagnostics, the
external wrap plug test failed on the pci card which was connected to
controller a, so in order to get everything functioning, I swapped the
cables on the PCI cards and I was able to switch to controller a. No
problems...
Original config:
slot 4 -> controller a
slot 5 -> controller b
After futzing around:
slot 5 -> controller a
slot 4 -> (no connection)
(no connection) -> controller b
After I replaced the battery and got a replacement FC card, the disk
array was acting funny. Sometimes it won't come up after a reboot and
failover isn't working. The current configuration is:
slot 5 -> controller a
slot 4 -> (no card)
slot 2 -> controller b
To get AIX less confused I had to put the card into slot 2 because for
some reason it was terribly flaky in slot 4. The replacement was a
different revision. The firmware is (now) the latest on both cards.
I have a bunch of defined devices that represent the various
configurations I had and I can't get them to go away:
dac0 Defined 1Z-08-01 3542 (200) Disk Array Controller
dac1 Defined 1Z-08-01 3542 (200) Disk Array Controller
dac2 Defined 1D-08-01 3542 (200) Disk Array Controller
dac4 Defined 1D-08-01 fcparray Disk Array Controller
fcnet0 Defined 1Z-08-02 Fibre Channel Network Protocol
Device
fcnet1 Defined 1D-08-02 Fibre Channel Network Protocol
Device
fcnet2 Defined 1H-08-02 Fibre Channel Network Protocol
Device
fcs0 Defined 1Z-08 FC Adapter
fscsi0 Defined 1Z-08-01 FC SCSI I/O Controller Protocol
Device
The 'working' devices are Available:
dac3 Available 1D-08-01 3542 (200) Disk Array Controller
dac5 Available 1H-08-01 3542 (200) Disk Array Controller
dar0 Available 3542 (200) Disk Array Router
fcs1 Available 1D-08 FC Adapter
fcs2 Available 1H-08 FC Adapter
fscsi1 Available 1D-08-01 FC SCSI I/O Controller Protocol
Device
fscsi2 Available 1H-08-01 FC SCSI I/O Controller Protocol
Device
hdisk4 Available 1D-08-01 3542 (200) Disk Array Device
When I try running rmdev (or using the smit equiv) on the Defined
devices that aren't there anymore, I get an error telling me they're
not in the right state.
The dar0 device is using an Available device and a Defined device:
bash-2.04# fget_config -l dar0
dac3 ACTIVE dacNONE ACTIVE
hdisk4 dac3
bash-2.04# lsattr -El dar0
act_controller dac3 Active Controllers
False
aen_freq 600 Polled AEN frequency in seconds
True
all_controller dac3,dac4 Available Controllers
False
autorecovery no Autorecover after failure is corrected
True
balance_freq 600 Dynamic Load Balancing frequency in seconds
True
cache_size 88 Cache size for both controllers
False
fast_write_ok no Fast Write available
False
held_in_reset none Held-in-reset controller
True
hlthchk_freq 600 Health check frequency in seconds
True
load_balancing no Dynamic Load Balancing
True
switch_retries 5 Number of times to retry failed switches
True
HELP! I'm pretty confused, but luckily the array is working for now,
but its flakier than I'd rather it be.
So, how do I:
* Get rid of 'old' devices and have the configuration accurately
represent the current state?
* How do I get dar0 to use fcs3 & fcs5
* catch a clue on how all of this is supposed to work...
Thanks!
Brian
- Follow-Ups:
- Re: Fibre Channel Array Troubles.
- From: Hans-Dieter Kutz
- Re: Fibre Channel Array Troubles.
- From: Jeff Barratt-McCartney
- Re: Fibre Channel Array Troubles.
- Prev by Date: Job Posting
- Next by Date: Re: Fibre Channel Array Troubles.
- Previous by thread: Job Posting
- Next by thread: Re: Fibre Channel Array Troubles.
- Index(es):
Relevant Pages
|
|