Re: Strange HACMP config error

From: Jason Mather (goz02451_at_yahoo.com)
Date: 07/24/03


Date: Thu, 24 Jul 2003 17:35:35 -0400

Patrick Agsten wrote:
> Hi there,
>
> I've got some strange problems here. Running AIX 4.3.3 an HACMP 4.4 we
> had a full network blackout yesterday. The cluster was instable after
> that. I thought it would be a good idea to restart both nodes. Now the
> cluster has problems I can't overcome:
> First one IP interface is gone. It is not to be seen in cllsif and
> clshowres but if I try to re-define it smitty says the interface already
> exists. Cluster verification didn't say anything regarding this phenomen.
> Second the cluster verification log states that a volume group doesn't
> exist at one node but if I have a look at that, the vg is to be see in
> lsvg and ist active and the filesystems are all initialized correctly.
>
> For I don't have any idea how to reactivate the interface I have bound
> an alias to the network adapter manually for ensuring production but if
> any failure occures it will not be recoverd by the cluster so if anyone
> has any idea hearing from it would be greatly appreciated.
>
> Thanks for any help,
> Patrick
>

All problems are strange until they're solved. This one sounds like
IP address conflicts. When you had the network outage, did each
server try to take over the other's IP address? Hopefully
you do not also take over MAC addresses; duplicate MAC addresses
make for interesting routing problems.

ifconfig will tell you what aliases are assigned on each interface.

ping will tell you if IP exists elsewhere on the net.

arp will show you MAC address

network switch logs can show routing problems if you have access to them.
If not, its time for netstat and tcpdump.

-- Jason



Relevant Pages

  • Re: Failover on public network failiure?
    ... There are a number of ways the cluster service checks the state of the ... If the network interface is determined to be failed then any ... the IP address on the current node the groupwill failover to another ...
    (microsoft.public.windows.server.clustering)
  • Re: How to change the heartbeat rate or should I?
    ... Communication between Server Cluster nodes is critical for smooth cluster ... each cluster network must fail independently of all other ... traffic from the network adapter that is set to Internal Cluster ...
    (microsoft.public.windows.server.clustering)
  • RE: cluster completely unavailable
    ... The network peoples says that switches was ok, network was ok, dns and wins ... Only the cluster suffered from this situation. ... completed update seq 225906 type 2 context 15 ...
    (microsoft.public.windows.server.clustering)
  • Cluster x does not appear to have a dedicated heartbeat network connection. - Unterschiedliche A
    ... nachdem ich gerade in den letzten Zuegen der Migration auf Exchange 2007 CCR ... Cluster DEFRA-EX1MBX1 does not appear to have a dedicated heartbeat network ...
    (microsoft.public.de.exchange)
  • cluster completely unavailable
    ... The network peoples says that switches was ok, network was ok, dns and wins ... Only the cluster suffered from this situation. ... completed update seq 225906 type 2 context 15 ...
    (microsoft.public.windows.server.clustering)