HACMP6.1拔网线测试的怪现象
环境:2个节点+DS4700。AIX6.1-06,HACMP6.1-04,使用IP Alias方式,热备。
节点A手动“Move Resource Groups”到节点B,一切正常,重新启动节点A,RG也能自动移回来。
节点A拔掉Service IP所在网卡网线,Service IP切换到Standby网卡,再拔掉Standby网卡网线,应该是资源切换到节点B,可出现的现象是节点B没有接管Service IP和VG,从errpt中查看报了2条硬盘操作错误,然后就处于recover状态。重新启动2个节点,查看节点B硬盘和VG状态,都正常,可varyon和mount 文件系统。
请教什么原因?谢谢。
节点A手动“Move Resource Groups”到节点B,一切正常,重新启动节点A,RG也能自动移回来。
节点A拔掉Service IP所在网卡网线,Service IP切换到Standby网卡,再拔掉Standby网卡网线,应该是资源切换到节点B,可出现的现象是节点B没有接管Service IP和VG,从errpt中查看报了2条硬盘操作错误,然后就处于recover状态。重新启动2个节点,查看节点B硬盘和VG状态,都正常,可varyon和mount 文件系统。
请教什么原因?谢谢。
作者: dugong 发布时间: 2011-03-11
要看的东西多了
作者: 老农 发布时间: 2011-03-11
# ./cldump
Obtaining information via SNMP from Node: pora_Node1...
_____________________________________________________________________________
Cluster Name: Ora_Cluster
Cluster State: UP
Cluster Substate: STABLE
_____________________________________________________________________________
Node Name: pora_Node1 State: UP
Network Name: net_ether_01 State: UP
Address: 172.16.1.185 Label: pora_svc State: UP
Address: 192.168.100.101 Label: pora1_boot State: UP
Address: 192.168.200.101 Label: pora1_stb State: UP
Network Name: net_ether_02 State: UP
Address: 192.168.10.1 Label: pora1_hb State: UP
Network Name: net_rs232_01 State: UP
Address: Label: rs232_node1 State: UP
Node Name: pora_Node2 State: UP
Network Name: net_ether_01 State: UP
Address: 192.168.100.102 Label: pora2_boot State: UP
Address: 192.168.200.102 Label: pora2_stb State: UP
Network Name: net_ether_02 State: UP
Address: 192.168.10.2 Label: pora2_hb State: UP
Network Name: net_rs232_01 State: UP
Address: Label: rs232_node2 State: UP
Cluster Name: Ora_Cluster
Resource Group Name: Ora_RG
Startup Policy: Online On Home Node Only
Fallover Policy: Fallover To Next Priority Node In The List
Fallback Policy: Fallback To Higher Priority Node In The List
Site Policy: ignore
Node Group State
---------------------------- ---------------
pora_Node1 ONLINE
pora_Node2 OFFLINE
切换时的硬盘报错
B6267342 0310160711 P H hdisk2 DISK OPERATION ERROR
B6267342 0310160711 P H hdisk2 DISK OPERATION ERROR
LABEL: SC_DISK_ERR2
IDENTIFIER: B6267342
Date/Time: Fri Mar 11 11:52:41 GMT+08:00 2011
Sequence Number: 204
Machine Id: 00F675E64C00
Node Id: pora2
Class: H
Type: PERM
WPAR: Global
Resource Name: hdisk2
Resource Class:
Resource Type:
Location:
VPD:
Manufacturer................IBM
Machine Type and Model......1814 FAStT
ROS Level and ID............30393136
Serial Number...............
Device Specific.(Z0)........0000053245004032
Device Specific.(Z1)........
Description
DISK OPERATION ERROR
Probable Causes
DASD DEVICE
Failure Causes
DISK DRIVE
DISK DRIVE ELECTRONICS
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
PATH ID
0
SENSE DATA
0A00 2800 0000 0000 0000 0104 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
还需要看什么内容?
作者: dugong 发布时间: 2011-03-11
还是差太远了。
学过HA么?
学过HA么?
作者: 老农 发布时间: 2011-03-11
看过书,看过资料,没去上过课
作者: dugong 发布时间: 2011-03-11
唉。。。。差距啊
作者: 老农 发布时间: 2011-03-11
老大,帮帮忙,说说是哪里的问题,有个大致方向也行呀。谢谢了。
作者: dugong 发布时间: 2011-03-11
实在没时间啰嗦了
作者: 老农 发布时间: 2011-03-11