由网络断开引起的问题。。
AIX 61 HA:55 ORACLE :10G 2套RAC环境
问题描述:
昨天下午机房网络突然全部断网,导致有一套RAC环境出现异常,其中一台在网络恢复后自动带起concurrent vg,而另一台不能自动带起。手动启动HA后完全恢复。
而另一套RAC中2台机器都可以自动恢复到正常情况。
errpt结果如下:
A机:
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
F3931284 0110160611 I H ent9 ETHERNET NETWORK RECOVERY MODE
F3931284 0110160611 I H ent8 ETHERNET NETWORK RECOVERY MODE
3D32B80D 0110160411 P S topsvcs NIM thread blocked
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
173C787F 0110160411 I S topsvcs Possible malfunction on local adapter
173C787F 0110160411 I S topsvcs Possible malfunction on local adapter
EC0BCCD4 0110160411 T H ent8 ETHERNET DOWN
EC0BCCD4 0110160411 T H ent9 ETHERNET DOWN
CCC89167 1129024110 T H sissas0 ADAPTER ERROR
B机器:
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
AFA89905 0110175811 I O grpsvcs Group Services daemon started
923E1911 0110175811 P S topsvcs Failed to open NIM connection
97419D60 0110175811 I O topsvcs Topology Services daemon started
A6DF45AA 0110161911 I O RMCdaemon The daemon is started.
D221BD55 0110161811 I O perftune RESTRICTED TUNABLES MODIFIED AT REBOOT
67145A39 0110161711 U S SYSDUMP SYSTEM DUMP
F48137AC 0110161611 U O minidump COMPRESSED MINIMAL DUMP
F3931284 0110161611 I H ent8 ETHERNET NETWORK RECOVERY MODE
9DBCFDEE 0110161811 T O errdemon ERROR LOGGING TURNED ON
A924A5FC 0110160511 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
173C787F 0110160511 I S topsvcs Possible malfunction on local adapter
173C787F 0110160511 I S topsvcs Possible malfunction on local adapter
EC0BCCD4 0110160511 T H ent9 ETHERNET DOWN
EC0BCCD4 0110160511 T H ent8 ETHERNET DOWN
A2205861 0102002211 P S SYSPROC Excessive interrupt disablement time
1BA7DF4E 1224193110 P S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224193110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224193110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224192110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224192110 P S harmad SOFTWARE PROGRAM ERROR
2F3E09A4 1103114310 I H sys0 REPAIR ACTION
查了半天也查不到什么问题。请各位老师看看。
问题描述:
昨天下午机房网络突然全部断网,导致有一套RAC环境出现异常,其中一台在网络恢复后自动带起concurrent vg,而另一台不能自动带起。手动启动HA后完全恢复。
而另一套RAC中2台机器都可以自动恢复到正常情况。
errpt结果如下:
A机:
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
F3931284 0110160611 I H ent9 ETHERNET NETWORK RECOVERY MODE
F3931284 0110160611 I H ent8 ETHERNET NETWORK RECOVERY MODE
3D32B80D 0110160411 P S topsvcs NIM thread blocked
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
AB59ABFF 0110160411 U U LIBLVM Remote node Concurrent Volume Group fail
173C787F 0110160411 I S topsvcs Possible malfunction on local adapter
173C787F 0110160411 I S topsvcs Possible malfunction on local adapter
EC0BCCD4 0110160411 T H ent8 ETHERNET DOWN
EC0BCCD4 0110160411 T H ent9 ETHERNET DOWN
CCC89167 1129024110 T H sissas0 ADAPTER ERROR
B机器:
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
AFA89905 0110175811 I O grpsvcs Group Services daemon started
923E1911 0110175811 P S topsvcs Failed to open NIM connection
97419D60 0110175811 I O topsvcs Topology Services daemon started
A6DF45AA 0110161911 I O RMCdaemon The daemon is started.
D221BD55 0110161811 I O perftune RESTRICTED TUNABLES MODIFIED AT REBOOT
67145A39 0110161711 U S SYSDUMP SYSTEM DUMP
F48137AC 0110161611 U O minidump COMPRESSED MINIMAL DUMP
F3931284 0110161611 I H ent8 ETHERNET NETWORK RECOVERY MODE
9DBCFDEE 0110161811 T O errdemon ERROR LOGGING TURNED ON
A924A5FC 0110160511 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
173C787F 0110160511 I S topsvcs Possible malfunction on local adapter
173C787F 0110160511 I S topsvcs Possible malfunction on local adapter
EC0BCCD4 0110160511 T H ent9 ETHERNET DOWN
EC0BCCD4 0110160511 T H ent8 ETHERNET DOWN
A2205861 0102002211 P S SYSPROC Excessive interrupt disablement time
1BA7DF4E 1224193110 P S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224193110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224193110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
12081DC6 1224193110 P S harmad SOFTWARE PROGRAM ERROR
CB4A951F 1224192110 I S SRC SOFTWARE PROGRAM ERROR
12081DC6 1224192110 P S harmad SOFTWARE PROGRAM ERROR
2F3E09A4 1103114310 I H sys0 REPAIR ACTION
查了半天也查不到什么问题。请各位老师看看。
作者: comm 发布时间: 2011-01-11
你想查什么?不是都正常了吗
作者: DCup 发布时间: 2011-01-11
客户想知道为什么这台机器在网络恢复后不能自动带起共享VG??
而其他都可以。。
而其他都可以。。
作者: comm 发布时间: 2011-01-11
明显HA没有启动呗
作者: DCup 发布时间: 2011-01-11