2007-8-21 17:21
agres
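Could everyone check whether my RAC ORACM configuration has any problems?
Environment: VMware 6.0 + Red Hat AS4 + Oracle 9i
Overview: two nodes, t1 and t2.
The shared storage is a virtual SCSI disk, carved into 14 raw devices for Oracle to use: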
/dev/raw/raw1: bound to major 8, minor 17
/dev/raw/raw2: bound to major 8, minor 18
/dev/raw/raw3: bound to major 8, minor 19
/dev/raw/raw4: bound to major 8, minor 21
/dev/raw/raw5: bound to major 8, minor 22
/dev/raw/raw6: bound to major 8, minor 23
/dev/raw/raw7: bound to major 8, minor 24
/dev/raw/raw8: bound to major 8, minor 25
/dev/raw/raw9: bound to major 8, minor 26
/dev/raw/raw10: bound to major 8, minor 27
/dev/raw/raw11: bound to major 8, minor 28
/dev/raw/raw12: bound to major 8, minor 29
/dev/raw/raw13: bound to major 8, minor 30
/dev/raw/raw14: bound to major 8, minor 31
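For anyone reading along, the major/minor pairs above can be mapped back to block device names. This is only a sketch: it assumes the conventional Linux numbering for SCSI disk major 8 (16 minors per disk, with minor % 16 as the partition number), and the names `sd_name` and `parse_bindings` are made up for illustration.

```python
import re

def sd_name(major: int, minor: int) -> str:
    """Map a (major, minor) pair to an sd device name.

    Assumption: major 8 is the first SCSI disk major, each disk owns
    16 consecutive minors, and minor % 16 is the partition number.
    """
    if major != 8:
        raise ValueError("only major 8 (sd) is handled in this sketch")
    disk, part = divmod(minor, 16)
    name = "sd" + chr(ord("a") + disk)
    return name if part == 0 else f"{name}{part}"

# A few of the bindings listed in the post.
RAW_QA = """\
/dev/raw/raw1: bound to major 8, minor 17
/dev/raw/raw4: bound to major 8, minor 21
/dev/raw/raw14: bound to major 8, minor 31
"""

PATTERN = re.compile(r"^(/dev/raw/raw\d+):\s+bound to major (\d+), minor (\d+)$")

def parse_bindings(text: str) -> dict:
    """Return {raw device: sd device name} for each binding line."""
    bindings = {}
    for line in text.splitlines():
        m = PATTERN.match(line.strip())
        if m:
            bindings[m.group(1)] = sd_name(int(m.group(2)), int(m.group(3)))
    return bindings

if __name__ == "__main__":
    for raw, dev in parse_bindings(RAW_QA).items():
        print(raw, "->", "/dev/" + dev)
```

Under that assumption, raw1 through raw14 all sit on the second SCSI disk (sdb), and the skipped minor 20 would correspond to sdb4, which the post does not bind to any raw device.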
CMQuorum uses /dev/raw/raw1. Running ./ocmstart.sh produces the following logs:
t1----cm.log:
oracm, version[ 9.2.0.2.0.47 ] started {Tue Aug 21 16:53:13 2007 }
KernelModuleName is hangcheck-timer {Tue Aug 21 16:53:13 2007 }
OemNodeConfig(): Network Address of node0: 10.1.1.1 (port 9998)
{Tue Aug 21 16:53:13 2007 }
OemNodeConfig(): Network Address of node1: 10.1.1.2 (port 9998)
{Tue Aug 21 16:53:13 2007 }
>WARNING: OemInit2: Opened file(/dev/raw/raw1 6), tid = main:-1219988384 file = oem.c, line = 491 {Tue Aug 21 16:53:13 2007 }
Debug Hang : ClusterListener (PID=2258) Registered withwatchdog daemon. {Tue Aug 21 16:53:13 2007 }
InitializeCM: ModuleName = hangcheck-timer {Tue Aug 21 16:53:13 2007 }
InitializeCM: Kernel module hangcheck-timer is not loaded, try to load(/sbin/insmod hangcheck-timer) {Tue Aug 21 16:53:13 2007 }
InitializeCM: Kernel module hangcheck-timer successfully loaded {Tue Aug 21 16:53:13 2007 }
Debug Hang : CmConnectListener (PID=2258):Registered with watchdog daemon. {Tue Aug 21 16:53:13 2007 }
Debug Hang :StartNMMon (PID=2258) Registered with watchdog daemon. {Tue Aug 21 16:53:13 2007 }
CreateLocalEndpoint(): Network Address: 10.1.1.1
{Tue Aug 21 16:53:13 2007 }
Debug Hang :PollingThread (PID=135159169): Registered with {Tue Aug 21 16:53:13 2007 }
Debug Hang : DiskPingThread (PID=135159169): Registered with {Tue Aug 21 16:53:13 2007 }
Debug Hang :SendingThread (PID=135159169): Registered with {Tue Aug 21 16:53:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][01] {Tue Aug 21 16:53:18 2007 }
HandleUpdate(): SYNC(0) from node(0) completed {Tue Aug 21 16:53:20 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:53:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][01] {Tue Aug 21 16:53:20 2007 }
Successful reconfiguration, 1 active node(s) node 0 is the master, my node num is 0 (reconfig 1) {Tue Aug 21 16:53:20 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][01] {Tue Aug 21 16:54:15 2007 }
HandleUpdate(): SYNC(1) from node(0) completed {Tue Aug 21 16:54:17 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:54:17 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 16:54:17 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][03] {Tue Aug 21 16:54:17 2007 }
Debug Hang : CMNodeListener(PID=2258) Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
Successful reconfiguration, 2 active node(s) node 0 is the master, my node num is 0 (reconfig 2) {Tue Aug 21 16:54:17 2007 }
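The bracketed bytes on the NMEVENT lines look like a node membership bitmap: [01] when only node 0 is active and [03] once node 1 has joined. The sketch below decodes them under that reading. It is an assumption drawn from this log, not from Oracle documentation, and `active_nodes` is a hypothetical helper name.

```python
import re

def active_nodes(event_line: str) -> list:
    """Return the node numbers whose bit is set in an NMEVENT line.

    Assumption: the bracketed bytes form one big-endian integer and
    bit i stands for node i. This matches the log in the post but is
    not confirmed by any documentation.
    """
    hex_bytes = re.findall(r"\[([0-9A-Fa-f]{2})\]", event_line)
    value = int("".join(hex_bytes), 16) if hex_bytes else 0
    return [i for i in range(value.bit_length()) if (value >> i) & 1]

if __name__ == "__main__":
    line = ("NMEVENT_RECONFIG [00][00][00][00][00][00][00][03] "
            "{Tue Aug 21 16:54:17 2007 }")
    print(active_nodes(line))
```

Read this way, the [03] event at 16:54:17 agrees with the "2 active node(s)" message that follows it.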
t2------cm.log:
oracm, version[ 9.2.0.2.0.47 ] started {Tue Aug 21 16:54:17 2007 }
KernelModuleName is hangcheck-timer {Tue Aug 21 16:54:17 2007 }
OemNodeConfig(): Network Address of node0: 10.1.1.1 (port 9998)
{Tue Aug 21 16:54:17 2007 }
OemNodeConfig(): Network Address of node1: 10.1.1.2 (port 9998)
{Tue Aug 21 16:54:17 2007 }
>WARNING: OemInit2: Opened file(/dev/raw/raw1 8), tid = main:16384 file = oem.c, line = 491 {Tue Aug 21 16:54:17 2007 }
Debug Hang : ClusterListener (PID=2297) Registered withwatchdog daemon. {Tue Aug 21 16:54:17 2007 }
InitializeCM: ModuleName = hangcheck-timer {Tue Aug 21 16:54:17 2007 }
InitializeCM: Kernel module hangcheck-timer is not loaded, try to load(/sbin/insmod hangcheck-timer) {Tue Aug 21 16:54:17 2007 }
InitializeCM: Kernel module hangcheck-timer successfully loaded {Tue Aug 21 16:54:17 2007 }
Debug Hang :StartNMMon (PID=2292) Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
CreateLocalEndpoint(): Network Address: 10.1.1.2
{Tue Aug 21 16:54:17 2007 }
Debug Hang :PollingThread (PID=135159169): Registered with {Tue Aug 21 16:54:17 2007 }
Debug Hang : DiskPingThread (PID=135159169): Registered with {Tue Aug 21 16:54:17 2007 }
Debug Hang :SendingThread (PID=135159169): Registered with {Tue Aug 21 16:54:17 2007 }
Debug Hang : CmConnectListener (PID=2299):Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
UpdateNodeState(): node(1) added udpated {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): SYNC(1) from node(0) completed {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 16:54:19 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][03] {Tue Aug 21 16:54:19 2007 }
Debug Hang : CMNodeListener(PID=2305) Registered with watchdog daemon. {Tue Aug 21 16:54:20 2007 }
Successful reconfiguration, 2 active node(s) node 0 is the master, my node num is 1 (reconfig 2) {Tue Aug 21 16:54:20 2007 }
>WARNING: RecvMsg: socket closed for node(0), tid = CMNodeListener:163851 file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }
Debug Hang :CMNodeListener(PID=2305) UnRegistered with watchdog daemon. {Tue Aug 21 17:01:52 2007 }
>WARNING: PollingThread(): node(0) missed(12) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:04 2007 }
>WARNING: PollingThread(): node(0) missed(13) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:05 2007 }
>WARNING: PollingThread(): node(0) missed(14) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:06 2007 }
>WARNING: PollingThread(): node(0) missed(15) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:07 2007 }
>WARNING: PollingThread(): node(0) missed(16) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:08 2007 }
>WARNING: PollingThread(): node(0) missed(17) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:09 2007 }
>WARNING: PollingThread(): node(0) missed(18) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:10 2007 }
>WARNING: PollingThread(): node(0) missed(19) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:11 2007 }
>WARNING: PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:12 2007 }
>WARNING: PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:18 2007 }
HandleUpdate(): SYNC(2) from node(1) completed {Tue Aug 21 17:02:20 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 17:02:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:20 2007 }
Successful reconfiguration, 1 active node(s) node 1 is the master, my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }
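To follow membership changes without reading the whole log, the "Successful reconfiguration" lines can be pulled out on their own. A minimal sketch, with the regular expression written against the exact wording in the log above; `reconfigs` is a hypothetical helper name.

```python
import re

RECONFIG = re.compile(
    r"Successful reconfiguration,\s+(\d+) active node\(s\) "
    r"node (\d+) is the master, my node num is (\d+) \(reconfig (\d+)\)"
)

def reconfigs(log_text: str) -> list:
    """Return one dict per 'Successful reconfiguration' line, in log order."""
    out = []
    for m in RECONFIG.finditer(log_text):
        active, master, me, seq = map(int, m.groups())
        out.append({"reconfig": seq, "active": active, "master": master, "me": me})
    return out

# The two reconfiguration lines from the t2 cm.log in the post.
T2_LOG = """\
Successful reconfiguration, 2 active node(s) node 0 is the master, my node num is 1 (reconfig 2) {Tue Aug 21 16:54:20 2007 }
Successful reconfiguration, 1 active node(s) node 1 is the master, my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }
"""

if __name__ == "__main__":
    for r in reconfigs(T2_LOG):
        print(r)
```

For the t2 log this shows the cluster going from two active nodes with node 0 as master to one active node with node 1 as master.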
Output of ps -ef|grep oracm on t1:
[root@t1 bin]# ps -ef|grep oracm
root 2402 1 9 17:21 pts/0 00:00:00 oracm
root 2415 2353 0 17:21 pts/0 00:00:00 grep oracm
Output of ps -ef|grep oracm on t2:
[root@t2 /]# ps -ef|grep oracm
root 2292 1 0 16:54 pts/0 00:00:00 oracm
root 2294 2292 0 16:54 pts/0 00:00:00 oracm
root 2295 2294 0 16:54 pts/0 00:00:00 oracm
root 2296 2294 0 16:54 pts/0 00:00:00 oracm
root 2297 2294 0 16:54 pts/0 00:00:00 oracm
root 2299 2294 0 16:54 pts/0 00:00:00 oracm
root 2300 2294 0 16:54 pts/0 00:00:00 oracm
root 2301 2294 0 16:54 pts/0 00:00:00 oracm
root 2302 2294 0 16:54 pts/0 00:00:00 oracm
root 2335 2294 0 17:21 pts/0 00:00:00 oracm
root 2337 2083 0 17:22 pts/0 00:00:00 grep oracm
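The two listings differ mainly in how many oracm entries appear and when they started. A small sketch for counting them from pasted ps -ef output follows. It assumes the usual eight-column ps -ef layout (UID PID PPID C STIME TTY TIME CMD), and `oracm_processes` is a hypothetical helper name.

```python
def oracm_processes(ps_output: str) -> list:
    """Return pid, ppid and start time for each row whose CMD is exactly oracm.

    Assumes the ps -ef column order UID PID PPID C STIME TTY TIME CMD.
    Shell prompt lines and the 'grep oracm' row are skipped because
    their last field is not exactly 'oracm'.
    """
    rows = []
    for line in ps_output.splitlines():
        fields = line.split(None, 7)
        if len(fields) == 8 and fields[7].strip() == "oracm":
            rows.append({"pid": int(fields[1]),
                         "ppid": int(fields[2]),
                         "stime": fields[4]})
    return rows

# The t1 listing from the post.
T1_PS = """\
[root@t1 bin]# ps -ef|grep oracm
root 2402 1 9 17:21 pts/0 00:00:00 oracm
root 2415 2353 0 17:21 pts/0 00:00:00 grep oracm
"""

if __name__ == "__main__":
    procs = oracm_processes(T1_PS)
    print(len(procs), "oracm entries:", procs)
```

On the t1 listing this finds a single oracm entry with STIME 17:21, while the t2 listing contains ten entries, nine of them started at 16:54.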
Does the configuration above have any problems?
2007-8-21 17:24
agres
Reply to post #1 by agres
>WARNING: RecvMsg: socket closed for node(0), tid = CMNodeListener:163851 file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }
Debug Hang :CMNodeListener(PID=2305) UnRegistered with watchdog daemon. {Tue Aug 21 17:01:52 2007 }
>WARNING: PollingThread(): node(0) missed(12) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:04 2007 }
>WARNING: PollingThread(): node(0) missed(13) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:05 2007 }
>WARNING: PollingThread(): node(0) missed(14) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:06 2007 }
>WARNING: PollingThread(): node(0) missed(15) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:07 2007 }
>WARNING: PollingThread(): node(0) missed(16) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:08 2007 }
>WARNING: PollingThread(): node(0) missed(17) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:09 2007 }
>WARNING: PollingThread(): node(0) missed(18) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:10 2007 }
>WARNING: PollingThread(): node(0) missed(19) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:11 2007 }
>WARNING: PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:12 2007 }
>WARNING: PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:18 2007 }
HandleUpdate(): SYNC(2) from node(1) completed {Tue Aug 21 17:02:20 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 17:02:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:20 2007 }
Successful reconfiguration, 1 active node(s) node 1 is the master, my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }
These warnings appeared because I manually killed the oracm process on t1 with kill -9.
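As a rough check on how long t2 took to react, the timestamps in braces can be compared directly. The sketch below parses them by hand to avoid locale-dependent date parsing; `stamp` is a hypothetical helper name, and the two sample lines are copied from the quoted log.

```python
import re
from datetime import datetime

MONTHS = {m: i for i, m in enumerate(
    "Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec".split(), start=1)}

# Matches the trailing stamp, e.g. {Tue Aug 21 17:01:52 2007 }
STAMP = re.compile(r"\{\w{3} (\w{3}) +(\d+) (\d+):(\d+):(\d+) (\d{4}) ?\}")

def stamp(line: str) -> datetime:
    """Extract the trailing {...} timestamp of a cm.log line."""
    m = STAMP.search(line)
    if m is None:
        raise ValueError("no timestamp found in line")
    mon, day, hh, mm, ss, year = m.groups()
    return datetime(int(year), MONTHS[mon], int(day), int(hh), int(mm), int(ss))

CLOSED = (">WARNING: RecvMsg: socket closed for node(0), tid = CMNodeListener:163851 "
          "file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }")
RECONFIGURED = ("Successful reconfiguration, 1 active node(s) node 1 is the master, "
                "my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }")

if __name__ == "__main__":
    delta = stamp(RECONFIGURED) - stamp(CLOSED)
    print("seconds from socket close to reconfiguration:", int(delta.total_seconds()))
```

For these two lines the gap is 28 seconds, covering the missed check-ins counted up to 20 and the suspend/reconfig step that followed.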