2007-8-21 17:21 agres
Could you take a look at whether my RAC ORACM configuration has any problems?

Environment: VMware 6.0 + Red Hat AS4 + Oracle9i
Setup: two nodes, t1 and t2
The shared disk is a virtual SCSI disk, carved into 14 raw devices for Oracle to use:
/dev/raw/raw1:  bound to major 8, minor 17
/dev/raw/raw2:  bound to major 8, minor 18
/dev/raw/raw3:  bound to major 8, minor 19
/dev/raw/raw4:  bound to major 8, minor 21
/dev/raw/raw5:  bound to major 8, minor 22
/dev/raw/raw6:  bound to major 8, minor 23
/dev/raw/raw7:  bound to major 8, minor 24
/dev/raw/raw8:  bound to major 8, minor 25
/dev/raw/raw9:  bound to major 8, minor 26
/dev/raw/raw10: bound to major 8, minor 27
/dev/raw/raw11: bound to major 8, minor 28
/dev/raw/raw12: bound to major 8, minor 29
/dev/raw/raw13: bound to major 8, minor 30
/dev/raw/raw14: bound to major 8, minor 31
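
For anyone reading the bindings above, here is a small sketch that turns the major/minor pairs back into block device names. It assumes the usual Linux numbering, where major 8 covers the first 16 SCSI disks and each disk gets 16 minor numbers (minor 16 is sdb, 17 is sdb1, and so on). The helper name sd_name and the MINORS list are made up for illustration; the minor values are copied from the listing.

```python
import string

# Minor numbers copied from the raw device listing (raw1 .. raw14).
MINORS = [17, 18, 19] + list(range(21, 32))


def sd_name(major: int, minor: int) -> str:
    """Map a (major, minor) pair to an sd device name.

    Assumption: major 8 is the first SCSI disk block (sda..sdp),
    with 16 minors per disk; partition 0 means the whole disk.
    """
    if major != 8:
        raise ValueError("this sketch only handles major 8 (sd)")
    disk, part = divmod(minor, 16)
    name = "sd" + string.ascii_lowercase[disk]
    return name if part == 0 else f"{name}{part}"


if __name__ == "__main__":
    for raw_index, minor in enumerate(MINORS, start=1):
        print(f"/dev/raw/raw{raw_index} -> /dev/{sd_name(8, minor)}")
```

Under that assumption every raw device sits on a partition of the second SCSI disk (sdb), and minor 20 (sdb4) is the only partition number in the range that is not bound, which would be consistent with it being an extended partition.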
CMQuorum uses /dev/raw/raw1. Running ./ocmstart.sh produces the following logs:
t1----cm.log:
oracm, version[ 9.2.0.2.0.47 ] started {Tue Aug 21 16:53:13 2007 }
KernelModuleName is hangcheck-timer {Tue Aug 21 16:53:13 2007 }
OemNodeConfig(): Network Address of node0: 10.1.1.1 (port 9998)
{Tue Aug 21 16:53:13 2007 }
OemNodeConfig(): Network Address of node1: 10.1.1.2 (port 9998)
{Tue Aug 21 16:53:13 2007 }
>WARNING:  OemInit2: Opened file(/dev/raw/raw1 6), tid = main:-1219988384 file = oem.c, line = 491 {Tue Aug 21 16:53:13 2007 }
Debug Hang : ClusterListener (PID=2258) Registered withwatchdog daemon. {Tue Aug 21 16:53:13 2007 }
InitializeCM: ModuleName = hangcheck-timer  {Tue Aug 21 16:53:13 2007 }
InitializeCM: Kernel module hangcheck-timer is not loaded, try to load(/sbin/insmod hangcheck-timer) {Tue Aug 21 16:53:13 2007 }
InitializeCM: Kernel module hangcheck-timer successfully loaded {Tue Aug 21 16:53:13 2007 }
Debug Hang : CmConnectListener (PID=2258):Registered with watchdog daemon. {Tue Aug 21 16:53:13 2007 }
Debug Hang :StartNMMon (PID=2258) Registered with watchdog daemon. {Tue Aug 21 16:53:13 2007 }
CreateLocalEndpoint(): Network Address: 10.1.1.1
{Tue Aug 21 16:53:13 2007 }
Debug Hang :PollingThread (PID=135159169): Registered with  {Tue Aug 21 16:53:13 2007 }
Debug Hang : DiskPingThread (PID=135159169): Registered with  {Tue Aug 21 16:53:13 2007 }
Debug Hang :SendingThread (PID=135159169): Registered with  {Tue Aug 21 16:53:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][01] {Tue Aug 21 16:53:18 2007 }
HandleUpdate(): SYNC(0) from node(0) completed {Tue Aug 21 16:53:20 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:53:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][01] {Tue Aug 21 16:53:20 2007 }
Successful reconfiguration,  1 active node(s) node 0 is the master, my node num is 0 (reconfig 1) {Tue Aug 21 16:53:20 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][01] {Tue Aug 21 16:54:15 2007 }
HandleUpdate(): SYNC(1) from node(0) completed {Tue Aug 21 16:54:17 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:54:17 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 16:54:17 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][03] {Tue Aug 21 16:54:17 2007 }
Debug Hang : CMNodeListener(PID=2258) Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
Successful reconfiguration,  2 active node(s) node 0 is the master, my node num is 0 (reconfig 2) {Tue Aug 21 16:54:17 2007 }

t2------cm.log:
oracm, version[ 9.2.0.2.0.47 ] started {Tue Aug 21 16:54:17 2007 }
KernelModuleName is hangcheck-timer {Tue Aug 21 16:54:17 2007 }
OemNodeConfig(): Network Address of node0: 10.1.1.1 (port 9998)
{Tue Aug 21 16:54:17 2007 }
OemNodeConfig(): Network Address of node1: 10.1.1.2 (port 9998)
{Tue Aug 21 16:54:17 2007 }
>WARNING:  OemInit2: Opened file(/dev/raw/raw1 8), tid = main:16384 file = oem.c, line = 491 {Tue Aug 21 16:54:17 2007 }
Debug Hang : ClusterListener (PID=2297) Registered withwatchdog daemon. {Tue Aug 21 16:54:17 2007 }
InitializeCM: ModuleName = hangcheck-timer  {Tue Aug 21 16:54:17 2007 }
InitializeCM: Kernel module hangcheck-timer is not loaded, try to load(/sbin/insmod hangcheck-timer) {Tue Aug 21 16:54:17 2007 }
InitializeCM: Kernel module hangcheck-timer successfully loaded {Tue Aug 21 16:54:17 2007 }
Debug Hang :StartNMMon (PID=2292) Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
CreateLocalEndpoint(): Network Address: 10.1.1.2
{Tue Aug 21 16:54:17 2007 }
Debug Hang :PollingThread (PID=135159169): Registered with  {Tue Aug 21 16:54:17 2007 }
Debug Hang : DiskPingThread (PID=135159169): Registered with  {Tue Aug 21 16:54:17 2007 }
Debug Hang :SendingThread (PID=135159169): Registered with  {Tue Aug 21 16:54:17 2007 }
Debug Hang : CmConnectListener (PID=2299):Registered with watchdog daemon. {Tue Aug 21 16:54:17 2007 }
UpdateNodeState(): node(1) added udpated {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): SYNC(1) from node(0) completed {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): NODE(0) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(1) {Tue Aug 21 16:54:19 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 16:54:19 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][03] {Tue Aug 21 16:54:19 2007 }
Debug Hang : CMNodeListener(PID=2305) Registered with watchdog daemon. {Tue Aug 21 16:54:20 2007 }
Successful reconfiguration,  2 active node(s) node 0 is the master, my node num is 1 (reconfig 2) {Tue Aug 21 16:54:20 2007 }
>WARNING:  RecvMsg:  socket closed for node(0), tid = CMNodeListener:163851 file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }
Debug Hang :CMNodeListener(PID=2305)  UnRegistered with watchdog daemon. {Tue Aug 21 17:01:52 2007 }
>WARNING:  PollingThread(): node(0) missed(12) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:04 2007 }
>WARNING:  PollingThread(): node(0) missed(13) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:05 2007 }
>WARNING:  PollingThread(): node(0) missed(14) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:06 2007 }
>WARNING:  PollingThread(): node(0) missed(15) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:07 2007 }
>WARNING:  PollingThread(): node(0) missed(16) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:08 2007 }
>WARNING:  PollingThread(): node(0) missed(17) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:09 2007 }
>WARNING:  PollingThread(): node(0) missed(18) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:10 2007 }
>WARNING:  PollingThread(): node(0) missed(19) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:11 2007 }
>WARNING:  PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:12 2007 }
>WARNING:  PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:18 2007 }
HandleUpdate(): SYNC(2) from node(1) completed {Tue Aug 21 17:02:20 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 17:02:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:20 2007 }
Successful reconfiguration,  1 active node(s) node 1 is the master, my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }

Output of ps -ef|grep oracm on t1:
[root@t1 bin]# ps -ef|grep oracm
root      2402     1  9 17:21 pts/0    00:00:00 oracm
root      2415  2353  0 17:21 pts/0    00:00:00 grep oracm

Output of ps -ef|grep oracm on t2:
[root@t2 /]# ps -ef|grep oracm
root      2292     1  0 16:54 pts/0    00:00:00 oracm
root      2294  2292  0 16:54 pts/0    00:00:00 oracm
root      2295  2294  0 16:54 pts/0    00:00:00 oracm
root      2296  2294  0 16:54 pts/0    00:00:00 oracm
root      2297  2294  0 16:54 pts/0    00:00:00 oracm
root      2299  2294  0 16:54 pts/0    00:00:00 oracm
root      2300  2294  0 16:54 pts/0    00:00:00 oracm
root      2301  2294  0 16:54 pts/0    00:00:00 oracm
root      2302  2294  0 16:54 pts/0    00:00:00 oracm
root      2335  2294  0 17:21 pts/0    00:00:00 oracm
root      2337  2083  0 17:22 pts/0    00:00:00 grep oracm

Is there anything wrong with the configuration above?

2007-8-21 17:24 agres
Reply to post #1 by agres

>WARNING:  RecvMsg:  socket closed for node(0), tid = CMNodeListener:163851 file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }
Debug Hang :CMNodeListener(PID=2305)  UnRegistered with watchdog daemon. {Tue Aug 21 17:01:52 2007 }
>WARNING:  PollingThread(): node(0) missed(12) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:04 2007 }
>WARNING:  PollingThread(): node(0) missed(13) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:05 2007 }
>WARNING:  PollingThread(): node(0) missed(14) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:06 2007 }
>WARNING:  PollingThread(): node(0) missed(15) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:07 2007 }
>WARNING:  PollingThread(): node(0) missed(16) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:08 2007 }
>WARNING:  PollingThread(): node(0) missed(17) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:09 2007 }
>WARNING:  PollingThread(): node(0) missed(18) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:10 2007 }
>WARNING:  PollingThread(): node(0) missed(19) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:11 2007 }
>WARNING:  PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:12 2007 }
>WARNING:  PollingThread(): node(0) missed(20) checkin(s), tid = PollingThread:81926 file = nmmember.c, line = 846 {Tue Aug 21 17:02:13 2007 }
NMEVENT_SUSPEND [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:18 2007 }
HandleUpdate(): SYNC(2) from node(1) completed {Tue Aug 21 17:02:20 2007 }
HandleUpdate(): NODE(1) IS ACTIVE MEMBER OF CLUSTER, INCARNATION(2) {Tue Aug 21 17:02:20 2007 }
NMEVENT_RECONFIG [00][00][00][00][00][00][00][02] {Tue Aug 21 17:02:20 2007 }
Successful reconfiguration,  1 active node(s) node 1 is the master, my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }
The warnings in this section appeared because I manually killed the oracm process on t1 with kill -9.
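
To see how long t2 took to react to the kill, a rough sketch like the one below can pull the membership events out of cm.log. It relies only on the line formats visible in the excerpt above; the function names summarize and failover_seconds are made up for illustration, and other oracm versions may log differently.

```python
import re
from datetime import datetime

# Patterns are based only on the log lines quoted in this thread.
STAMP = re.compile(r"\{\s*(\w{3})\s+(\w{3})\s+(\d+)\s+(\d\d:\d\d:\d\d)\s+(\d{4})\s*\}")
CLOSED = re.compile(r"socket closed for node\((\d+)\)")
RECONFIG = re.compile(
    r"Successful reconfiguration,\s+(\d+) active node\(s\) node (\d+) is the master"
)

SAMPLE = [
    ">WARNING:  RecvMsg:  socket closed for node(0), tid = CMNodeListener:163851 "
    "file = cmipc.c, line = 1036 {Tue Aug 21 17:01:52 2007 }",
    "Successful reconfiguration,  1 active node(s) node 1 is the master, "
    "my node num is 1 (reconfig 3) {Tue Aug 21 17:02:20 2007 }",
]


def summarize(lines):
    """Return (timestamp, description) pairs for socket-close and reconfig lines."""
    events = []
    for line in lines:
        stamp = STAMP.search(line)
        if stamp is None:
            continue  # some entries put the timestamp on the following line
        when = datetime.strptime(" ".join(stamp.groups()), "%a %b %d %H:%M:%S %Y")
        closed = CLOSED.search(line)
        reconfig = RECONFIG.search(line)
        if closed:
            events.append((when, f"socket closed for node {closed.group(1)}"))
        elif reconfig:
            events.append(
                (when, f"{reconfig.group(1)} active node(s), master is node {reconfig.group(2)}")
            )
    return events


def failover_seconds(events):
    """Seconds between the first socket close and the next reconfiguration."""
    start = next(t for t, what in events if what.startswith("socket closed"))
    end = next(t for t, what in events if t >= start and "active node" in what)
    return (end - start).total_seconds()


if __name__ == "__main__":
    found = summarize(SAMPLE)
    for when, what in found:
        print(when.time(), what)
    print("seconds until reconfiguration:", failover_seconds(found))
```

On a real node, the lines could be read with open("cm.log") instead of SAMPLE; the path to cm.log is not given in this thread, so it is left to the reader.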
