crs进程无法结束怎么办起不来怎么办哩

系统起不来常见原因_百度文库
两大类热门资源免费畅读
续费一年阅读会员,立省24元!
系统起不来常见原因
上传于||文档简介
&&介绍电脑系统无法正常启动的常见原因
阅读已结束,如果下载本文需要使用0下载券
想免费下载更多文档?
定制HR最喜欢的简历
你可能喜欢查看: 4239|回复: 8
11gr2 的一个节点crs无法启动 asm也起不来
论坛徽章:3
版本是11203,系统是aix
db1是失败节点,db2是正常的。
root用户执行:ocrcheck
PROT-602:Failed to retrieve data from the cluster registryPROC-26:Error while accessing the physical storageORA-15077:could not locate ASM instance serving a required diskgroup
请帮忙看看,非常感谢!
论坛徽章:3
然后用grid用户执行,
失败节点ocrcheck_grid用户.jpg (30.13 KB, 下载次数: 0)
10:06 上传
报错:Logicalcorruption check bypassed due to non-privileged user
正常的节点:
ocrcheck:
正常节点ocrcheck_root.jpg (24.41 KB, 下载次数: 0)
10:07 上传
两个节点执行 srvctl status asm ,都报错
PRCR-1070: Failed to check if resource ora.asm is registeredCannotcommunicate with crsd
srvctl status asm.jpg (6.25 KB, 下载次数: 0)
10:08 上传
以下是其他命令的对比
crsctl check crs
crsctl check crs.jpg (26.36 KB, 下载次数: 1)
10:08 上传
crsctlcheck cluster
crsctl check cluster.jpg (25.24 KB, 下载次数: 0)
10:09 上传
crsctlquery css votedisk
crsctl query css votedisk.jpg (39.11 KB, 下载次数: 0)
10:09 上传
求职 : 认证徽章论坛徽章:9
新安装的?
论坛徽章:3
包错前,有那些变更?
论坛徽章:3
jonas_li 发表于
包错前,有那些变更?
前天晚上机房停电,服务器停掉后整理了网线和光纤线
论坛徽章:3
失败节点上
[root@db2:/]#oifcfg iflist -p -n
en3&&192.168.11.0&&PRIVATE&&255.255.255.0
en7&&10.1.84.0&&PUBLIC&&255.255.255.0
[root@db2:/]#oifcfg getif
PRIF-10: failed to initialize the cluster registry
[root@db2:/]#smit tcpip
[root@db2:/]#ifconfig -a
en3: flags=5ec0&UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),PSEG,LARGESEND,CHAIN&
& && &&&inet 192.168.11.4 netmask 0xffffff00 broadcast 192.168.11.255
& && &&&inet 169.254.121.18 netmask 0xffff0000 broadcast 169.254.255.255
& && && &tcp_sendspace 65536 tcp_recvspace 65536 rfc1323 1
en7: flags=1e&UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),LARGESEND,CHAIN&
& && &&&inet 10.1.84.211 netmask 0xffffff00 broadcast 10.1.84.255
& && && &tcp_sendspace 131072 tcp_recvspace 65536 rfc1323 0
lo0: flags=e08084b,c0&UP,BROADCAST,LOOPBACK,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,LARGESEND,CHAIN&
& && &&&inet 127.0.0.1 netmask 0xff000000 broadcast 127.255.255.255
& && &&&inet6 ::1%1/0
& && && &tcp_sendspace 131072 tcp_recvspace 131072 rfc1323 1
正常节点:
grid@db1-& oifconfig iflist -p -n
ksh: oifconfig:&&not found.
grid@db1-& oifcfg getif
en3&&192.168.11.0&&global&&cluster_interconnect
en7&&10.1.84.0&&global&&public
grid@db1-&
grid@db1-& oifcfg iflist -p -n
en3&&192.168.11.0&&PRIVATE&&255.255.255.0
en7&&10.1.84.0&&PUBLIC&&255.255.255.0
grid@db1-&
grid@db1-& ifconfig -a
en3: flags=5ec0&UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),PSEG,LARGESEND,CHAIN&
& && &&&inet 192.168.11.3 netmask 0xffffff00 broadcast 192.168.11.255
& && && &tcp_sendspace 65536 tcp_recvspace 65536 rfc1323 1
en7: flags=1e&UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),LARGESEND,CHAIN&
& && &&&inet 10.1.84.210 netmask 0xffffff00 broadcast 10.1.84.255
& && &&&inet 10.1.84.212 netmask 0xffffff00 broadcast 10.1.84.255
& && &&&inet 10.1.84.201 netmask 0xffffff00 broadcast 10.1.84.255
& && &&&inet 10.1.84.209 netmask 0xffffff00 broadcast 10.1.84.255
& && && &tcp_sendspace 131072 tcp_recvspace 65536 rfc1323 0
lo0: flags=e08084b,c0&UP,BROADCAST,LOOPBACK,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,LARGESEND,CHAIN&
& && &&&inet 127.0.0.1 netmask 0xff000000 broadcast 127.255.255.255
& && &&&inet6 ::1%1/0
& && && &tcp_sendspace 131072 tcp_recvspace 131072 rfc1323 1
grid@db1-&
但是失败节点的alter_+ASM2.log里面提示:
Private Interface'en3' configured from GPnP for use as a private interconnect.&&[name='en3', type=1,ip=169.254.121.18, mac=6c-ae-8b-68-49-b3, net=169.254.0.0/16, mask=255.255.0.0,use=haip:cluster_interconnect/62]Public Interface'en7' configured from GPnP for use as a public interface.&&[name='en7', type=1, ip=10.1.84.211,mac=34-40-b5-f5-8e-76, net=10.1.84.0/24, mask=255.255.255.0, use=public/1]
我配置的私有IP应该是192.168.11.4 啊,是不是db2的私有IP信息乱了?
论坛徽章:14
ASM 的alert log啥情况
论坛徽章:3
失败节点上:
[db2] more&&orarootagent_root.log
...skipping...
09:51:46.100: [ default][2057]clsvactversion:4: Retrieving Active Version from local storage.
09:51:46.102: [ USRTHRD][2057] {0:0:2} Css::Css clsssinit() error, rc = 3
09:51:46.103: [ USRTHRD][2057] {0:0:2} Css::Css Error in constructor. All subsequent functions
will fail.
09:51:46.104: [ USRTHRD][2057] {0:0:2} Css::Css clsssinit() error, rc = 3
09:51:46.104: [ USRTHRD][2057] {0:0:2} Css::Css Error in constructor. All subsequent functions
will fail.
09:51:46.104: [ USRTHRD][2057] {0:0:2} CssGroup::CssGroup Error in constructor. All subsequent
functions will fail.
09:51:46.105: [ USRTHRD][2057] {0:0:2} HAIP: mbr num is 0.
09:51:46.106: [ USRTHRD][2057] {0:0:2} CssLock::CssLock clssssinit() error, rc = 3
09:51:46.106: [ora.ctssd][2829] {0:0:2} [check] PID 7208966 from /oracle/11203/grid/home/ctss/
init/db2.pid
09:51:46.107: [CLSFRAME][2057] {0:0:2} Timer [31] is being canceled
09:51:46.107: [CLSFRAME][1] [TIMER] opcode = IOC_TIMER_CANCEL [907005]
09:51:46.107: [CLSFRAME][1] [TIMER] id = 41
09:51:46.107: [CLSFRAME][1] [TIMER] timerId = 31
09:51:46.107: [CLSFRAME][2057] {0:0:2} Scheduling new timer [42,30000,0]
09:51:46.107: [CLSFRAME][1] [TIMER] Removing TimerMessage: 31
09:51:46.107: [CLSFRAME][1] [TIMER] opcode = OTHER [907006]
09:51:46.107: [CLSFRAME][1] [TIMER] opcode = IOC_TIMER [907006]
09:51:46.107: [CLSFRAME][2314] {0:0:2} [TIMER] Cancelling TimerMessage: 31
09:51:46.107: [CLSFRAME][1] [TIMER] id = 42
09:51:46.107: [CLSFRAME][1] [TIMER] Inserting TimerMessage: 42 delay: 30000 interval: 0 to epo
09:51:46.108: [& & AGFW][2314] {0:0:2} ora.cluster_interconnect.haip 1 1 state changed from: U
NKNOWN to: FAILED
09:51:46.108: [& & AGFW][2314] {0:0:2} Agent sending last reply for: RESOURCE_PROBE[ora.cluste
r_interconnect.haip 1 1] ID 4097:66
09:51:46.108: [ora.drivers.acfs][2057] {0:0:2} [check] DriversAcfsAgent:: Initializing resourc
e entry points
09:51:46.109: [ora.drivers.acfs][2057] {0:0:2} [check] __IS_HASD_AGENT=TRUE
09:51:46.109: [ CRSCOMM][1543] Ipc: Adding msg () to peer: 0
09:51:46.109: [CLSFRAME][2314] {0:0:2} [TIMER] New wait delay: 29973
09:51:46.109: [ora.drivers.acfs][2057] {0:0:2} [check] DriversAcfsAgent:: Completed initializi
ng resourceentry points
09:51:46.109: [CLSFRAME][1] TM [MultiThread] is changing desired thread # to 7. Current # is 6
09:51:46.109: [ CRSCOMM][772] clsIpc: Sent msg:
to member 0
09:51:46.109: [CLSFRAME][4114] Worker thread starting, TM [MultiThread]
09:51:46.109: [CLSFRAME][4114] {0:0:2} Delivering Timer cancel event [31]
09:51:46.110: [ora.diskmon][3086] {0:0:2} [check] CELL communication is configured to use 0 in
terface(s):
论坛徽章:3
shane1103 发表于
ASM 的alert log啥情况
怀疑是红色的部分的问题
Fri Dec 20 09:56:09 2013
NOTE: No asm libraries found in the system
MEMORY_TARGET defaulting to .
* instance_number obtained from CSS = 2, checking for the existence of node 0...
* node 0 does not exist. instance_number = 2
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Private Interface 'en3' configured from GPnP for use as a private interconnect.
&&[name='en3', type=1, ip=169.254.121.18, mac=6c-ae-8b-68-49-b3, net=169.254.0.0/16, mask=255.255.0.0, us
e=haip:cluster_interconnect/62]
Public Interface 'en7' configured from GPnP for use as a public interface.
&&[name='en7', type=1, ip=10.1.84.211, mac=34-40-b5-f5-8e-76, net=10.1.84.0/24, mask=255.255.255.0, use=p
Picked latch-free SCN scheme 3
Using LOG_ARCHIVE_DEST_1 parameter default value as /oracle/11203/grid/home/dbs/arch
Autotune of undo retention is turned on.
LICENSE_MAX_USERS = 0
SYS auditing is disabled
Starting up:
Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
With the Real Application Clusters and Automatic Storage Management options.
ORACLE_HOME = /oracle/11203/grid/home
System name:& & AIX
Node name:& && &db2
Release:& && &&&1
Version:& && &&&6
Machine:& && &&&00F8C4F74C00
Using parameter settings in server-side spfile +CRS/db/asmparameterfile/registry.253.
System parameters with non-default values:
&&large_pool_size& && && & = 12M
&&instance_type& && && && &= &asm&
&&remote_login_passwordfile= &EXCLUSIVE&
alert_+ASM2.log (94%)&&asm_diskgroups& && && &&&= &DATA&
&&asm_diskgroups& && && &&&= &ARCH&
&&asm_power_limit& && && & = 1
&&diagnostic_dest& && && & = &/oracle/11203/grid/base&
Cluster communication is configured to use the following interface(s) for this instance
&&169.254.121.18
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
Fri Dec 20 09:56:10 2013
PMON started with pid=2, OS id=3276824
Fri Dec 20 09:56:10 2013
PSP0 started with pid=3, OS id=6291922
Fri Dec 20 09:56:11 2013
VKTM started with pid=4, OS id=4915710 at elevated priority
VKTM running at (10)millisec precision with DBRM quantum (100)ms
Fri Dec 20 09:56:11 2013
GEN0 started with pid=5, OS id=6815990
Fri Dec 20 09:56:11 2013
DIAG started with pid=6, OS id=6881334
Fri Dec 20 09:56:11 2013
PING started with pid=7, OS id=2883974
Fri Dec 20 09:56:11 2013
DIA0 started with pid=8, OS id=6750360
Fri Dec 20 09:56:11 2013
LMON started with pid=9, OS id=6947016
Fri Dec 20 09:56:11 2013
LMD0 started with pid=10, OS id=3146022
* System load used for high load check
* New Low - High Load Threshold Range = [221184 - 294912]
Fri Dec 20 09:56:12 2013
LMS0 started with pid=11, OS id=6160764 at elevated priority
Fri Dec 20 09:56:12 2013
LMHB started with pid=12, OS id=5112092
Fri Dec 20 09:56:12 2013
MMAN started with pid=13, OS id=6029684
Fri Dec 20 09:56:12 2013
DBW0 started with pid=14, OS id=3997944
Fri Dec 20 09:56:12 2013
LGWR started with pid=15, OS id=4522434
Fri Dec 20 09:56:12 2013
CKPT started with pid=16, OS id=4850000
Fri Dec 20 09:56:12 2013
SMON started with pid=17, OS id=4063570
alert_+ASM2.log (95%)Fri Dec 20 09:56:12 2013
RBAL started with pid=18, OS id=4915298
Fri Dec 20 09:56:12 2013
GMON started with pid=19, OS id=6684896
Fri Dec 20 09:56:12 2013
MMON started with pid=20, OS id=5767406
Fri Dec 20 09:56:12 2013
MMNL started with pid=21, OS id=5964040
lmon registered with NM - instance number 2 (internal mem no 1)
Fri Dec 20 09:59:01 2013
PMON (ospid: 3276824): terminating the instance due to error 481
Fri Dec 20 10:00:04 2013
Instance terminated by PMON, pid = 3276824
Fri Dec 20 10:03:32 2013
NOTE: No asm libraries found in the system
MEMORY_TARGET defaulting to .
* instance_number obtained from CSS = 2, checking for the existence of node 0...
* node 0 does not exist. instance_number = 2
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Fri Dec 20 10:04:42 2013
Private Interface 'en3' configured from GPnP for use as a private interconnect.
&&[name='en3', type=1, ip=169.254.121.18, mac=6c-ae-8b-68-49-b3, net=169.254.0.0/16, mask=255.255.0.0, us
e=haip:cluster_interconnect/62]
Public Interface 'en7' configured from GPnP for use as a public interface.
&&[name='en7', type=1, ip=10.1.84.211, mac=34-40-b5-f5-8e-76, net=10.1.84.0/24, mask=255.255.255.0, use=p
Picked latch-free SCN scheme 3
Using LOG_ARCHIVE_DEST_1 parameter default value as /oracle/11203/grid/home/dbs/arch
Autotune of undo retention is turned on.
LICENSE_MAX_USERS = 0
SYS auditing is disabled
Starting up:
Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
With the Real Application Clusters and Automatic Storage Management options.
ORACLE_HOME = /oracle/11203/grid/home
System name:& & AIX
Node name:& && &db2
Release:& && &&&1
Version:& && &&&6
Machine:& && &&&00F8C4F74C00
Using parameter settings in server-side spfile +CRS/db/asmparameterfile/registry.253.
alert_+ASM2.log (96%)System parameters with non-default values:
&&large_pool_size& && && & = 12M
&&instance_type& && && && &= &asm&
&&remote_login_passwordfile= &EXCLUSIVE&
&&asm_diskgroups& && && &&&= &DATA&
&&asm_diskgroups& && && &&&= &ARCH&
&&asm_power_limit& && && & = 1
&&diagnostic_dest& && && & = &/oracle/11203/grid/base&
Cluster communication is configured to use the following interface(s) for this instance
&&169.254.121.18
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
Fri Dec 20 10:04:42 2013
PMON started with pid=2, OS id=3276808
Fri Dec 20 10:04:42 2013
PSP0 started with pid=3, OS id=6815896
Fri Dec 20 10:04:43 2013
VKTM started with pid=4, OS id=7471336 at elevated priority
VKTM running at (10)millisec precision with DBRM quantum (100)ms
Fri Dec 20 10:04:43 2013
GEN0 started with pid=5, OS id=7078074
Fri Dec 20 10:04:43 2013
DIAG started with pid=6, OS id=7864444
Fri Dec 20 10:04:43 2013
PING started with pid=7, OS id=7012488
Fri Dec 20 10:04:43 2013
DIA0 started with pid=8, OS id=7536672
Fri Dec 20 10:04:44 2013
LMON started with pid=9, OS id=3146168
Fri Dec 20 10:04:44 2013
LMD0 started with pid=10, OS id=6029782
* System load used for high load check
* New Low - High Load Threshold Range = [221184 - 294912]
Fri Dec 20 10:04:44 2013
LMS0 started with pid=11, OS id=4194812 at elevated priority
Fri Dec 20 10:04:44 2013
LMHB started with pid=12, OS id=4915218
Fri Dec 20 10:04:44 2013
MMAN started with pid=13, OS id=2883846
Fri Dec 20 10:04:44 2013
DBW0 started with pid=14, OS id=4915670
Fri Dec 20 10:04:44 2013
LGWR started with pid=15, OS id=3997700
alert_+ASM2.log (97%)Fri Dec 20 10:04:44 2013
CKPT started with pid=16, OS id=7798796
Fri Dec 20 10:04:44 2013
SMON started with pid=17, OS id=6750554
Fri Dec 20 10:04:44 2013
RBAL started with pid=18, OS id=7209012
Fri Dec 20 10:04:44 2013
GMON started with pid=19, OS id=5964206
Fri Dec 20 10:04:44 2013
MMON started with pid=20, OS id=5112084
Fri Dec 20 10:04:44 2013
MMNL started with pid=21, OS id=6946938
lmon registered with NM - instance number 2 (internal mem no 1)
Fri Dec 20 10:06:44 2013
PMON (ospid: 3276808): terminating the instance due to error 481
Fri Dec 20 10:06:44 2013
System state dump requested by (instance=2, osid=3276808 (PMON)), summary=[abnormal instance termination]
System State dumped to trace file /oracle/11203/grid/base/diag/asm/+asm/+ASM2/trace/+ASM2_diag_7864444.tr
Fri Dec 20 10:06:44 2013
ORA-1092 : opitsk aborting process
Dumping diagnostic data in directory=[cdmp_44], requested by (instance=2, osid=3276808 (PMON)
), summary=[abnormal instance termination].
Fri Dec 20 10:06:44 2013
ORA-1092 : opitsk aborting process
Instance terminated by PMON, pid = 3276808
Fri Dec 20 10:06:46 2013
License high water mark = 1
USER (ospid: 7143852): terminating the instance
Instance terminated by USER, pid = 7143852
Fri Dec 20 10:06:51 2013
NOTE: No asm libraries found in the system
MEMORY_TARGET defaulting to .
* instance_number obtained from CSS = 2, checking for the existence of node 0...
* node 0 does not exist. instance_number = 2
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Private Interface 'en3' configured from GPnP for use as a private interconnect.
&&[name='en3', type=1, ip=169.254.121.18, mac=6c-ae-8b-68-49-b3, net=169.254.0.0/16, mask=255.255.0.0, us
e=haip:cluster_interconnect/62]
Public Interface 'en7' configured from GPnP for use as a public interface.
alert_+ASM2.log (98%)&&[name='en7', type=1, ip=10.1.84.211, mac=34-40-b5-f5-8e-76, net=10.1.84.0/24, mask=255.255.255.0, use=p
Picked latch-free SCN scheme 3
Using LOG_ARCHIVE_DEST_1 parameter default value as /oracle/11203/grid/home/dbs/arch
Autotune of undo retention is turned on.
LICENSE_MAX_USERS = 0
SYS auditing is disabled
Starting up:
Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
With the Real Application Clusters and Automatic Storage Management options.
ORACLE_HOME = /oracle/11203/grid/home
System name:& & AIX
Node name:& && &db2
Release:& && &&&1
Version:& && &&&6
Machine:& && &&&00F8C4F74C00
Using parameter settings in server-side spfile +CRS/db/asmparameterfile/registry.253.
System parameters with non-default values:
&&large_pool_size& && && & = 12M
&&instance_type& && && && &= &asm&
&&remote_login_passwordfile= &EXCLUSIVE&
&&asm_diskgroups& && && &&&= &DATA&
&&asm_diskgroups& && && &&&= &ARCH&
&&asm_power_limit& && && & = 1
&&diagnostic_dest& && && & = &/oracle/11203/grid/base&
Cluster communication is configured to use the following interface(s) for this instance
&&169.254.121.18
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
Fri Dec 20 10:06:52 2013
PMON started with pid=2, OS id=7471120
Fri Dec 20 10:06:52 2013
PSP0 started with pid=3, OS id=5964252
Fri Dec 20 10:07:51 2013
VKTM started with pid=4, OS id=7798910 at elevated priority
VKTM running at (10)millisec precision with DBRM quantum (100)ms
Fri Dec 20 10:07:51 2013
GEN0 started with pid=5, OS id=6750266
Fri Dec 20 10:07:51 2013
DIAG started with pid=6, OS id=3997790
Fri Dec 20 10:07:51 2013
PING started with pid=7, OS id=4915676
Fri Dec 20 10:07:51 2013
alert_+ASM2.log (99%)DIA0 started with pid=8, OS id=3276806
Fri Dec 20 10:07:51 2013
LMON started with pid=9, OS id=8323084
Fri Dec 20 10:07:51 2013
LMD0 started with pid=10, OS id=7405940
* System load used for high load check
* New Low - High Load Threshold Range = [221184 - 294912]
Fri Dec 20 10:07:51 2013
LMS0 started with pid=11, OS id=7864378 at elevated priority
Fri Dec 20 10:07:51 2013
LMHB started with pid=12, OS id=7667934
Fri Dec 20 10:07:51 2013
MMAN started with pid=13, OS id=7340182
Fri Dec 20 10:07:51 2013
DBW0 started with pid=14, OS id=6947000
Fri Dec 20 10:07:51 2013
LGWR started with pid=15, OS id=6815932
Fri Dec 20 10:07:51 2013
CKPT started with pid=16, OS id=4915238
Fri Dec 20 10:07:51 2013
SMON started with pid=17, OS id=8257788
Fri Dec 20 10:07:51 2013
RBAL started with pid=18, OS id=3146078
Fri Dec 20 10:07:51 2013
GMON started with pid=19, OS id=6684870
Fri Dec 20 10:07:51 2013
MMON started with pid=20, OS id=6029722
Fri Dec 20 10:07:51 2013
MMNL started with pid=21, OS id=6160808
lmon registered with NM - instance number 2 (internal mem no 1)
Fri Dec 20 10:09:51 2013
PMON (ospid: 7471120): terminating the instance due to error 481
Fri Dec 20 10:09:51 2013
System state dump requested by (instance=2, osid=7471120 (PMON)), summary=[abnormal instance termination]
System State dumped to trace file /oracle/11203/grid/base/diag/asm/+asm/+ASM2/trace/+ASM2_diag_3997790.tr
Fri Dec 20 10:09:51 2013
ORA-1092 : opitsk aborting process
Dumping diagnostic data in directory=[cdmp_51], requested by (instance=2, osid=7471120 (PMON)
), summary=[abnormal instance termination].
Fri Dec 20 10:11:17 2013
Instance terminated by PMON, pid = 7471120
alert_+ASM2.log: END[root@db2:/oracle/11203/grid/base/diag/asm/+asm/+ASM2/trace]#
itpub.net All Right Reserved. 北京皓辰网域网络信息技术有限公司版权所有    
 北京市公安局海淀分局网监中心备案编号: 广播电视节目制作经营许可证:编号(京)字第1149号额头深刻的皱纹和斑驳的脸庞,让人感受到岁月的无情。
当地人给断掉的鼻子贴上了创口贴,一时在网上走红。
声明:本文由入驻搜狐公众平台的作者撰写,除搜狐官方账号外,观点仅代表作者本人,不代表搜狐立场。
张大朋(Lunar)Oracle 资深技术专家
  Lunar 拥有超过十年的 ORACLE SUPPORT 从业经验,曾经服务于ORACLE ACS部门,现就职于 ORACLE Sales Consultant 部门,负责的产品主要是 Exadata,Golden Gate,Database 等。
  我们都知道,在RAC环境中,如果kill ocssd.bin进程,会引起主机重启。
  但是有时候系统已经异常了了,且CRS不能正常关闭,而主机可能是几年没重启的老系统,没人敢重启,现在怎么办?
  我们只能尝试手工kill进程的方式,然后手工修复CRS(注意,在10.2 RAC中,只有3个d.bin进程)。
  测试环境:操作系统是OEL 6.6
  这套RAC的CRS版本是11.2.0.4:
  查看当前CRS的状态:
  查看当前所有的CRS进程:
  我们开始模拟kill进程。首先kill 掉/u01/app/11.2.0.4/grid/bin/ohasd.bin
  如果大家了解11.2RAC的启动过程,我们会知道,kill后会自动重启 。
  然后,我们kill cssdmonitor:
  这里没有这个进程,表示cssdmonitor进程被重启过了
  检查进程
  上面进程启动时间在20:04~20:07之间的,都是被/u01/app/11.2.0.4/grid/bin/ohasd.bin进程重启后,自动后台重启的。
  现在,我们kill mdnsd gpnpd gipcd osysmond。这4个进程中,前面3个是CRS启动除了ohasd以外,最早启动的几个进程。
  如果kill这些进程,ohasd都会重启的:
  这里我们看到,刚才kill 的4 进程都没起来,怎么回事?别急,还没到时间,ohasd需要check后才启动。
  然后,我们kill 监听:
  我们看到,刚才kill的进程都被重启了,11.2的RAC真强悍啊。
  现在我们kill /etc/init.d/init.ohasd进程:
  ’这里我们看到的就是/etc/init.d/init.ohasd被系统自动重启的过程。这些信息会记录在/var/log/message/中:
  而且他进程都被自动重启了(注意这是crsd进程还没被重启):
  现在我们依次kill:evmlogger.bin gpnpd.bin mdnsd.bin gipcd.bin evmd.bin oraagent.bin agent.bin oraagent.bin orarootagent.bin和两个lisnterner
  然后,kill osysmond.bin ologgerd cssdmonitor cssdagent :
  现在就剩下一个ocssd.bin了:
  现在我们kill 传说中一旦被kill就会引起主机重启的进程 ocssd.bin :
  好了,我们的系统都还好好的,没有重启,资源也都释放干净了:
  如果要恢复,很简单,只要直接重启crs就ok了:
  检查进程:
  检查集群状态
  这里只显示了节点1,因为节点2我关闭了。
  测试证明,只要先kill cssdmonitor 和 cssdagent进程(准确的说是cssagent),再kill ocssd.bin进程,系统是不会重启的。
  另外,12.1普通RAC(非Flex Cluster)的情况根本文一样,处理思路和过程也一样。
  ----the end
  假日远去,精彩还在继续!
  11月4日,Oracle技术嘉年华 - 国内规模最大的Oracle技术盛会将在京举行,美国OOW的最新信息将悉数来至中国,欢迎您来到大会现场。
  主题:数据?技术?平台 - 新时代的技术前沿
  大会网站:/
  会议时间:日(周五)- 5日(周六)
  会议地点:北京&富力万丽酒店
  更多Oracle技术嘉年华相关信息请点击了解:。
  如何加入&云和恩墨大讲堂&微信群
  搜索 盖国强(Eygle)微信号:eyygle,或者扫描下面二维码,备注:云和恩墨大讲堂,即可入群。每周与千人共享免费技术分享,与讲师在线讨论。
  近期文章
  资源下载
  关注本微信(OraNews)回复关键字获取
  2016DTCC, 2016数据库大会PPT;
  DBALife,&DBA的一天&精品海报大图;
  12cArch,“Oracle 12c体系结构”精品海报;
  DBA01,《Oracle DBA手记》第一本下载;
  YunHe,“云和恩墨大讲堂”案例文档下载;
  鼓励下美女大师吧!
  人赞赏
欢迎举报抄袭、转载、暴力色情及含有欺诈和虚假信息的不良文章。
请先登录再操作
请先登录再操作
微信扫一扫分享至朋友圈
搜狐公众平台官方账号
生活时尚&搭配博主 /生活时尚自媒体 /时尚类书籍作者
搜狐网教育频道官方账号
全球最大华文占星网站-专业研究星座命理及测算服务机构
主演:黄晓明/陈乔恩/乔任梁/谢君豪/吕佳容/戚迹
主演:陈晓/陈妍希/张馨予/杨明娜/毛晓彤/孙耀琦
主演:陈键锋/李依晓/张迪/郑亦桐/张明明/何彦霓
主演:尚格?云顿/乔?弗拉尼甘/Bianca Bree
主演:艾斯?库珀/ 查宁?塔图姆/ 乔纳?希尔
baby14岁写真曝光
李冰冰向成龙撒娇争宠
李湘遭闺蜜曝光旧爱
美女模特教老板走秀
曝搬砖男神奇葩择偶观
柳岩被迫成赚钱工具
大屁小P虐心恋
匆匆那年大结局
乔杉遭粉丝骚扰
男闺蜜的尴尬初夜
客服热线:86-10-
客服邮箱:

我要回帖

更多关于 进程无法结束怎么办 的文章

 

随机推荐