Oracle RAC database instance startup exception analysis: IPC Send timeout
2022-04-23 13:43:00 【Not dizzy yet】
Recently, a user restarted the database instance on one node of a RAC cluster and found that startup was very slow. At the same time, the business team reported that applications connected to the surviving RAC node were also affected.
Analysis of the logs shows that Reconfiguration was slow while the database was starting, and that it then reported "IPC Send timeout detected. Sender: ospid 53884 [oracle@test2 (LMD0)]". As a result, an instance was evicted from the database cluster:
Wed Apr 13 19:28:02 2022
Instance termination initiated by instance 2 with reason 1.
Instance 2 received a reconfig event from its cluster manager indicating that this instance is supposed to be down
Please check instance 2's alert log and LMON trace file for more details.
Please also examine the CSS log files.
LMON (ospid: 47523): terminating the instance due to error 481
The investigation therefore has to focus on why the IPC Send timeout occurred. It may be caused by a bug, or by the load on the RAC nodes. The troubleshooting steps in the following MOS documents can be used to check the system item by item:
Instance Evicted After LMON to LMON IPC Send timeout Due to Storage Issue (Doc ID 2080029.1)
"ipc send timeout" Precedes Database Instance Crash or Eviction (Doc ID 1951216.1)
While Evicting One of the Instance, the Remaining instances Terminated by LMON with "LMON is running too slowly and in the middle of reconfiguration" (Doc ID 1949505.1)
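As a first pass before working through the MOS notes, it can help to pull the key events out of each node's alert log. The following is only a minimal sketch in Python: the marker list and the `scan_alert_log` helper are hypothetical, and the patterns are taken from the log excerpts in this article rather than from any Oracle-provided catalogue.

```python
import re

# Hypothetical marker list: the strings come from the alert log excerpts
# in this article, not from an official list of Oracle messages.
MARKERS = {
    "ipc_send_timeout": re.compile(r"IPC Send timeout", re.IGNORECASE),
    "instance_termination": re.compile(r"terminating the instance due to error (\d+)"),
    "reconfiguration": re.compile(r"Reconfiguration started"),
    "ora_error": re.compile(r"\bORA-0*(\d+)\b"),
}


def scan_alert_log(lines):
    """Return (line_number, marker_name, text) for every line that matches a marker."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for name, pattern in MARKERS.items():
            if pattern.search(line):
                hits.append((number, name, line.strip()))
    return hits


# Sample input assembled from the log lines quoted in this article.
SAMPLE = """\
Wed Apr 13 19:28:00 2022
IPC Send timeout detected. Sender: ospid 53884 [oracle@test2 (LMD0)]
Receiver: inst 1 binc 429458022 ospid 47526
LMON (ospid: 47523): terminating the instance due to error 481
ORA-1092 : opitsk aborting process
Reconfiguration started (old inc 4, new inc 8)
""".splitlines()

for number, name, text in scan_alert_log(SAMPLE):
    print(f"{number:>4} {name:<22} {text}")
```

In practice the same function could be fed the lines of a real alert log file from each node, so that the events on both sides can be compared side by side.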
The relevant logs are as follows:
1. At 2022-04-13 18:57:29.215, the cluster software on node 1 was restarted manually, and the database instance also started successfully:
Wed Apr 13 18:58:32 2022
QMNC started with pid=100, OS id=52025
Completed: ALTER DATABASE OPEN /* db agent *//* {1:49652:2} */
2. While node 2 was in reconfiguration, node 1 terminated abnormally
-- node 2:
Wed Apr 13 19:22:26 2022
Starting ORACLE instance (normal)
-- node 1:
Wed Apr 13 19:28:00 2022
IPC Send timeout detected. Receiver ospid 47526 [
Wed Apr 13 19:28:00 2022
Errors in file /oracle/app/diag/rdbms/testnew/test1/trace/test1_lmd0_47526.trc:
Wed Apr 13 19:28:02 2022
Instance termination initiated by instance 2 with reason 1.
Instance 2 received a reconfig event from its cluster manager indicating that this instance is supposed to be down
Please check instance 2's alert log and LMON trace file for more details.
Please also examine the CSS log files.
LMON (ospid: 47523): terminating the instance due to error 481
System state dump requested by (instance=1, osid=47523 (LMON)), summary=[abnormal instance termination].
System State dumped to trace file /oracle/app/diag/rdbms/testnew/test1/trace/test1_diag_47507_20220413192802.trc
Wed Apr 13 19:28:03 2022
ORA-1092 : opitsk aborting process
Instance terminated by LMON, pid = 47523
-- node 2:
Wed Apr 13 19:28:00 2022
IPC Send timeout detected. Sender: ospid 53884 [oracle@test2 (LMD0)]
Receiver: inst 1 binc 429458022 ospid 47526
IPC Send timeout to 1.0 inc 4 for msg type 65521 from opid 11
Wed Apr 13 19:28:02 2022
Communications reconfiguration: instance_number 1
Wed Apr 13 19:28:02 2022
Dumping diagnostic data in directory=[cdmp_20220413192802], requested by (instance=1, osid=47523 (LMON)), summary=[abnormal instance termination].
Reconfiguration started (old inc 4, new inc 8)
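The sender and receiver fields in the node 2 message identify which processes were involved in the timeout. The sketch below, again in Python, extracts them with regular expressions. The `parse_ipc_timeout` helper and both patterns are assumptions written against the two message lines shown above; other Oracle versions may format these lines differently.

```python
import re

# Patterns written against the two message lines quoted in this article;
# treat them as assumptions, not as a stable Oracle log format.
SENDER_RE = re.compile(
    r"IPC Send timeout detected\. Sender: ospid (?P<ospid>\d+) "
    r"\[(?P<user>[^@\]]+)@(?P<host>\S+) \((?P<process>\w+)\)\]"
)
RECEIVER_RE = re.compile(
    r"Receiver: inst (?P<inst>\d+) binc (?P<binc>\d+) ospid (?P<ospid>\d+)"
)


def parse_ipc_timeout(lines):
    """Collect sender and receiver details from IPC Send timeout message lines."""
    result = {}
    for line in lines:
        sender = SENDER_RE.search(line)
        if sender:
            result["sender"] = {
                "ospid": int(sender["ospid"]),
                "host": sender["host"],
                "process": sender["process"],
            }
        receiver = RECEIVER_RE.search(line)
        if receiver:
            result["receiver"] = {
                "inst": int(receiver["inst"]),
                "ospid": int(receiver["ospid"]),
            }
    return result


NODE2_LINES = [
    "IPC Send timeout detected. Sender: ospid 53884 [oracle@test2 (LMD0)]",
    "Receiver: inst 1 binc 429458022 ospid 47526",
]

print(parse_ipc_timeout(NODE2_LINES))
```

Here the receiver ospid 47526 on instance 1 matches the trace file name test1_lmd0_47526.trc reported on node 1, which suggests that node 1's LMD0 process is the one to examine first.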
#############################
3. We need to find out why the IPC Send timeout occurred during Reconfiguration while node 2 was starting; this is also why node 2 seemed slow when it was started manually. In addition, node 1 reported an ORA-00240 error at 19:34 while starting up. The network and storage conditions at the time, as well as the load on the nodes, should all be checked, with reference to the MOS documents.
Wed Apr 13 19:34:49 2022
Errors in file /oracle/app/diag/rdbms/testnew/test1/trace/test1_dbw0_247773.trc (incident=168173):
ORA-00240: control file enqueue held for more than 120 seconds
Instance Evicted After LMON to LMON IPC Send timeout Due to Storage Issue (Doc ID 2080029.1)
"ipc send timeout" Precedes Database Instance Crash or Eviction (Doc ID 1951216.1)
While Evicting One of the Instance, the Remaining instances Terminated by LMON with "LMON is running too slowly and in the middle of reconfiguration" (Doc ID 1949505.1)
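To line these events up against network, storage, and OS load data from the same period, it is useful to put them on one timeline. The following Python sketch computes the seconds elapsed since node 2 began starting. The timestamps are copied from the logs above; the event labels, the `elapsed_seconds` helper, and the format string are illustrative assumptions (the weekday and month names are parsed with the default English locale).

```python
from datetime import datetime

# Format of the timestamp lines in the alert log excerpts above,
# e.g. "Wed Apr 13 19:28:00 2022". Assumes an English (C) locale.
ALERT_TS_FORMAT = "%a %b %d %H:%M:%S %Y"

# Timestamps copied from the logs in this article; labels are illustrative.
EVENTS = [
    ("node 2 instance start", "Wed Apr 13 19:22:26 2022"),
    ("IPC Send timeout detected", "Wed Apr 13 19:28:00 2022"),
    ("node 1 instance terminated", "Wed Apr 13 19:28:03 2022"),
    ("ORA-00240 on node 1", "Wed Apr 13 19:34:49 2022"),
]


def elapsed_seconds(events):
    """Return (label, seconds since the first event) for each event."""
    parsed = [(label, datetime.strptime(ts, ALERT_TS_FORMAT)) for label, ts in events]
    start = parsed[0][1]
    return [(label, int((when - start).total_seconds())) for label, when in parsed]


for label, seconds in elapsed_seconds(EVENTS):
    print(f"+{seconds:>4}s  {label}")
```

With these values the timeout appears 334 seconds after node 2 began starting, which gives a concrete window to compare against OS, network, and storage monitoring data.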
Copyright notice
This article was written by [Not dizzy yet]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/04/202204230601579426.html