|
|||||||||||
|
1 node in cluster fails hourly: Ndb kernel is stuck in: Job Handling
From: James Graham <james(at)asperity.co.uk>
Date: Mon Oct 01 2007 - 04:51:48 EDT
Since yesterday evening, one of our data nodes has been crashing every
hour or so.
The error is below:
Trace (relevant part) NR: setLcpActiveStatusEnd - !m_participatingLQH NR: setLcpActiveStatusEnd - m_participatingLQH Ndb kernel is stuck in: Job Handling Ndb kernel is stuck in: Job Handling Ndb kernel is stuck in: Job Handling 2007-10-01 06:54:03 [ndbd] INFO -- Watchdog restarting system 2007-10-01 06:54:03 [ndbd] INFO -- Watchdog shutdown completed -exiting 2007-10-01 06:54:03 [ndbd] ALERT -- Node 3: Forced node shutdown completed, restarting. Initiated by signal 0. Caused by error 6050: 'WatchDog terminate, internal error or massive overload on the machine running this node(Internal error, programming error or missing error message, please re 2007-10-01 06:54:03 [ndbd] INFO -- Ndb has terminated (pid 4218) restarting 2007-10-01 06:54:03 [ndbd] INFO -- Angel pid: 2868 ndb pid: 5076 2007-10-01 06:54:03 [ndbd] INFO -- NDB Cluster -- DB node 3 2007-10-01 06:54:03 [ndbd] INFO -- Version 5.0.32 -- 2007-10-01 06:54:03 [ndbd] INFO -- Configuration fetched at 89.200.138.148 port 1186 2007-10-01 06:54:03 [ndbd] INFO -- Start initiated (version 5.0.32)
It is only one of the two nodes that does this, luckily the other is
fine.
-- MySQL Cluster Mailing List For list archives: http://lists.mysql.com/cluster To unsubscribe: http://lists.mysql.com/cluster?unsub=lists@pantek.comReceived on Mon Oct 1 05:07:19 2007 This archive was generated by hypermail 2.1.8 : Sun Oct 07 2007 - 10:15:17 EDT |
||||||||||
|
|||||||||||