There are 2 issues: 1. If recovery helper fails to freeze a database, then it does not ban the node that fails to freeze the database. 2. If node stays in recovery for too long, then it should drop all the IP addresses. As a result of the 1st problem, CTDB never comes out of recovery and none of the clients can actually get service since all the databases keep getting frozen. As a result of the 2nd problem, CTDB does not end up droping all the IPs even though it stays in recovery for far too long. This is very useful to tell the clients that something has gone wrong.
Created attachment 13311 [details] Patches for v4-5
Created attachment 13312 [details] Patches for v4-6
Hi Karolin, This is ready for 4.5 and 4.6. Thanks...
(In reply to Martin Schwenke from comment #3) Pushed to autobuild-v4-{6,5}-test.
(In reply to Karolin Seeger from comment #4) Pushed to both branches. Closing out bug report. Thanks!