During the CTDB cluster is recovering, if the shared directory used by recovery file is also unavailable and then recovers, cluster will enter a loop recovery state. Reproduce steps: 0. the CTDB cluster consists of three or more nodes 1. kill one ctdbd process 2. umount the shared directory of the recovery master node 3. immediately remount
Samba 4.12.7 is no longer supported. Please only report bugs against supported versions. However... The behaviour of the cluster lock has change a lot since 4.12. Please see the Cluster Lock section in https://ctdb.samba.org/manpages/ctdb.7.html. I wonder if specifying the recheck interval (e.g. 5s) will help. Here is an example: [cluster] cluster lock = !/usr/libexec/ctdb/ctdb_mutex_fcntl_helper /clusterfs/.cluster_lock 5 Please try something like this with a supported version. Thanks...