On a number of Samba 4 DCs core dumps have been observed that are related to a termination of the source4/winbind service. After the core dump the unix domain sockets below /var/run/samba/winbindd not opened any longer. Maybe related to this is "samba-tool drs showrepl" shows a number of WERR_INVALID_PARAM errors. Maybe also related: Several occurences of IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT and dcerpc_fault WERR_EPT_S_CANT_PERFORM_OP in drsuapi_DsReplicaSync can be found in log.samba can be found a couple of times per day, not only before the core dump happens.
Created attachment 7361 [details] Ths was observed with a pre-alpha18 git snapshot. The full backtrace is attached.
Created attachment 7362 [details] relevant part of log.samba at loglevel 2
Does 7b1fb088421565f1752acde02377237e4ca19248 fixes at least the segfault?
Created attachment 7383 [details] Backtrace with commit 7b1fb088421565f1752acde02377237e4ca19248 applied (In reply to comment #3) > Does 7b1fb088421565f1752acde02377237e4ca19248 fixes at least the segfault? No, the segfault still occurs. Find a updated backtrace attached.
metze, shouldn't this have been fixed by your recent rpc library rework?
I hope so, ebcfa61d9f712db8400acd722dfc43c07021c9b0 and the s4:librp/rpc patches before should fix it.
Should have been fixed as well.
The patches seem to fix the segfaults but the error messages still occur (~every 10 seconds on 3 Samba 4 DCs): --- [2012/03/27 15:49:17, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:27, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:28, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:37, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:47, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:48, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT [2012/03/27 15:49:57, 0] ../source4/rpc_server/common/forward.c:51(dcesrv_irpc_forward_callback) IRPC callback failed for DsReplicaSync - NT_STATUS_IO_TIMEOUT ---
Comment 8 has been split of as Bug 8851.