Bug 5922 - Winbind error prevents client connections
Summary: Winbind error prevents client connections
Status: RESOLVED WORKSFORME
Alias: None
Product: Samba 3.0
Classification: Unclassified
Component: winbind (show other bugs)
Version: 3.0.24
Hardware: x86 Linux
: P3 major
Target Milestone: none
Assignee: Volker Lendecke
QA Contact: Samba QA Contact
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-11-25 13:59 UTC by Michael Siemens
Modified: 2016-04-29 01:15 UTC (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Michael Siemens 2008-11-25 13:59:29 UTC
We manage several samba server pools, each on it's own network and authenticating against different AD domains. These sites use winbind for AD integration, and have been experiencing the following intermittent error on some of the servers. (The problem does not happen on all servers, but when it affects one, it usually re-appears weeks/months later on the same server.) I have not been able to locate a bug report pertaining to this:

First, the log.winbindd snippet from one of the affected sites:

[2008/11/21 06:55:57, 0] rpc_client/cli_pipe.c:rpc_api_pipe(790)
  rpc_api_pipe: Remote machine DOMAIN_CONTROLLER pipe \NETLOGON fnum 0x4002returned critical error. Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:55:57, 0] libsmb/clientgen.c:cli_rpc_pipe_close(375)
  cli_rpc_pipe_close: cli_close failed on pipe \samr, fnum 0x4003 to machine DOMAIN_CONTROLLER.  Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:55:57, 0] libsmb/clientgen.c:cli_rpc_pipe_close(375)
  cli_rpc_pipe_close: cli_close failed on pipe \lsarpc, fnum 0x4000 to machine DOMAIN_CONTROLLER.  Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:55:57, 0] libsmb/clientgen.c:cli_rpc_pipe_close(375)
  cli_rpc_pipe_close: cli_close failed on pipe \NETLOGON, fnum 0x4002 to machine DOMAIN_CONTROLLER.  Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:59:05, 0] rpc_client/cli_pipe.c:rpc_api_pipe(790)
  rpc_api_pipe: Remote machine DOMAIN_CONTROLLER pipe \lsarpc fnum 0x8008returned critical error. Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:59:05, 1] nsswitch/winbindd_group.c:winbindd_getgrnam(255)
  group SOME_GROUP in domain SOME_DOMAIN does not exist
[2008/11/21 06:59:05, 1] nsswitch/winbindd_group.c:winbindd_getgrnam(255)
  group SOME_GROUP in domain SOME_DOMAIN does not exist
[2008/11/21 06:59:05, 0] libsmb/clientgen.c:cli_rpc_pipe_close(375)
  cli_rpc_pipe_close: cli_close failed on pipe \lsarpc, fnum 0x8008 to machine DOMAIN_CONTROLLER.  Error was Call timed out: server did not respond after 10000 milliseconds
[2008/11/21 06:59:36, 1] nsswitch/winbindd_group.c:winbindd_getgrnam(255)
  group SOME_GROUP in domain SOME_DOMAIN does not exist
<... this last log entry is repeated several times...>


When the problem presents itself, we consistently find these log entries 2-5 days prior to the client access problems. Most commonly, the problem presents itself to the client as an inability to resolve the hostname of the samba server. (There may have been an incident where the AD username/group was not resolvable anymore, but this detail may have been due to faulty investigation.)

There is one further curiosity: on some of the affected servers, the SOME_GROUP is actually the name of the local samba access group (i.e. a local linux group,) and does not exist in the AD domain. On most, it is the AD domain group which is storing to the samba server.

A restart of winbind always clears the problem up.

We are running SLES 10, with the included samba build of Version 3.0.24-2.23-1296-SUSE-CODE10
Comment 1 Jeremy Allison 2008-11-25 14:03:03 UTC
3.0.24 is a very old release and there have been very many winbindd fixes between then and now. Please try and reproduce with the latest release.
Thanks,
Jeremy.
Comment 2 Rajeesh Kumar M. P. 2009-06-02 03:59:14 UTC
Infact i'm also facing the same problem with stable version samba-3.0.28-1.el5_2.1.


Let me know this is fixed
Comment 3 Kai Blin 2009-06-04 05:01:14 UTC
The current stable release is 3.3.4.
The last 3.0 release is 3.0.34.
Comment 4 Björn Jacke 2014-07-24 20:32:39 UTC
is this issue still popping up for you?
Comment 5 Björn Jacke 2016-04-29 01:15:54 UTC
no feedback. probably no issue anymore.