Bug 10123 - libsmbclient referral issue with Windows DFS
Summary: libsmbclient referral issue with Windows DFS
Status: RESOLVED FIXED
Alias: None
Product: Samba 3.6
Classification: Unclassified
Component: libsmbclient (show other bugs)
Version: 3.6.3
Hardware: All All
: P5 major
Target Milestone: ---
Assignee: Karolin Seeger
QA Contact: Samba QA Contact
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-09-02 09:32 UTC by Mahabaleshwar
Modified: 2015-02-05 20:45 UTC (History)
3 users (show)

See Also:


Attachments
Patch proposal for fixing the issue related to DFS link (4.04 KB, patch)
2014-03-18 05:47 UTC, hargagan
no flags Details
new patch addressing points in comment#3 (6.91 KB, patch)
2015-01-14 17:36 UTC, David Disseldorp
no flags Details
simple test, based on examples/libsmbclient/teststat3.c (1.25 KB, text/plain)
2015-01-16 13:28 UTC, David Disseldorp
no flags Details
fix for 4-2-test branch, cherry-picked from master (7.02 KB, patch)
2015-01-19 15:44 UTC, David Disseldorp
jra: review+
Details
fix for 4-1-test branch, cherry-picked from master (7.02 KB, patch)
2015-01-19 15:45 UTC, David Disseldorp
jra: review+
Details
fix for 4-0-test branch, slightly modified from master - no smbXcli_tcon context (6.96 KB, patch)
2015-01-19 15:59 UTC, David Disseldorp
jra: review+
Details
fix for 3-6-stable branch, modified from master - just in case it's useful to others maintaining the 3.6 series (6.48 KB, patch)
2015-01-19 16:27 UTC, David Disseldorp
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Mahabaleshwar 2013-09-02 09:32:25 UTC
We are using libsmbclient to talk to Windows share, It works perfectly fine with regular DFS environment. But in case of DFS with replication environment, we are getting some issues with the way referrals are managed, 


Here is what we do 
>> 
file = smbc_getFunctionOpen(c)(c, smb_path, O_RDWR|O_TRUNC, 0666); 
.. 
.. 
if((smbc_getFunctionFstat(c)(c, file, pSt)) < 0) 
.. 
.. 
while(//chunk by chunk)  
writtenLen = smbc_getFunctionWrite(c)(c, file, writebuf, len); 
>> 


Our understanding is, trans2_reqeust Get_Dfs_referral is made for each of the
smbclient api's. AD returns referrals in different order for subsequent calls
and the request ends up on to a different member server. And so in the above
case smbc_write fails with EBADF (errno 9) 


cli_resolve_path always calls cli_dfs_get_referral and is taking the first
referral from the responded referral list.. 


Is there any control over the referrals? We basically want every call to stick
to one server always (until the server is active)..
Comment 1 hargagan 2014-03-18 05:47:26 UTC
Created attachment 9785 [details]
Patch proposal for fixing the issue related to DFS link

The issue mentioned in the description is found to be happening due to libsmbclient resolving to referrals on every request. So if SMBC_Open() goes to one referral server and does connection, the SMBC_Read() will try to get the referral list again. It will cause confusion as the context obtained (like tid) in the previous connection will not go well with the new server connection. 

To fix this, now looking for the connection and re-using it. This change also include checking for the server alive status before using that connection.
Comment 2 Björn Jacke 2014-11-04 08:26:45 UTC
metze: do you have some time to have a look at the patch here? This looks a bit tricky...
Comment 3 David Disseldorp 2015-01-14 16:05:09 UTC
(In reply to hargagan from comment #1)

In general I like the patch, but there are a few things I'd like to change before pushing:
- fix build failure from the d_printf("Unable to follow dfs referral" statement.
- free allocated dfs_refs memory in exit paths.
- Remove the cached connection check (cli_echo).
  + IMO it shouldn't be done at this layer.
- move cm_find / cm_connect logic closer together.
- fix minor white-space / formatting issues.
Comment 4 David Disseldorp 2015-01-14 17:36:59 UTC
Created attachment 10620 [details]
new patch addressing points in comment#3

I'll start work on a simple test for this.
Comment 5 David Disseldorp 2015-01-16 13:28:45 UTC
Created attachment 10624 [details]
simple test, based on examples/libsmbclient/teststat3.c

This test successfully reproduces the issue and demonstrates the effectiveness of the fix.
Comment 6 David Disseldorp 2015-01-19 15:43:25 UTC
Comment on attachment 10620 [details]
new patch addressing points in comment#3

Fix is now upstream - marking obsolete. Maintenance back-ports to follow.
Comment 7 David Disseldorp 2015-01-19 15:44:56 UTC
Created attachment 10632 [details]
fix for 4-2-test branch, cherry-picked from master
Comment 8 David Disseldorp 2015-01-19 15:45:30 UTC
Created attachment 10633 [details]
fix for 4-1-test branch, cherry-picked from master
Comment 9 David Disseldorp 2015-01-19 15:59:38 UTC
Created attachment 10634 [details]
fix for 4-0-test branch, slightly modified from master - no smbXcli_tcon context
Comment 10 David Disseldorp 2015-01-19 16:27:00 UTC
Created attachment 10635 [details]
fix for 3-6-stable branch, modified from master - just in case it's useful to others maintaining the 3.6 series
Comment 11 Jeremy Allison 2015-01-20 16:59:24 UTC
Comment on attachment 10632 [details]
fix for 4-2-test branch, cherry-picked from master

LGTM.
Comment 12 Jeremy Allison 2015-01-20 17:11:38 UTC
Comment on attachment 10633 [details]
fix for 4-1-test branch, cherry-picked from master

LGTM.
Comment 13 Jeremy Allison 2015-01-20 17:16:54 UTC
Comment on attachment 10634 [details]
fix for 4-0-test branch, slightly modified from master - no smbXcli_tcon context

LGTM.
Comment 14 Jeremy Allison 2015-01-20 17:17:26 UTC
Re-assigning to Karolin to include in 4.0.next, 4.1.next, 4.2.0.
Comment 15 Karolin Seeger 2015-01-24 20:49:46 UTC
Pushed to autobuild-v4-[0|1|2]-test.
Comment 16 Karolin Seeger 2015-02-05 20:45:13 UTC
Pushed to all branches.
Closing out bug report.

Thanks!