15261 – without "oplocks=no" smbd processes skyrocket in memory usage and get killed by oomkiller

Bug 15261 - without "oplocks=no" smbd processes skyrocket in memory usage and get killed by oomkiller

Summary: without "oplocks=no" smbd processes skyrocket in memory usage and get killed ...

Status:	ASSIGNED

Alias:	None

Product:	Samba 4.1 and newer
Classification:	Unclassified
Component:	File services (show other bugs)
Version:	4.20.0
Hardware:	All All

Importance:	P5 normal (vote)
Target Milestone:	---
Assignee:	Ralph Böhme
QA Contact:	Samba QA Contact

URL:
Keywords:

Depends on:
Blocks:

Reported:	2022-12-08 15:49 UTC by roland
Modified:	2024-04-24 12:57 UTC (History)
CC List:	3 users (show)

See Also:

Attachments
WIP patch for 4.13 (1.80 KB, patch) 2022-12-08 17:36 UTC, Ralph Böhme	no flags	Details
View All Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description roland 2022-12-08 15:49:34 UTC

i'm testing samba as a backup target for proxmox 7.3 vzdump backup via 10gigE

without "oplocks = no" (i.e. default settings) smbd processes go nuts , grow to >>5gb in size and then getting all killed by oomkiller.

i'm using debian11 default smb.conf with only basic share added like

[sharename]
        valid users=username
        writeable=yes
        path=/path/to/zfs/dataset

i found the problem goes away when setting "oplocks = no" (via https://www.taste-of-it.de/samba-smb-process-crashes-with-memory-leak/ )

more details is being reported here:  https://forum.proxmox.com/threads/smbd-memory-leak.119199/


i cannot believe this is correct behaviour.  

unix processes should not receiving data from the network without limitation until they either burst or getting killed externally, no matter how fast the network or the storage can deliver troughput

Comment 1 Ralph Böhme 2022-12-08 16:26:14 UTC

This smells like a known issue that basically our async SMB read/write processing is subtly broken and results in letting the client run loose and out of control of the SMB creditting mechanism that is supposed to throttle clients.

We should likely NOT returns async interim responses to read and write SMB requests, but that's what we do currently. This needs some decent research and validation and unfortunately so far noone has put the required resources into this.

Would you be able to test a simple patch on top of the sources of your Samba version?

Comment 2 roland 2022-12-08 16:47:21 UTC

>Would you be able to test a simple patch on top of the sources of your Samba version?

yes, i can give it a try and like to help resolving this

Comment 3 Ralph Böhme 2022-12-08 17:36:14 UTC

Created attachment 17683 [details]
WIP patch for 4.13

WIP patch that should do the trick. Needs more research to compare against Windows behaviour.

There's likely already another bugreport that discusses this.

Comment 4 roland 2022-12-08 22:27:33 UTC

i tested the patch and i see no real difference. 

i could only test with 1 remote writer, but i see the VSZ climb >5GB and RSS >1,5GB

Comment 5 roland 2022-12-08 23:26:19 UTC

>There's likely already another bugreport that discusses this.

i guess you meant this?

https://lists.samba.org/archive/samba/2021-September/237262.html


it seems there is no entry in bugzilla for this

Comment 6 Ralph Böhme 2022-12-09 09:50:08 UTC

(In reply to roland from comment #4)
Oh, so now what? :)

I'd say as 4.13 is EOL the next sensible step would be updating to 4.17 to check whether the issue is still present in the latest release.

Comment 7 roland 2022-12-09 10:08:35 UTC

ok. will test and report

Comment 8 roland 2022-12-09 17:09:40 UTC

problem also happens with samba 4.17.3-Debian

Comment 9 Ralph Böhme 2022-12-09 17:31:08 UTC

(In reply to roland from comment #8)
That's unfortunate. You can follow the instructions in
https://lists.samba.org/archive/samba/2021-September/237295.html
to check the `smbcontrol PID pool-usage > pool-usage.txt` output if the memory consumption is really caused by the IO buffers.

If not, someone has to take a closer look at the pool-usage output.

You can also try the big hammer of disabling async IO as described at the end of the mail linked above. Maybe it's something else altogether.

Comment 10 roland 2022-12-09 18:04:41 UTC

i have pasted pool-usage of some >1gb smbd process at https://paste.debian.net/1263460/

i see no pthreadpool_tevent_job_state entries

>You can also try the big hammer of disabling async IO as 
>described at the end of the mail linked above. Maybe it's 
>something else altogether.

mind that oplocks=no seems to resolve the problem. 

i don't like disabling async io, as there is zfs underneath and i have no ZIL with that pool for accelerating sync writes

Comment 11 roland 2022-12-09 18:14:10 UTC

mhh, i don't see anything in that, maybe i need to issue the command while the process is growing, i.e. while data is in flight, and not afterwards. will do later on

Comment 12 Ralph Böhme 2022-12-09 18:15:27 UTC

(In reply to roland from comment #10)
Nothing in the talloc report. Next would be running smbd under valgrind with memcheck.

Comment 13 roland 2022-12-10 13:21:58 UTC

so, while data is in flight things look different. 

i could not upload to pastbin, because file to large, so i uploaded to https://www.file-upload.net/download-15056180/pool-usage3.txt.html

there are 1949 occurences of the following structs:

aio_extra , aio_req_fsp_link , pthreadpool_tevent_job_state , pwrite_fsync_state , smbd_smb2_request , smbd_smb2_write_state, smb_request, smb_vfs_call_pwrite_state

Comment 14 roland 2022-12-13 20:45:39 UTC

shouldn't this problem be reproducable whith gigabit networking if the storage where the samba share resides is slow enough?

if so, i would like to try reproducing it that way

to be honest, i think this bug is quite serious, and i'm wondering why there are not more users affected.  samba is such popular tool and so widely used.

Comment 15 roland 2022-12-14 13:50:05 UTC

i could reproduce the problem with a virtual machine on proxmox, exporting a samba share, mounted on the host via bridge interface, i.e. without phyiscal nic in between.

virtual machine network has been throttled to 1gbit and virtual machines virtual disk has been throttled to a lower bandwith of <100mb/s

i could reproduce the problem with this settings, so this should not be a high-performance-network-only bug

Comment 16 Volker Lendecke 2022-12-14 17:26:42 UTC

The new pool report confirms pretty much what Ralph said in comment 1. We should not turn smb2_read and smb2_write requests async in the smb2 sense, even if we do process them asynchronously internally.

Could you test a patch if we sent it to you?

Comment 17 Volker Lendecke 2022-12-14 17:28:12 UTC

By the way, Ralph attached the patch in comment 3. This still applies to master and thus 4.17. Can you give that a try?

Comment 18 roland 2022-12-15 13:32:57 UTC

thanks. i gave it a try and i can still reproduce the problem

Comment 19 Volker Lendecke 2022-12-15 13:48:59 UTC

(In reply to roland from comment #18)
> thanks. i gave it a try and i can still reproduce the problem

With that patch in place, does setting "smb2 max credits = 512" make a difference? The default is 8192, which means 8192*64k worth of buffers. If you play with the credits and set them much lower, this should throttle the clients.

Comment 20 roland 2022-12-15 16:23:35 UTC

yes 512 seems to make a difference, it's much harder to push smbd to excess memory usage.

i was able to push it to 1gb rss and 5gb vsz with this, but no further. i'm  testing with a single linux cifs client with multiple dd writers, writing zeroes or garbage to the samba share

Comment 21 Forest 2024-04-12 23:35:19 UTC

I can reproduce this on Samba 4.17.12 and 4.19.5 (Debian Stable), by using cp to copy a ~50GB file to a cifs-mounted samba share on a low-power ARM server.

The target filesystem is on an encrypted volume of a relatively slow NAS disk, which I imagine leads to some backpressure on writes. Gigabit ethernet.

When smbd's memory usage pushes into swap, which is on the same disk, the transfer drags to a crawl. If it's allowed to continue for more than a few minutes, the client's cp process becomes effectively unkillable.

The problem is avoided with oplocks = no, as reported here:
https://www.omnespro.ch/post/samba-extremer-ram-fussabdruck-bei-grossen-dateien/