Bug 15379 - Spotlight fails if search starts from path containing an Umlaut
Summary: Spotlight fails if search starts from path containing an Umlaut
Status: NEW
Alias: None
Product: Samba 4.1 and newer
Classification: Unclassified
Component: Other (show other bugs)
Version: 4.18.2
Hardware: x64 Linux
: P5 normal (vote)
Target Milestone: ---
Assignee: Samba QA Contact
QA Contact: Samba QA Contact
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-05-27 18:39 UTC by M Weber
Modified: 2024-08-19 14:43 UTC (History)
2 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description M Weber 2023-05-27 18:39:41 UTC
Depending on the startingpoint of a Spotlight search in Macos Finder, the search fails.

### Env
smbd -V
Version 4.18.2

Fruits:
    fruit:veto_appledouble = yes
    fruit:aapl = yes
    fruit:nfs_aces = no
    fruit:metadata = stream
    fruit:resource = xattr
    fruit:copyfile = yes
    vfs objects = catia fruit streams_xattr recycle shadow_copy2

Samba Spotlight Backend: Elasticsearch
Macos Ventura 13.2
Filesystem: Ubuntu included ZFS


### Setup
/myshare/MYSUBDIR_Ä/files/doc1.pdf
/myshare/MYSUBDIR/files/doc2.pdf


### Success and Fail
Doing the following in Macos Finder: enter directory and search

if starting my Spotlight search from folder "/myshare", both "doc1.pdf and "doc2.pdf" CAN be found by name.

if starting my Spotlight search from folder "/myshare/MYSUBDIR", then "doc2.pdf" CAN be found by name.

if starting my Spotlight search from folder "/myshare/MYSUBDIR_Ä", then "doc1.pdf" CANNOT be found by name.


### My guess
I suspect that Spotlight sends a directory string that is somehow corrupted when reaching samba.
Other characters may suffer from this too.
Comment 1 M Weber 2023-05-29 12:46:54 UTC
I suspect combined diaeresis.
My folder "MYSUBDIR_Ä" contains that nasty double character.
This is however not in the elasticsearch db.

It seems samba produces these, as the folder name seen by Linux is a normal single character.

Is there a way to disable combined diaeresis in Samba, so the Mac gets a real Umlaut in the first place?

Thanks
M
Comment 2 M Weber 2023-05-29 12:48:31 UTC
Note:
Updated to Ventura 13.4.
Bug persists.
Comment 3 Perttu Aaltonen 2024-07-10 09:53:16 UTC
I ran into this as well but in my case the share name has the umlaut. Fast searches don't work at all and Finder starts a full directory travelsal.

Removing the umlaut from the share name instantly fixes the issue, even without reindexing the directory structure. But obviously Samba should support special characters since native macOS SMB & Spotlight do as well.

Hopefully we get a fix at some point.