Bug 2696 - Filenames with non-english characters are displayed as '?'
Summary: Filenames with non-english characters are displayed as '?'
Status: CLOSED FIXED
Alias: None
Product: rsync
Classification: Unclassified
Component: core (show other bugs)
Version: 2.6.4
Hardware: All Linux
: P3 normal (vote)
Target Milestone: ---
Assignee: Wayne Davison
QA Contact: Rsync QA Contact
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2005-05-10 02:24 UTC by Peter Englmaier
Modified: 2005-06-03 01:39 UTC (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Peter Englmaier 2005-05-10 02:24:07 UTC
Dear Rsync Developers:

We use rsync to make backups. Files with special characters like german umlaute
are displayed with '?' for the special characters. We would like to have
an option for displaying the original characters because the list is
processed by other programs. I.e.: --log-characterset=full

Even better would be options for saving file names by category, i.e.:

--log-modified=file    all modified files (no dirs)
--log-deleted-files=file
--log-deleted-dirs=file

This would greatly simplify backup programs build on rsync

Best,
Peter.
Comment 1 Wayne Davison 2005-05-10 09:42:33 UTC
Please try out one of the nightly tar files from the rsync web site (or the
upcoming 2.6.5 pre-release).  It obscures fewer characters (since rsync now uses
the user's locale information), and those that it tweaks are escaped using
backslash escapes that allow the filename to be reconstructed by a script, e.g.:

    $filename =~ s/\\(\d\d\d|\\)/ $1 eq "\\" ? $1 : chr(oct($1)) /eg;

See also the --log-format option for a better way to get the list of filenames
that rsync has operated on (rather than depending on the --verbose output) and
the -i option (an easy way to specify --log-format="%i %n%L") for more
information, and the "%o" escape as an alternative to "%i" (if you don't want
tweaked, but not-transferred files to be logged).