How to speed up rsync?
Matthew Harrington
I'm running rsync to sync a directory onto my external USB HDD. It's about 150 gigs of data. 50000+ files I would guess.
It's running it's first sync at the moment, but its copying files at a rate of only 1-5 MB/s. That seems incredibly slow for a USB 2.0 enclosure. There are no other transfers happening on the drive either.
Here are the options I used:
rsync -avz --progress /mysourcefolder /mytargetfolderI'm running Ubuntu Server 9.10.
310 Answers
For the first sync just use
cp -a /mysourcefolder /mytargetfolderrsync only adds overhead when the destination is empty.
also.. the -z option is probably killing your performance, you shouldn't be using it if you are not transfering data over a slow link.
5If you're using rsync with a fast network or disk to disk in the same machine,
not using compression -z
and using --inplace
speeds it up to the performance of the harddrives or network
compression uses lots of CPU
not using inplace makes the harddrive thrash alot (it uses a temp file before creating the final)
compression & not using inplace is better for doing it over the internet (slow network)
NEW: Be aware of the destination... if there is NTFS "compression" enabled... this severely slows down large files (I'd say 200MB+) rsync almost seems stalled, it's caused by this.
4Use the -W option. This disables delta/diff comparisons. When the file time/sizes differ, rsync copies the whole file.
Also remove the -z option. This is only useful for compressing network traffic.
Now rsync should be as fast as cp.
First - the number of files in this case is going to be a major factor. It's an average size of 3MB each. There's probably an io bottleneck influencing the speed in the OP's case. More here - that's a pretty dry read, but the cover picture is worth it.
So, using rsync to copy to an empty directory? Here are some ways to speed it up:
- No -z - definitely don't use -z as in the OP.
- --no-compress might speed you up. This could have the biggest impact... my test was 13,000 files, total size 200MB, and using rsync 3.1.3. I synced to a different partition on the same internal SSD drive. With --no-compress, I get 18 MBps, and without it I get 15 MBps. cp, by the way, gets 16 MBps. That's a much smaller average file size though. Also - I can't find any documentation for --no-compress. I learned about it from this post on stackexchange.com.
- -W to copy files whole - always use this if you don't want it to compare differences; never mind that the point of rsync is to compare differences and only update the changes.
- -S to handle sparse files well - can't hurt if you don't have sparse files.
- --exclude-from or something similar to exclude files you might not need will cut down the time, but it won't increase your transfer speed.
- It's possible if you send the output to a file like this
rsync -a /source /destination >/somewhere/rsync.out 2>/somewhere/rsync.err- the first > basically prints a file with all the stuff you would normally see, and the 2> refers to error messages. - Finally, running multiple instances of rsync for different parts of your transfer could be a big help.
My command would be:
rsync -avAXEWSlHh /source /destination --no-compress --info=progress2 --dry-run
If all looked well, I'd delete "--dry-run" and let it go. A, X, and E cover extended attributes and permissions not covered by -a, l is for soft links, H is for hard links, and h is for human readable.
Updating an already synced directory on a USB drive, or the same drive, or over a network, will all require different rsync commands to maximize transfer speed.
Bonus - here's the rsync man page, and if you want to test your hard drive speed, bonnie++ is a good option, and for your network speed, try iperf.
*The post is almost ten years old, but search engines sure like it, and I keep seeing it. It's a good question, and I don't think the top answer to "how to speed up rsync" should be "use cp instead."
5You definitely want to give rclone a try. This thing is crazy fast :
$ tree /usr [...] 26105 directories, 293208 files
$ sudo rclone sync /usr /home/fred/temp -P -L --transfers 64
Transferred: 17.929G / 17.929 GBytes, 100%, 165.692 MBytes/s, ETA 0s Errors: 75 (retrying may help) Checks: 691078 / 691078, 100% Transferred: 345539 / 345539, 100% Elapsed time: 1m50.8s
This is a local copy from and to a LITEONIT LCS-256 (256GB) SSD.
You can add --ignore-checksum on the first run to make it even more faster.
1Avoid
-z/--compress: compression will only load up the CPU as the transfer isn't over a network but over RAM.--append-verify: resume an interrupted transfer. This sounds like a good idea, but it has the dangerous failure case: any destination file the same size (or greater) than the source will be IGNORED. Also, it checksums the whole file at the end, meaning no significant speed up over--no-whole-filewhile adding a dangerous failure case.
Use
-S/--sparse: turn sequences of nulls into sparse blocks--partialor-Pwhich is--partial --progress: save any partially transferred files for future resuming. Note: files won't have a temporary name, so ensure that nothing else is expecting to use the destination until the whole copy has completed.--no-whole-fileso that anything that needs to be resent uses delta transfer. Reading half of a partially transferred file is often much quicker than writing it again.--inplaceto avoid file copy (but only if nothing is reading the destination until the whole transfer completes)
You don't say what size distribution your files have. If there are many small files then this will reduce overall transfer rate by increasing head movement latency in both the source and destination drives as the tool opens new files and the OS keeps directory entries and other metadata (such as the filesystem's journal if you are using meta-data journaling like ext3/ext4 and NTFS do by default) up to date during the transfer. A file copy proces will only "get into its stride" for larger objects, when a simple bulk transfer is happening.
i found this as my answer
rsync -aHAXxv --numeric-ids --progress -e 'ssh -T -c -o Compression=no -x ' <source_dir> user@<host>:<dest_dir>Some explanation regarding switches below
rsync (Everyone seems to like -z, but it is much slower for me)
a: archive mode - rescursive, preserves owner, preserves permissions, preserves modification times, preserves group, copies symlinks as symlinks, preserves device files.
H: preserves hard-links
A: preserves ACLs
X: preserves extended attributes
x: don't cross file-system boundaries
v: increase verbosity
--numeric-ds: don't map uid/gid values by user/group name
--progress: show progress during transferssh
T: turn off pseudo-tty to decrease cpu load on destination.
c : use the weakest but fastest SSH encryption.
o Compression=no: Turn off SSH compression.
x: turn off X forwarding if it is on by default. 2 I don't have any reason to suspect rsync is the culprit for this slow speed.
I would suspect that the drive itself or its filesystem is the issue.
In fact, a few factors hint at a possible cause. You mention it's a USB drive, and you're transferring 150GB at once. I would hazard a guess that your USB drive uses Shingled Magnetic Recording (SMR).
SMR drives overlap the tracks in such a way that reading is done as normal, but every write to the drive, since it overwrites the neighbouring tracks as well, requires the drive to re-write a sizeable chunk of the drive, and the writing is slow. The miracle is that these drives perform as normal most of the time, because the drive remaps writes to a temporary holding area that doesn't use SMR, and then re-writes them to the drive later in the background. But this holding area is only a few GB in size, so for any continuous transfers longer than, say, 10GB, the write speed drops dramatically down to around 5MB/s. If you stop writing to the drive and let it idle for about 10 minutes, its performance will return to normal because it has had time to clear its temporary holding area, but will fall again if you do another multi-GB sized continuous write.
Despite some comments otherwise, rsync is nice and fast when operating locally, altering its setup accordingly. When operating locally, rsync uses no delta transfer. And, since compressing the transfer would make no sense when it's essentially just over a local pipe, I doubt the -z has any effect, although the documentation doesn't say. Even if it does, it shouldn't be clamping the speed to <5MB/s, as its compression would be capable of many times that speed.
And, --inplace only affects rsync's behaviour when the file exists at the destination and only part of the file needs to be updated. Since delta transfer is disabled when rsync is operating locally, I believe it should have no effect.
Avoid using the -v, --verbose option, except when debugging. Especially when redirecting the output to some file. In my case, running rsync with -v was 8x slower than letting it run silently. Consider using the --log-file parameter instead.