Using rsync to copy files

Call Us 0800 107 7979

Service Status

Back to knowledgebase

Last updated: 29 April 2021

The rsync utility is used to copy files. It can copy files locally as well as from one host to another. You can also use rsync to make backups.

Common options

rsync is almost always run with one or more options. The most common options are -a and -v. We will briefly look at these options before we get to some examples.

Archiving

The -a option is a combination of a bunch of other options:

-r  Recurse into directories  
-l  Copy symlinks as symlinks
-p  Preserve file permissions
-t  Preserve modification times
-g  Preserve group
-o  Preserve owner
-D  Preserve device and special files

This is a convenient shortcut but you don’t necessary want to use all these options. For instance, if you are syncing files to a remote system you may not want to preserve the group and owner data, as the same user and group might not exist on the remote system.

Verbose output

Another commonly used option is -v (--verbose). As the name suggest, this option makes rsync produce lots of output – it prints all the files that are being synced and a summary of the amount of data transferred. You typically want to use this option, as it tells you what rsync is doing.

Copying files

To demonstrate how to use rsync we will copy an entire WordPress site to a different directory. This is something you may want to do after you have installed WordPress via Softaculous. By default Softaculous installs WordPress in a subdirectory with the name wp. Often that is useful, as the install will be separate from any existing files in the public_html directory. However, it does mean that you have to move the files if you prefer to have the install in the public_html directory.

You can sync all the files from the public_html/wp directory to the public_html directory in seconds:

$ rsync -av /home/example/public_html/wp/ /home/example/public_html/
sending incremental file list
./
.htaccess
index.php
...
sent 44,901,625 bytes  received 36,723 bytes  29,958,898.67 bytes/sec
total size is 44,762,896  speedup is 1.00

As we added the -v option the output tells us which files were copied and how much data was transferred. Copying about 45MB took about 1.5 seconds.

rsync vs mv

It is worth mentioning that the data has been copied rather than moved. In other words, the public_html/wp directory still contains all the original files. If you just want to move files it makes more sense to use the mv utility instead. There are two things to be aware of though. The first is that mv doesn’t move dotfiles by default:

$ mv /home/example/public_html/wp/ /home/example/public_html/

$ ls -1A /home/example/public_html/wp/
.htaccess

In the above example we moved all the files, but the .htaccess file is still sitting in the wp subdirectory. In Bash, you can set the dotglob option to get mv to also move hidden files:

$ shopt -s dotglob nullglob
$ mv /home/example/public_html/wp/* /home/example/public_html/
$ ls -lA wp/
$

A second thing to note is that mv doesn’t preserve the SELinux context of files. If SELinux is enforced on your server then it is easier to use rsync, which does preserve the context.

Beware of the trailing slash

It is very important to be aware of the function of the trailing slash in the source. To illustrate the point, let’s create two directories (source and destination) and create a file named testfile in the source directory:

$ mkdir source destination
$ touch source/testfile

If you want to copy all files in the source directory to the destination directory then the source should include the trailing slash, like so:

$ rsync -av source/ destination/

The trailing slash instructs rsync to copy all files (and directories) in the source directory. Can you guess what happens if you omit the trailing slash?

$ rsync -a source destination
$ tree destination/
destination/
└── source
    └── testfile

1 directory, 1 file

Instead of copying the test file it copied the entire source directory to the destination directory. It is easy to see how you can very quickly do a lot of damage with rsync if you get the trailing slash wrong.

Dry run

If you are not quite sure about an rsync command then you can use the -n (--dry-run) option. rsync will go through the motions but not actually copy any files. You can then check the output to see if your command works as it should:

$ rsync -avn source destination
sending incremental file list
source/
source/testfile

sent 108 bytes  received 23 bytes  262.00 bytes/sec
total size is 0  speedup is 0.00 (DRY RUN)

The main thing to note in the above output is that rsync is copying the source/ directory, which is not what we wanted.

Copying files to a remote server

To copy files to a remote server you use pretty much the same syntax. In the below example we are syncing files from the /srv/www/public_html directory on our local computer to the example.net server:

$ rsync -avz /srv/www/public_html/ example@example.net:/home/example/transfers/
sending incremental file list
./
.htaccess
index.php
license.txt
...
sent 12,344,200 bytes  received 36,504 bytes  575,846.70 bytes/sec
total size is 44,750,608  speedup is 3.61

We added one new option: -z (--compress) compresses file data during the transfer. This is particularly useful when you sync files to and from a remote server, as this will always be slower than syncing files locally.

To connect to the remote server you use the username and hostname, just like you do when you connect to a remote server via SSH. You can specify the directory to which you want to sync the files by adding a colon (:) followed by the path to the destination directory. In this case the files were synced to the /home/example/transfers directory on the remote server.

Copying files from a remote server

The opposite, syncing files from a remote server to your local computer, is of course also possible. You can simply reverse the source and destination, as shown in the below example:

$ rsync -avz example@example.net:/home/example/transfers/ /srv/www/public_html/

Exclude directories

The --exclude option lets you exclude a directory. This can be useful if you are developing websites locally and have, say, a directory for backups in your document root. Using --exclude you can copy all the files, except for the backups directory:

$ rsync -avn --exclude 'backup' /srv/www/public_html/ example@example.net:/var/www/html/ | grep -c ^backup
0

Notice that we used the verbose option (-v) and did a dry run (-n). The output lists all the files that would normally be copied. As that list is very long we piped the output to grep -c ^backup, which gives us a count of the number of lines starting with the string backup. There were zero lines, which means that our command does indeed exclude the backup directory.

You can exclude files in exactly the same way. And if you need to exclude more than one file or directory you can simply use multiple --exclude options.

Using wild cards

As a bonus tip, you can also use wild cards (“globs”). For instance, you may have two directories that are not part of the website itself and which should never be copied. Let’s say you got a __backups directory for backups and a __resources directory for assets. As both directories start with two underscores you can exclude both at the same time:

$ rsync -avn --exclude '__*' /srv/www/public_html/ example@example.net:/var/www/html/ | grep -c ^__
0

The asterisk in --exclude '__*' matches both __backups and __resources. In the same way you can exclude specific file types, such as .tar.gz or .zip files:

$ rsync -avn /srv/www/public_html/ example@example.net:/var/www/html/ \
| grep -c ".*tar.gz$"
21

$ rsync -avn --exclude '*.tar.gz' /srv/www/public_html/ example@example.net:/var/www/html/ \
| grep -c ".*tar.gz$"
0

Deleting files

The --delete option deletes files from the destination that are not in the source. In other words, the file foo.txt is deleted if it exists in the destination but not in the source.

This option is typically used for non-incremental backups. For instance, here we are copying the directory /src/www/ to an external backup drive:

$ rsync -av --delete /srv/www/ /run/media/example/ext_drive/backup/

After the sync the two directories are identical. All files are copied from /src/www/ to the backup drive and any files that exist on the backup drive but not in /srv/www/ directory are deleted.

As an aside, strictly speaking it is not true that all the files in the /src/www/ directory are copied. Any files that already exist on the backup drive are skipped. If you want you can force rsync to overwrite files that already exist at the destination using the -I (--ignore-times) option.

More information

By catalyst2 Team

As companies scale their operations and seek to maintain a seamless online presence, many consider upgrading to a dedicated server. This option offers greater control, enhanced security, and improved performance; all crucial elements for growing businesses. Given these advantages, it’s no surprise that dedicated server hosting has become a popular choice. Deciding if it’s worth …

Read Article

What our clients say

Great real person support – direct phone number, usually the same individual so any problems are handled by the same people. Excellent.

Daniel Chandler, Vindico UK Ltd

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.

Necessary

Always Enabled

Functional

Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.

Cookie	Duration	Description
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Performance

Analytics

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_5562310_11	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.

Others

Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.

Cookie	Duration	Description
_ashkii	session	No description available.
_wicasa	3 months	No description available.
AnalyticsSyncHistory	1 month	No description
cookid	3 months	No description available.
cookietest	session	No description
crisp-client/domain-detect/1644827320973	session	No description
crisp-client/domain-detect/1644827348275	session	No description
crisp-client/domain-detect/1644827428415	session	No description
crisp-client/domain-detect/1644827479357	session	No description
crisp-client/domain-detect/1644827596454	session	No description
crisp-client/domain-detect/1644827724838	session	No description
crisp-client/domain-detect/1644827824383	session	No description
crisp-client/domain-detect/1644827878659	session	No description
crisp-client/domain-detect/1644828716243	session	No description
crisp-client/domain-detect/1644828846246	session	No description
crisp-client/domain-detect/1644829369013	session	No description
crisp-clientsession30cc6953-ebcf-4bc6-b649-c44eb446409e	6 months	No description
dbmFP	3 months	No description available.
dbmPK	3 months	No description available.
li_gc	2 years	No description