WARNING - This package modifies database tables. Use with extreme caution and back up databases before running.
DbSync is a tool for efficiently comparing and synchronising two or more remote MySQL database tables.
In order to do this without comparing every byte of data, the tool preforms a checksum (MD5, SHA1, CRC32) over a range of rows on both the source and destination tables, and compares only the hash. If a block is found to have an inconsistency in a block, the tool performs a recursive checksum on each half of the block (down to a minumum block transfer size) until it finds the inconsistency.
DbSync will only delete rows from the destination that no longer exist on the source when the --delete
option is specified. Use this option with extreme caution. Always perform a dry run first.
If you use DbSync to synchronise a table which has row deletions on the source without using the --delete
option, DbSync will find inconsistencies in any block with a deleted row on every run but will not be able to remove the rows from the target.
Via composer - run the following command in your project directory:
composer require mrjgreen/db-sync
Or use the packaged archive directly
wget https://github.com/mrjgreen/db-sync/raw/v3/db-sync.phar -O db-sync.phar
chmod a+x db-sync.phar
Optionally make the command available globally
sudo mv db-sync.phar /usr/bin/db-sync
Usage:
db-sync [options] [--] <source> <target> <table>
Sync a mysql database table from one host to another using an efficient checksum algorithm to find differences.
Arguments:
source The source host ip to use.
target The target host ip to use.
table The fully qualified database table to sync.
Options:
-b, --block-size=BLOCK-SIZE The maximum block to use for when comparing. [default: 1024]
--charset=CHARSET The charset to use for database connections. [default: "utf8"]
-c, --columns=COLUMNS Columns to sync - all columns not "ignored" will be included if not specified. Primary key columns will be included automatically. (multiple values allowed)
-C, --comparison=COMPARISON Columns from the list of synced columns to use to create the hash - all columns not "ignored" will be included if not specified. Primary key columns will be included automatically. (multiple values allowed)
-f, --config-file=CONFIG-FILE A path to a config.ini file from which to read values. [default: "dbsync.ini"]
--delete Remove rows from the target table that do not exist in the source.
-e, --execute Perform the data write on non-matching blocks.
-h, --help Show this usage information.
-i, --ignore-columns=IGNORE-COLUMNS Columns to ignore. Will not be copied or used to create the hash. (multiple values allowed)
-I, --ignore-comparison=IGNORE-COMPARISON Columns to ignore from the hash. Columns will still be copied. (multiple values allowed)
-p, --password[=PASSWORD] The password for the specified user. Will be solicited on the tty if not given.
-u, --user=USER The name of the user to connect with. [default: <CURRENT USER>]
-s, --transfer-size=TRANSFER-SIZE The maximum copy size to use for when comparing. [default: 8]
--target.user=TARGET.USER The name of the user to connect to the target host with if different to the source.
--target.table=TARGET.TABLE The name of the table on the target host if different to the source.
--target.password=TARGET.PASSWORD The password for the target host if the target user is specified. Will be solicited on the tty if not given.
--where=WHERE A where clause to apply to the tables.
Sync the table web.customers
from one host to another:
db-sync --user root --password mypass 127.0.0.1 111.222.3.44 web.customers
Sync the table web.customers
from one host to another, deleting rows from the target that no longer exist on the source:
db-sync --user root --password mypass --delete 127.0.0.1 111.222.3.44 web.customers
Sync the table web.customers
from one host to another using different credentials:
db-sync --user root --password mypass --target.user admin --target.password password 127.0.0.1 111.222.3.44 web.customers:
Sync only the email
and name
fields from the table web.customers
:
NB. The primary key will automatically be included in the column set
db-sync --user root --password mypass 127.0.0.1 111.222.3.44 web.customers -c email -c name
Sync every column except for the updated_at
fields from the table web.customers
:
db-sync --user root --password mypass 127.0.0.1 111.222.3.44 web.customers -i updated_at
Sync every column from the table web.customers
but only use the updated_at
fields when calculating the hash:
Inconsistencies in other fields will not be detected. In the event of a hash inconsistency in fields which are included, the excluded fields will still be copied to the target host.
db-sync --user root --password mypass 127.0.0.1 111.222.3.44 web.customers -C updated_at
Sync every column from the table web.customers
and use all fields except for the notes
or info
fields when calculating the hash:
Inconsistencies in excluded fields will not be detected. In the event of a hash inconsistency in fields which are included, the excluded fields will still be copied to the target host.
This is especially useful for tables with long text fields that don't change after initial insert, or which are associated with an
on update CURRENT_TIMESTAMP
field. For large tables this can offer a big performance boost.
db-sync --user root --password mypass 127.0.0.1 111.222.3.44 web.customers -I notes -I info
Sync the table web.customers
to a table under a different name in a different database web_backup.customers_2
:
db-sync --user root --password mypass --target.table web_backup.customers_2 127.0.0.1 111.222.3.44 web.customers
To avoid having to specify options repeatedly, and to avoid exposing your password on the tty you can specify a config file.
By default DbSync will look for a file named dbsync.ini
in the current working directory.
Example:
user=root
password=mypass
target.user=admin
target.pass=myadminpass
You can include the library within your project and use the components directly:
$sync = new DbSync(new Transfer(new Md5Hash(), $blockSize, $transferSize));
$sync->setLogger(new YourPsrLogger());
$sync->dryRun(false);
$sync->delete(true);
$sync->sync(
new Table($sourceConnection, $sourceDb, $sourceTable),
new Table($targetConnection, $targetDb, $targetTable),
new ColumnConfiguration($syncColumns, $ignoreColumns)
);
- 100% test coverage via full stack integration tests
- Allow option to delete data from target where not present on the source
- Use symfony console command for sync
- Option to re-try with back-off on lock wait timeouts
- Option to create missing tables on target
- Option to skip duplicate key errors
- Speed up initial sync of empty table - Maybe offer combination with other tool for full fast outfile based replacement
PHP 5.4 or above PDO MySQL Extension
DbSync is licensed under the MIT License - see the LICENSE file for details
- Inspiration for this project came from the Percona Tools
pt-table-sync
.