MySQL的逻辑备份工具--mydumper
0. summary
1. 下载和安装
2. 参数
3. 导出示例
4. 如何实现一致性的多个线程导出
5. 原理参考
1. 下载和安装 ### 下载 ####
git clone https://github.com/maxbube/mydumper
#### 安装需要的包 ####
yum -y install cmake
yum -y install glib2-devel
yum -y install pcre-devel
yum -y install zlib-devel
yum -y install openssl-devel
yum -y install gcc-c++
#### 安装过程 ####
[root@test-1 mydumper]# cmake .
-- The CXX compiler identification is GNU 4.8.5
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Using mysql-config: /usr/local/mysql/bin/mysql_config
-- Found MySQL: /usr/local/mysql/include, /usr/local/mysql/lib/libmysqlclient.so;
/usr/lib64/libpthread.so;
/usr/lib64/libm.so;
/usr/lib64/librt.so;
/usr/lib64/libdl.soCMake Warning at docs/CMakeLists.txt:9 (message):
Unable to find Sphinx documentation generator-- ------------------------------------------------
-- MYSQL_CONFIG = /usr/local/mysql/bin/mysql_config
-- CMAKE_INSTALL_PREFIX = /usr/local
-- BUILD_DOCS = ON
-- WITH_BINLOG = OFF
-- RUN_CPPCHECK = OFF
-- Change a values with: cmake -D=
-- ------------------------------------------------
--
-- Configuring done
-- Generating done
-- Build files have been written to: /usr/local/mydumper
[root@test-1 mydumper]# echo $?
0
[root@test-1 mydumper]# make
Scanning dependencies of target mydumper
[ 25%] Building C object CMakeFiles/mydumper.dir/mydumper.c.o
[ 50%] Building C object CMakeFiles/mydumper.dir/server_detect.c.o
[ 75%] Building C object CMakeFiles/mydumper.dir/g_unix_signal.c.o
Linking C executable mydumper
[ 75%] Built target mydumper
Scanning dependencies of target myloader
[100%] Building C object CMakeFiles/myloader.dir/myloader.c.o
Linking C executable myloader
[100%] Built target myloader
#### 添加环境变量 ####
export PATH=/usr/local/mysql/bin:/usr/local/mydumper:$PATH
2. 参数
[root@test-1 ~]# mydumper --help
Usage:
mydumper [OPTION...] multi-threaded MySQL dumpingHelp Options:
-?, --helpShow help optionsApplication Options:
-B, --databaseDatabase to dump
-T, --tables-listComma delimited table list to dump (does not exclude regex option)
-o, --outputdirDirectory to output files to
-s, --statement-sizeAttempted size of INSERT statement in bytes, default 1000000
-r, --rowsTry to split tables into chunks of this many rows. This option turns off --chunk-filesize
-F, --chunk-filesizeSplit tables into chunks of this output file size. This value is in MB
-c, --compressCompress output files
-e, --build-empty-filesBuild dump files even if no data available from table
-x, --regexRegular expression for 'db.table' matching
-i, --ignore-enginesComma delimited list of storage engines to ignore
-m, --no-schemasDo not dump table schemas with the data
-d, --no-dataDo not dump table data
-G, --triggersDump triggers
-E, --eventsDump events
-R, --routinesDump stored procedures and functions
-W, --no-viewsDo not dump VIEWs
-k, --no-locksDo not execute the temporary shared read lock.WARNING: This will cause inconsistent backups
--no-backup-locksDo not use Percona backup locks
--less-lockingMinimize locking time on InnoDB tables.
-l, --long-query-guardSet long query timer in seconds, default 60
-K, --kill-long-queriesKill long running queries (instead of aborting)
-D, --daemonEnable daemon mode
-I, --snapshot-intervalInterval between each dump snapshot (in minutes), requires --daemon, default 60
-L, --logfileLog file name to use, by default stdout is used
--tz-utcSET TIME_ZONE='+00:00' at top of dump to allow dumping of TIMESTAMP data when a server has data in different time zones or data is being moved between servers with different time zones, defaults to on use --skip-tz-utc to disable.
--skip-tz-utc
--use-savepointsUse savepoints to reduce metadata locking issues, needs SUPER privilege
--success-on-1146Not increment error count and Warning instead of Critical in case of table doesn't exist
--lock-all-tablesUse LOCK TABLE for all, instead of FTWRL
-U, --updated-sinceUse Update_time to dump only tables updated in the last U days
--trx-consistency-onlyTransactional consistency only
--complete-insertUse complete INSERT statements that include column names
-h, --hostThe host to connect to
-u, --userUsername with the necessary privileges
-p, --passwordUser password
-P, --portTCP/IP port to connect to
-S, --socketUNIX domain socket file to use for connection
-t, --threadsNumber of threads to use, default 4
-C, --compress-protocolUse compression on the MySQL connection
-V, --versionShow the program version and exit
-v, --verboseVerbosity of output, 0 = silent, 1 = errors, 2 = warnings, 3 = info, default 2
--defaults-fileUse a specific defaults file
常用参数:
-D, --daemon Enable daemon mode ---- 开启后台
--trx-consistency-only Transactional consistency only ---- 一致性导出
[root@test-1 mdata]# myloader --help
Usage:
myloader [OPTION...] multi-threaded MySQL loaderHelp Options:
-?, --helpShow help optionsApplication Options:
-d, --directoryDirectory of the dump to import
-q, --queries-per-transactionNumber of queries per transaction, default 1000
-o, --overwrite-tablesDrop tables if they already exist
-B, --databaseAn alternative database to restore into
-s, --source-dbDatabase to restore
-e, --enable-binlogEnable binary logging of the restore data
-h, --hostThe host to connect to
-u, --userUsername with the necessary privileges
-p, --passwordUser password
-P, --portTCP/IP port to connect to
-S, --socketUNIX domain socket file to use for connection
-t, --threadsNumber of threads to use, default 4
-C, --compress-protocolUse compression on the MySQL connection
-V, --versionShow the program version and exit
-v, --verboseVerbosity of output, 0 = silent, 1 = errors, 2 = warnings, 3 = info, default 2
--defaults-fileUse a specific defaults file
也可以用mysql <来导入,但是开不了并行,使用myloader就很简单,指定directory就可以了,默认并行4.
3. 导出示例
[root@test-1 mdata]# mydumper -B mytest
导出会在当前目录下建立export加上时间戳的目录
[root@test-1 mdata]# ls -ld export-*
drwx------. 2 root root 4096 Jan 28 12:16 export-20170128-121638
[root@test-1 mdata]# cd export-20170128-121638/
[root@test-1 export-20170128-121638]# ls
metadatamytest.c-schema.sqlmytest.mytest-schema.sqlmytest.t1-schema.sqlmytest.t-schema.sqlmytest.v-schema.sql
mytest.a-schema.sqlmytest.c.sqlmytest.mytest.sqlmytest.test_load-schema.sqlmytest.t.sqlmytest.z-schema.sql
mytest.a.sqlmytest.d-schema.sqlmytest-schema-create.sqlmytest.test_load.sqlmytest.u-schema.sqlmytest.z.sql
mytest.b-schema.sqlmytest.mytest1-schema.sqlmytest.stock-schema.sqlmytest.test-schema.sqlmytest.UserInfo-schema.sqlmytest.zz-schema.sql
mytest.b.sqlmytest.mytest1.sqlmytest.stock.sqlmytest.test.sqlmytest.UserInfo.sqlmytest.zz.sql
对于每张表,都有表结构和插入数据两个sql文件
[root@test-1 export-20170128-121638]# cat mytest.a-schema.sql
/*!40101 SET NAMES binary*/;
/*!40014 SET FOREIGN_KEY_CHECKS=0*/;
CREATE TABLE `a` (
`a` int(11) NOT NULL,
`b` int(11) DEFAULT NULL,
`c` int(11) DEFAULT NULL,
`d` int(11) DEFAULT NULL,
PRIMARY KEY (`a`),
UNIQUE KEY `b` (`b`),
KEY `c` (`c`),
KEY `d` (`d`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8mb4;
[root@test-1 export-20170128-121638]# cat mytest.a.sql
/*!40101 SET NAMES binary*/;
/*!40014 SET FOREIGN_KEY_CHECKS=0*/;
/*!40103 SET TIME_ZONE='+00:00' */;
INSERT INTO `a` VALUES
(1,4,1,1),
(2,2,2,2),
(3,5,3,3),
(5,7,8,9);
还有个metadata的文件,记录当前的filename、postition和GTID
[root@test-1 export-20170128-121638]# cat metadata
Started dump at: 2017-01-28 12:16:38
SHOW MASTER STATUS:
Log: bin.000084
Pos: 194
GTID:713a7f7f-6f53-11e6-b7a9-000c29de5d8b:1-284340Finished dump at: 2017-01-28 12:16:38
这么导的好处有两点:
- 恢复的时候开多个并行导入。
- 通过一个备份里面恢复指定的表,而并不希望全部恢复,mydumper的情况下非常简单。mysql官方的导入导出工具很难办到。
mydumper -t 4 -r 1000 --trx-consistency-only -B sbtest -T sbtest1
【MySQL的逻辑备份工具--mydumper】通过通用日志跟踪的方法
(root@localhost) [mysql]> set global general_log = 0;
Query OK, 0 rows affected (0.02 sec)(root@localhost) [mysql]> truncate table general_log;
Query OK, 0 rows affected (0.01 sec)(root@localhost) [mysql]> set global general_log = 1;
Query OK, 0 rows affected (0.00 sec)(root@localhost) [mysql]> set global log_output = 'TABLE';
Query OK, 0 rows affected (0.01 sec)
另一个session执行导出
[root@test-1 mdata]# mydumper -t 4 -r 10000 --trx-consistency-only -B sbtest -T sbtest1** (mydumper:57204): WARNING **: Using trx_consistency_only, binlog coordinates will not be accurate if you are writing to non transactional tables.
关闭通用日志
(root@localhost) [mysql]> set global general_log = 0;
Query OK, 0 rows affected (0.00 sec)
查看通用日志
(root@localhost) [mysql]> select thread_id,left(argument,120) from general_log order by event_time limit 64;
+-----------+--------------------------------------------------------------------------------------------------------------+
| thread_id | left(argument,120)|
+-----------+--------------------------------------------------------------------------------------------------------------+
|43 | /*!40101 SET NAMES binary*/|
|44 | /*!40101 SET NAMES binary*/|
|45 | /*!40101 SET NAMES binary*/|
|46 | /*!40101 SET NAMES binary*/|
|44 | SELECT /*!40001 SQL_NO_CACHE */ * FROM `sbtest`.`sbtest1` WHERE `id` IS NULL OR (`id` >= 1 AND `id` < 12501) |
|45 | SELECT /*!40001 SQL_NO_CACHE */ * FROM `sbtest`.`sbtest1` WHERE (`id` >= 12501 AND `id` < 25001)|
|43 | SELECT /*!40001 SQL_NO_CACHE */ * FROM `sbtest`.`sbtest1` WHERE (`id` >= 25001 AND `id` < 37501)|
|46 | SELECT /*!40001 SQL_NO_CACHE */ * FROM `sbtest`.`sbtest1` WHERE (`id` >= 37501 AND `id` < 50001)|
|44 | SHOW CREATE TABLE `sbtest`.`sbtest1`|
|43 ||
|44 ||
|45 ||
|46 ||
|42 | root@localhost on sbtest using Socket|
|42 | SET SESSION wait_timeout = 2147483|
|42 | SET SESSION net_write_timeout = 2147483|
|42 | SHOW PROCESSLIST|
|42 | FLUSH TABLES WITH READ LOCK|
|42 | START TRANSACTION /*!40108 WITH CONSISTENT SNAPSHOT */|---- 一个线程去执行实例级别的读锁,其他线程读到的就是一致性的数据。
|42 | /*!40101 SET NAMES binary*/|
|42 | SHOW MASTER STATUS|
|42 | SHOW SLAVE STATUS|
|43 | root@localhost onusing Socket|
|43 | SET SESSION wait_timeout = 2147483|
|43 | SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ|
|43 | START TRANSACTION /*!40108 WITH CONSISTENT SNAPSHOT */|
|43 | /*!40103 SET TIME_ZONE='+00:00' */|
|44 | root@localhost onusing Socket|
|44 | SET SESSION wait_timeout = 2147483|
|44 | SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ|
|44 | START TRANSACTION /*!40108 WITH CONSISTENT SNAPSHOT */|
|44 | /*!40103 SET TIME_ZONE='+00:00' */|
|45 | root@localhost onusing Socket|
|45 | SET SESSION wait_timeout = 2147483|
|45 | SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ|
|45 | START TRANSACTION /*!40108 WITH CONSISTENT SNAPSHOT */|
|45 | /*!40103 SET TIME_ZONE='+00:00' */|
|46 | root@localhost onusing Socket|
|46 | SET SESSION wait_timeout = 2147483|
|46 | SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ|---- 有4个线程开启了RR事务隔离级别
|46 | START TRANSACTION /*!40108 WITH CONSISTENT SNAPSHOT */|---- 创建快照读
|46 | /*!40103 SET TIME_ZONE='+00:00' */|
|42 | UNLOCK TABLES /* trx-only */|---- 释放了实例级别的读锁
|42 | sbtest|
|42 | SHOW TABLE STATUS|
|42 | SHOW INDEX FROM `sbtest`.`sbtest1`|
|42 | SELECT /*!40001 SQL_NO_CACHE */ MIN(`id`),MAX(`id`) FROM `sbtest`.`sbtest1`|---- 对自增列求最大最小值进行一个分片
|42 | EXPLAIN SELECT `id` FROM `sbtest`.`sbtest1`|
|42 | SHOW CREATE DATABASE `sbtest`|
|42 ||
|21 | set global general_log = 0|
+-----------+--------------------------------------------------------------------------------------------------------------+
51 rows in set (0.01 sec)
导出来的是多个文件,所以支持单表的并行导入
[root@test-1 export-20170130-000445]# ls -ltr
total 9728
-rw-r--r--. 1 root root68 Jan 30 00:04 sbtest-schema-create.sql
-rw-r--r--. 1 root root 2476583 Jan 30 00:04 sbtest.sbtest1.00000.sql
-rw-r--r--. 1 root root419 Jan 30 00:04 sbtest.sbtest1-schema.sql
-rw-r--r--. 1 root root 2487688 Jan 30 00:04 sbtest.sbtest1.00001.sql
-rw-r--r--. 1 root root 2487689 Jan 30 00:04 sbtest.sbtest1.00002.sql
-rw-r--r--. 1 root root 2487687 Jan 30 00:04 sbtest.sbtest1.00003.sql
-rw-r--r--. 1 root root175 Jan 30 00:04 metadata
5. 原理参考 开源MYSQL多线程逻辑导出工具MYDUMPER原理与改进:
http://www.innomysql.com/article/25456.html
推荐阅读
- 热闹中的孤独
- JAVA(抽象类与接口的区别&重载与重写&内存泄漏)
- 放屁有这三个特征的,请注意啦!这说明你的身体毒素太多
- 一个人的旅行,三亚
- 布丽吉特,人生绝对的赢家
- 慢慢的美丽
- 尽力
- 一个小故事,我的思考。
- 家乡的那条小河
- 《真与假的困惑》???|《真与假的困惑》??? ——致良知是一种伟大的力量