An hour before I was going to leave work for the long holiday weekend, I broke replication to a MySQL slave database. Go see fireworks? No thanks, I’ve made my own.
The first thing I’ve learned is that changing the master_host field in a CHANGE MASTER TO command resets your master log and position values. This will destroy your replication or corrupt your data or both! Probably both. I had recently changed a LAN IP on the master and wanted to point the slave at it, and I figured that if I didn’t touch the master log and position everything would be fine. Nope. What I should have done, and feel foolish for not doing, was write down the current master log file and position after issuing a STOP SLAVE. That’s a really good idea whenever you do anything to a slave, just in case something goes wrong. Specifically here, after seeing that the file and position were different, I could have put the correct numbers back in before restarting the slave. So, there is the first lesson,
Before a CHANGE MASTER TO, run a SHOW SLAVE STATUS\G and record the Master_Log_File and Exec_Master_Log_Pos.
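In practice, the safe sequence looks something like this. The host and coordinates below are made-up placeholders, not values from my setup:

STOP SLAVE;
SHOW SLAVE STATUS\G
-- record Master_Log_File (e.g. mysql-bin.000042) and Exec_Master_Log_Pos (e.g. 1337)
CHANGE MASTER TO
  MASTER_HOST='192.168.1.50',
  MASTER_LOG_FILE='mysql-bin.000042',
  MASTER_LOG_POS=1337;
START SLAVE;
SHOW SLAVE STATUS\G
-- confirm Slave_IO_Running and Slave_SQL_Running both say Yes

Passing MASTER_LOG_FILE and MASTER_LOG_POS explicitly means that even if the host change resets the coordinates, you are putting the correct ones right back.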
The second thing I’ve learned is that the --master-data option on a mysqldump does not do what I thought it would. It records the master data, which is what it says it does, but that’s the SHOW MASTER STATUS data, not the SHOW SLAVE STATUS data. Those are the numbers you need if you want to scp a snapshot to a potential slave and get it running. They are not the numbers you need if you take a backup on the slave and want to recover that slave after a failure. I figured this out after uncompressing, importing, and attempting to START SLAVE. This did not make me happy. To recover from this, I ended up running a backup off the master, something I would rather not do for performance reasons, but holiday weekend, and importing that backup. I haven’t looked into a long term fix for this yet. It can wait for Monday. So, there is the second lesson,
The --master-data option on mysqldump is the SHOW MASTER STATUS equivalent and not the SHOW SLAVE STATUS equivalent.
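For the record, here is roughly what that looks like. The database name and coordinates are made up:

mysqldump --master-data=2 --single-transaction mydb | gzip > backup.sql.gz

and near the top of the resulting dump you get the server’s own binlog position, i.e. the SHOW MASTER STATUS numbers:

-- CHANGE MASTER TO MASTER_LOG_FILE='mysql-bin.000042', MASTER_LOG_POS=1337;

With --master-data=2 the statement is written commented out; with --master-data=1 it is live and will run on import. Either way, run on a slave it records where that slave is in its own binlogs, which is useless for rebuilding the slave itself.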
The third thing I’ve learned, rediscovered really, is that database imports from mysqldump take a long time. A really long time. Seriously. For this, I wrote a quick script that makes things a little bit faster. The time required to import a database from a mysqldump file is the sum of the time required to import each individual table. My script parses the dump file, splits it into individual per-table files, and imports them in parallel, which cuts the total time down to that of the longest single table import. The script is hacked together, written in perl (which I’m not the best at), and missing prompts, help, and safety measures, but here it is in case you want it. You’ll need to edit the mysql command in the function in order to connect to mysql and use the correct database.
#!/usr/bin/perl
use strict;
use warnings;
use threads;

# expects the path to the gzipped db dump as the only argument
my $gzipped = $ARGV[0];
`mkdir ./dbbackupimport`;
`cp $gzipped ./dbbackupimport/backup.sql.gz`;
`gzip -d ./dbbackupimport/backup.sql.gz`;

# break the dump file up into one file per table
open(my $dumpfh, "<", "./dbbackupimport/backup.sql") or die "can't read dump: $!";
open(my $currentfh, ">", "./dbbackupimport/backupheader.sql") or die "can't write header: $!";
my @tables = ();
while (<$dumpfh>) {
    my $line = $_;
    if ($line =~ /^DROP TABLE IF EXISTS `(\w+)`/) {
        my $table = $1;
        print "Found table $table\n";
        push @tables, $table;
        close $currentfh if $currentfh;
        # include the header so the imports can disable keys
        `cat ./dbbackupimport/backupheader.sql > ./dbbackupimport/$table.sql`;
        open($currentfh, ">>", "./dbbackupimport/$table.sql") or die "can't write $table.sql: $!";
        print $currentfh "\n";
    }
    print $currentfh $line;
}
close $currentfh;

# each import thread executes this function
# edit the mysql command here to add credentials and the target database
sub import_thread {
    my ($table) = @_;
    print "mysql < ./dbbackupimport/$table.sql\n";
    `mysql < ./dbbackupimport/$table.sql`;
    return $table;
}

# spawn one import thread per table
my @threads = ();
for my $table (@tables) {
    push @threads, threads->create('import_thread', $table);
}

# collect the threads as they finish
for my $thread (@threads) {
    my $table = $thread->join();
    print "Finished importing $table\n";
}
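If you want to try it, usage would be something like this, assuming you’ve saved it as split_import.pl (name it whatever you like) and the mysql client can connect without prompting:

./split_import.pl backup.sql.gz

One caveat worth knowing before you run it: the parallel import only helps when tables are of comparable size. One giant table still sets the floor on total time, since that single import can’t be split further.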
