Background:
We are always receiving disk space alerts on a certain file system in a server owned by our team. What we usually do is to run a script (from crontab) which basically cleans up the MySQL log file. Yes, the nasty log files. But, I think, they forgot to consider having a script that trims the database/s - with entries that dates back 2006-07! Well, a colleague gave us a script that supposedly do this but it got some errors which, for now, is useless.
After checking with the senior engineers (read: oldies), it was agreed to delete some data and leave 30 days worth of records. As I dig deeper, I found out that one of the table have 177M rows - yes, that is M-illion! Hmm, it looked like I ate more than I could chew. But as usual, I know I am hard headed (sometimes, ok?) and I wouldn't give up just yet.
I used to do some maintenance when I was in my first job. I did some dumping, checking replications (my boss set it up, not me), deleting, optimize, etc. And it helped me - made me feel confident, speaking what experience can give you.
After some experiment - deletion, converting UNIX timestamp - I started my work. I deleted rows by the millions, about 10 in each run (I used ORDER BY and LIMIT functions here). You may wonder why not in one shot, well, the problem with DELETE is it locks the table and the table is being used by a monitoring tool which cannot be stopped for a long time. From SHOW PROCESSLIST, I can see a lot of INSERT statements queueing - and deleting 10M rows runs for 16+ minutes. I was so happy that I forgot that deleting doesn't equate to reducing disk space automatically. Just a side note, on the back of my mind, something tells me to mention it depends on how the table was created (?), not sure though. I'll read on this more I guess. Going back, I did reduce it to 55M+ and still see a 96% disk space usage. Outrageous! Doing some little research, I came up with a fragmented rows theory - well, originally not mine, of course, I read it somewhere. And to clean this mess up is to OPTIMIZE TABLE. But before doing it, I did some homework. What is the effect of optimization? I felt that something behind this good thing is an evil lurking! Again, the table is locked and it will create a .TMD file (temporary .MYD file - please read more on this) that can wreck havoc. It can fill-up the file system depending on how big it is and how much is left. So be very careful when doing optimization - I knew this first hand. I also ran CHECK TABLE [STATUS] which could give me some indicators if table needs some repair, or anything. 'Though, at times, this won't give you anything at all. From what I just did, it says everything is good. And yet, I got a ton of fragmented rows. Well, could be some limitation - again, read on.
After all these crappy steps, I was ready to corrupt the database. Oh before I forget, P-L-E-A-S-E if you have a test database or something, do these steps there and not on the prod. Trim the the database there then, a short downtime (stopping applications) for moving it back. So here are the simple steps I took:
1. I used this to get my target date as to set my limit later for deletion.
SELECT UNIX_TIMESTAMP('YYYY-MM-DD HH:mm:ss');
2. Now, we're ready to delete some rows with limits. This is IMPORTANT or the next thing you'll know, you just deleted the entire data!
DELETE FROM <tablename> WHERE timestamp < <number from SELECT statement here> LIMIT 10000000;
3. On this part, I optimize. You might wonder why now, not after the whole thing. Well, you run after everything depending on the amount of rows you just deleted. In this case, it just too darn big which could lock my table my a very long time (INSERTs queue up) and create a .TMD file too big that could overwhelm my file system which have a domino effect on other applications/processes that use it.
OPTIMIZE TABLE <tablename>;
Let this run to completion or it could render the table unusable or corrupted data! You've been warned. Of course, you can run this again to fix it or do a REPAIR TABLE. But who knows, you might also lose it all. As the S.A. says, "He who laughs last has a backup."
4. And then I am "maarte" so I add this. Its significance is here.
FLUSH TABLE <tablename>;
That pretty sums up what I just did. So long. And yes, please I'd like to hear from you. Corrections are always welcome. Cheers!
Still sleepy when I received an SMS from my colleague that the mysqld is not running. I am, of course, pretty sure that it is when I left. Hmm, what could be the cause? Let's see if we can find something later.
ReplyDeleteA-OK! It seems that I did no wrong after all! :P
ReplyDeletePlease note that when I checked, mysqld was started at 0800H+, local time. And the last of maintenance I did was on the Friday the 13th!
I searched for the logs that would back me up, or should I say exonerate me. And here are what I found:
...
/app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: ready for connections
091113 18:46:45 /app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: Normal shutdown
091113 18:46:46 /app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: Forcing close of thread 7 user: 'user'
091113 18:57:55 /app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: Shutdown Complete
091113 18:57:55 mysqld ended
091113 18:59:25 mysqld started # my last restart
Cannot initialize InnoDB as 'innodb_data_file_path' is not set.
If you do not want to use transactional InnoDB tables, add a line
skip-innodb
to the [mysqld] section of init parameters in your my.cnf
or my.ini. If you want to use InnoDB tables, add to the [mysqld]
section, for example,
innodb_data_file_path = ibdata1:10M:autoextend
But to get good performance you should adjust for your hardware
the InnoDB startup options listed in section 2 at
http://www.innodb.com/ibman.html
/app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: ready for connections
091116 00:24:47 mysqld started
Cannot initialize InnoDB as 'innodb_data_file_path' is not set.
If you do not want to use transactional InnoDB tables, add a line
skip-innodb
to the [mysqld] section of init parameters in your my.cnf
or my.ini. If you want to use InnoDB tables, add to the [mysqld]
section, for example,
innodb_data_file_path = ibdata1:10M:autoextend
But to get good performance you should adjust for your hardware
the InnoDB startup options listed in section 2 at
http://www.innodb.com/ibman.html
/app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: ready for connections
091116 6:00:01 /app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: Normal shutdown
091116 6:00:02 /app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld: Shutdown Complete
091116 06:00:02 mysqld ended
091116 06:00:21 mysqld started
Cannot initialize InnoDB as 'innodb_data_file_path' is not set.
...
Another log shows:
...
/app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld, Version: 3.23.52-max-log, started with:
Tcp port: 3306 Unix socket: /tmp/mysql.sock
Time Id Command Argument
091113 18:59:31 1 Connect user@server on nadb
091113 18:59:40 2 Connect user@server on nadb
.
.
.
091113 21:53:08 5 Quit
091113 21:53:16 6 Quit
091113 23:00:01 48 Connect user@server on nadb
49 Connect user@server on nadb
48 Quit
49 Quit # I believe this indicates exit from DB session
/app/gmd/sw/mysql-max-3.23.52-sun-solaris2.8-sparc/bin/mysqld, Version: 3.23.52-max-log, started with:
Tcp port: 3306 Unix socket: /tmp/mysql.sock
Time Id Command Argument
091116 0:24:51 1 Connect user@server on nadb