When I first described my new terminology for storage tiers, Tim Burlowski, asked how backup and recovery fit into that model.
It turns out that that is an interesting question.
Backup, the only working ILM/HSM solution
Step 10 feet away, and you realize that backup is HSM.
The backup process creates cold data, the backup image. The backup software them moves the backup copy to cheaper tier of storage. When a restore is performed, the backup software moves the backup copy from the cheaper tier to a more expensive tier.
The question on the table, then, is where does this form of HSM still make sense?
Captive IOPS
One obvious place where HSM as part of backup process is for the backup of data on a Captive IOPS tier. The tier is built using very expensive storage. Storing cold data that will probably never get accessed seems odd and a waste of money.
But that's not necessarily true.
If the Captive tier is built using disk drives, and the application has a high IOPS data density, a lot of the capacity of the disks is unused. The cheapest place to store backup copies, in that case, would be on the same disks you're using for primary data.
If the Captive tier is built using SSD's like EMC storage, then it makes a lot of sense to use a backup solution to move backup images off of the flash onto cheaper storage. If the application has a low IOPS data density, then a better approach is to use flash cache in front of disk.
If the Captive tier is built using a flash cache in front of disk, the cheapest place to store the backup copies may be on the disk drives.
More generally, the backup methodology of a captive tier made up of flash cache and disk, will be the same as the backup methodology of the capacity tier.
So what about the capacity tier and backup?
In the general case, the capacity tier will be backed up much in the same way that it always has been.
A capacity tier that has real snapshots can eliminate the HSM part of the backup process.
In fact, in an earlier series on backup I proved that for a VMware infrastructure, the cheapest place to store backup copies was on the same disks that stored the active data.
Now I want to generalize that comment for all storage..
If you consider IOPS density, disk drives must contain large amounts of cold data. A subset of the cold data in a data center is the backup data. That data can either be stored on a separate set of disks or on the same disks that are being used to serve IOPS.
To store the data on the same disks the following things need to be true
- Creating the backup image must consume minimal IOPS
- The existence of the backup image must impose no performance penalty
- Be able to store a large number of images.
- The storage must be configured using some form of RAID-6 or RAID 1-0
- The cost of the backup images must be cost effective
It turns out that for NetApp systems all five are true.
So as disk capacities expand, storage architects who have the option to use real snapshots will stop moving data off of the capacity tier to some cheaper tier.
Said in a slightly more poetic way:
The final resting place for data on disk will be the disk where the data first got created.
The net effect will be a much simpler and more cost effective storage infrastructure to support backup.
One minor addendunm added after I wrote this post.
As Martin G says in the comments below, having the backup copies on the local disks doesn't protect you from site failures. So, of course, some form of storage replication is required. Thankfully NetApp has a cool technology called SnapMirror that replicates all of the snapshots to a remote destination in a space efficient way.
In an earlier post on VMware and backup I show how all of this fits together.
I should have included this need to replicate the data in my original blog post. So I am rectifying that error now.

But Snapshots are not a backup by themselves; if something catastrophic happens to your source disks, you've lost your recovery point. So you can replicate to another array, put it onto tape but don't put all your eggs into one basket. Well, that's my best practise, YMMV.
Posted by: Martin G | December 16, 2008 at 01:05 AM
Kostafis -- this is dangerously bad advice for customers, if you think about it.
Storing your backup data on the same spindles, in the same array as your production data?
Sounds like a newbie mistake that you'd only make once in your career :-)
-- Chuck
Posted by: Chuck Hollis | December 16, 2008 at 05:41 AM
Martin G,
Of course you have to do some kind of DR for the data using some kind of storge replication.
I just think of that as DR for the backup not backup.
But good point. I should have made that clearer.
cheers,
kostadis
Posted by: Kostadis Roussos | December 16, 2008 at 06:16 AM
Chuck,
If you mean that we need to store the data on a remote system from the local disks, i agree. And that's what SnapMirror is for.
If you mean, that even if you can replicate the data onto remote disks, this is still a newbie mistake, then I must respectfully disagree.
cheers,
kostadis
Posted by: Kostadis Roussos | December 16, 2008 at 06:38 AM
Sorry, this wasn't at all what I was imagining. I'll have to continue this conversation on my blog.
Posted by: tim | June 04, 2009 at 02:20 PM