« Shooting Fish in a Barrel | Main | Unified Host Utilities Kit 5.0 for ESX Server is Released »

December 12, 2008

Comments

Hi Nick

Given your large installed base, I have no doubt whatsoever that you've found cases where your dedupe scheme works as advertised.

Genomic data, for example, strikes me as highly redundant. Lots of VMs with redundant binaries, sure -- I'll grant you that. Can't speak to the other use cases, though.

I think you're missing the key point here, so I'll try one more time.

Everyone has an issue with the cheap marketing stunt of a "guarantee" that was obviously constructed by a lawyer.

One where it's assumed you'll be switching from RAID 10 to RAID 6, that you won't be using email, you won't be using databases, you won't be using any data that's already compressed like PowerPoint and PDFs, you will need to use expensive professional services, etc. etc.

Not much of a guarantee, right?

I think all of us are happy to acknowledge and debate the various pros and cons of NetApp's dedupe. No problem there.

What we're not pleased with is the sleazy marketing campaign. If EMC were to do something similarly tacky, I'd probably quit in protest.

-- Chuck

@Chuck

Genome data dedupes very badly. Unless identical DNA sequences are being stored, which is something most try to avoid for obvious reasons -- these files can be monstrous in size. The normal approach is to compress with specialized routines that give 20:1 or higher, as the usual compression techniques only give about 3:1 ratios or thereabouts.

Best use cases are backup data, VMware, geoseismic, etc; page 16 of this http://media.netapp.com/documents/tr-3505.pdf gives the kind of %ages that might be expected. Deduplication has pretty broad application, and not specific to just one or two specialized data types. Duke Institute for Genome Sciences are using dedupe for VMware, as Nick noted.

I note your displeasure with some amusement. It is, after all, an optional guarantee. If buyers of storage don't wish to avail themselves of it, they don't need to purchase it. I certainly don't see why NetApp giving customers guarantees should cause you so much publically expressed heartburn. Certainly not on the grounds you quote; as Gartner note here (http://mediaproducts.gartner.com/gc/webletter/netapp/issue19/gartner2.html) all we're ensuring is best practice (something I know you are keen on).

And to your objection that the agreement appears to have been constructed by a lawyer, I can only say; clara pacta, boni amici. Clear agreements, good friends.

Chuck probably has a proof showing that it can't possibly be true:

http://blogs.netapp.com/simple_steve/2008/12/reasoning-with.html

If sleazy and tacky behavior are the thresholds at which anyone resigns from employment at EMC then the parking lots at 176 South St. should have been empty years ago.


-Tim-

Hi Nick,

I have 2 3040c clusters...running VMWare over NFS and we are seeing a savings of 77% from ASIS or DeDup. I also have another 2TB volume that we use for student home directories (over 13,000 home folders for students) where we are seeing a 17% savings with dedup.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

© NetApp, Inc.  |  "Safe Harbor" Statement  |  Privacy Policy