Skip Links

Network World

  • Social Web 
  • Email 
  • Close

Saving space with de-duplication

The newest alternative for saving space
Storage Alert By Mike Karp , Network World , 11/02/2006
Sign up for this newsletter now!

Storage analyst Deni Connor focuses on storage, application and infrastructure management in this twice-weekly newsletter.

  • Share/Email
  • Tweet This
  • Comment
  • Print

With the steadily increasing amounts of data that reside on enterprise systems, it’s little wonder that we look for any help we can get to tame the data beast.

For years now, when moving old data off to tape, or when transmitting data across a network, we have been able to take advantage of various compression techniques that reduce the number of bits required to save or move the data. Also, for almost 20 years now we have been able to compress file formats with tools such as ZIP and ARC.

Data compression of course has at least one drawback - the compression and decompression necessary as a front end to a read or write process impose a performance penalty on the system, so each decision to do compression becomes something of a tradeoff between performance and space savings.

The newest alternative for saving space these days is data de-duplication, which is a wholly different approach than compression. De-duplication reduces storage by ensuring that only a single instance of each file (or, with some implementations, a file block) exists. Redundant files and blocks are never written to disk, but are replaced by reference pointers, allowing blocks - or even whole files - to be represented by just a few bytes of information.

De-duplication technology is sold as a part of several companies’ product sets, often appearing as an important feature in a virtual tape library (VTL) or continuous data protection (CDP) solution. I am unaware of anyone offering a stand-alone data de-duplication product, although there is no reason why that couldn’t be done.

Smaller companies have taken the lead when it comes to de-duping. Asigra, Diligent and Permabit, for example, have de-duplication as a part of their VTL or CDP offerings, and Mimosa provides it with its NearPoint e-mail archiving product. (Diligent CEO Doron Kempel discusses de-duplication on Network World's Hot Seat.)

The value of de-duplication is easy to measure. Effectively reducing the total amount of data you have to manage means that, for a while at least, you will be able to defer some of the knee-jerk storage purchasing that many of you seem to do.

If you plan on your site’s data growing at 25% per quarter and are presently buying accordingly, that probably means the reset of your hardware storage purchases can be deferred for at least two quarters. If, as Diligent claims, they can reduce the total data a company must manage by as much as 75%, you might extend this to three quarters. This can be a significant savings indeed - perhaps with very short term payback - and leaves you with more easily managed storage over the long run.

Deni Connor is principal analyst for Storage Strategies NOW.

  • Share/Email
  • Tweet This
  • Comment
  • Print
Partner Content

Explore the Ultrium Edge

The powerful tape technology can address data security with tape encryption as well as long term data protection.

Find Out More

Disk and Tape Square Off

Discover what disk and tape really cost and which solution provides lower total cost of ownership and optimizes energy use for your organization

Download this White Paper

Don't Fall for the Myths

The Clipper Group explores the truth behind the myths of tape, digging into the misconceptions in the disk vs. tape debate.

Review this information

information examination

An examination of information security issues, methods and securing data with LTO-4 tape drive encryption

Read this analysis

Comment
Login
Forgot your account info?
Add comment
Anonymous comments subject to approval. Register here for member benefits.
Have a NetworkWorld account? Log in here. Register now for a free account.

Videos

rssRss Feed