United States
Site Map Contacts Hitachi Global Community
Hu's Blog - Data Storage and Virtualization Thought Leader Hitachi - Inspire the Next

Hu Yoshida's Blog - Vice President | Chief Technology Officer

Home > Corporate > HDS Blogs > HDS Bloggers > Hu's Blog
Products, Solutions and more

Hu's Blog

A Recipe for Deduplicated, Data Replication

by Hu Yoshida on Jul 14, 2006

Awhile back I posted a blog, Data Replication: When push comes to pull, where I described our Hitachi Universal Replicator, as a pull technology for asynchronous replication. With HUR the remote site USP pulls data from the production site USP based on a time stamped journal. The benefit being the reduction of resource consumption at the production USP, by transferring the majority of the workload for asynchronous replication to the remote USP. I compared that to feeding my daughter when she was a baby, where I had to "push" the food versus feeding her when she was older and able to feed (pull) herself.

I also extended the analogy to the present time where she is an adult, living away from home, and able to cook for herself. At this time I only need to send a recipe to her to recreate the same meal that I cook at home. In other words, I was alluding to a future time when we could do replication of data without having to send all of the data to the remote site. I would only need to send meta data to recreate the data. Wouldn’t it be great if we could replicate 250TB of data at a remote site, thousands of miles away, without having to actually send the entire 250TB! 

This week HDS announced the availability of ProtecTIER deduplication which can be combined with our Universal Replicator technology to make this a reality.

ProtecTIER has a very efficient indexing schema which enables a petabyte of data to be indexed by 4GB of memory. The power of this index has been shown to provide around a 25:1 reduction in stored data due to deduplication of redundant data patterns. This is not only a tremendous savings in storage cost, it also reduces the cost and effort of replication.

Now I can replicate 250TB of data to a remote site, by sending only 10TB of deduplicated data, a tremendous saving in bandwidth and an even more tremendous savings in our most valuable resource, time!

Sending deduplicated data with HUR is like sending a recipe to my daughter.

Related Posts Plugin for WordPress, Blogger...

Comments (2 )

Victoria on 18 Jul 2006 at 6:55 pm

Hi Hu, just wanted to point your readers to an article that just came out on you (unabashedly promoted, since I wrote it…ahem).

Well. :D


Jeremiah Owyang on 18 Jul 2006 at 7:16 pm

Victoria, Great write-up, it’s easy to understand even for non-technical folks!

Look forward to reading more of your work.

Hu Yoshida - Storage Virtualization Thought LeaderMust-read IT Blog

Hu Yoshida
Vice President and Chief Technology Officer

Connect with Us


Switch to our mobile site