Tag Archive for backup

Disaster Recovery on Demand

Our prior post, A Roadmap to High-Value Cloud Infrastructure: Disaster Recovery and Data Protection, discussed both the benefits and limitations of a cloud-based disaster recovery (DR) strategy. As we highlighted last week, traditional disaster recovery options leave open a huge hole: At one extreme are high-cost, quick recovery time options using hosted or colocation sites and at the other extreme are low-cost, long recovery time options such as tape or disk backup.

As a result, organizations who cannot afford a cold or hot standby site have been left to either cobble together a complicated and risky DR strategy or rely solely on offsite data protection, which affords them limited business continuity.

You’ll recall that we outlined three cloud-based options that are beginning to fill this hole:

  • Recovering applications on virtual servers through a cloud provider’s catalog
  • Recovering virtual machines directly in cloud compute
  • Importing on-premises virtual machines into the cloud via conversion scripts and tools

Unfortunately, these cloud options, while viable (and certainly better than data recovery only options), still present an extraordinary amount of risk and complexity – exactly the opposite of what you want during a disaster.

Last week, we promised you a fourth answer that helps you address those issues. Today, TwinStrata launched a new solution: on-demand DR infrastructure that is completely interoperable with on-premises virtualized environments. Unlike traditional DR strategies, CloudArray DRaaS overcomes the hurdles to DR on-demand, offering:

  • 2-4 hour spin up times of DR infrastructure, whenever it is needed, without having to pay for it in the meantime. The only time organizations need to pay for the infrastructure is during disaster and disaster test scenarios.
  • A VMware environment that work seamlessly with customer’s on-premises environment. All controls and privileges are available to customers so that they can orchestrate failover and failback as seamlessly as an on-premise to on-premise environment.

CloudArray with DR On-Demand

  • No need for a time-consuming spin up and rebuild of individual servers from a cloud provider’s catalog. Instead virtual machines can be recovered via standard hypervisor tools.
  • No need for risky conversion utilities to move virtual machines from one hypervisor format to another, potentially compromising recoverability.

Most importantly, it positions an affordable DR offering that rivals and sometimes even improves upon the recovery times of cold standby disaster sites or hot-standby DR sites for a fraction of the cost.

Disaster Recovery Options: Traditional vs. TwinStrata

The far-reaching implication is a viable disaster recovery plan for virtually any organization, particularly the IT-constrained mid-market organization that may not have the wherewithal to build or manage a secondary data center. Moreover, smaller organizations struggling with a DR plan can now be compliant with regulatory standards (i.e. HIPAA, FINRA, etc.) that require them to have a disaster recovery and business continuity plan. Larger organizations, which already have a DR plan, can still benefit from substantial cost savings. Either way, an affordable, enterprise-class disaster recovery strategy has just become attainable for virtually any organization.

You can read the full press release for more information or join us for our webinar on May 30th, “Affordable, On-Demand Disaster Recovery: Introducing CloudArray DRaaS.”

Embrace Storage Hogs with Cloud Storage

Do you know who your storage hogs are? Every organization has them, whether it’s folks who save everything or applications that just need more capacity every year. And let’s face it: The need for increased storage capacity isn’t just a result of your company pack rats – it’s often a necessity due to growing data sets, data retention policies and regulatory compliance requirements.

On average, organizations expand their need for storage by 20-60% on an annual basis, outgrowing capacity of existing storage regularly. Can cloud storage help contain the costs of this spiraling growth?

With cloud storage gateways like TwinStrata CloudArray, many organizations successfully manage growing capacity without the additional storage upgrades, floor space and maintenance headaches associated with traditional on-premise infrastructure. Starting with backups, archives or secondary tiers of storage, you can begin to offload growing data to the cloud and build a plan to keep your on-premise capacity growth in check. You can spend less time trying to control and chastise your storage hogs, and spend more time building your business.

Now while we wouldn’t suggest letting storage users go “hog wild,” we do suggest learning more about how cloud storage can ease managing capacity growth. To that end, TwinStrata is giving away free storage hogs (pictured above) to visitors to our CloudExpo Booth #339 at the Javits Center, NY, June 11-14. Please mention this blog post to receive your very own while they last.

Visit us at CloudExpo and learn how easy it is to manage your data using the cloud.

Download a Free Copy of the Info-Tech Cloud Backup Vendor Landscape Report

Info-Tech Research Group

Info-Tech Research Group recently published a very thorough 30+ page cloud storage vendor landscape report, the first of its kind that compares cloud storage gateways and appliances head-to-head. The report compares product offerings from a number of vendors including CoreVault, CTERA, Nasuni, Panzura, Storsimple, TwinStrata and Zmanda, highlighting the strengths and weaknesses of each respective solution. It goes on to choose a set of champions based on specific use cases that include:

  • Mounting Storage to Cloud IaaS Servers
  • Unstructured Data/NAS Replacement
  • Global Distributed Enterprises
  • Private Clouds

You can download a FREE copy of the report here.

I should mention that this report was not commissioned or sponsored by TwinStrata and highlights the strengths and weaknesses of all solutions, including ours.

If you are using or considering adding cloud storage to your IT environment, we think you will find this report very illuminating.

Clouds, Consistency, and Progress Bars

I have the bad habit of staring at progress bars.

I was backing up my Mac laptop to a CloudArray volume.¹ With gigabit ethernet, a full backup to a CloudArray volume takes less time than my local USB drive. Apple’s Time Machine actually only does a full backup once, followed by hourly incrementals that are rolled together into dailies, weeklies, and synthetic fulls. That’s a fantastic model for the cloud, since it saves a lot on bandwidth, but I usually tear down most of my volumes and run the full backup again. It’s a good way for me to keep an eye on a number of different variables that can affect CloudArray performance.

Anyway, I set up a backup volume and sat back to watch the progress bars. Here’s a good one:

Flush the first

At this point, the backup on my laptop was mostly done. You can see that my CloudArray cache still had 35 gigs of dirty data, and it was just starting to work on flushing 8 gigs out to the cloud. Also, I’d been staring too long, and popped off to do important CTO-type stuff.

A few minutes later, important CTO-type stuff being done, I checked back in on my progress bar:

flush2

The same flush was still in progress, and it was mostly done. But wait! The cache still reports 35 gigs of dirty data! (Actually, 35.1… the operating system hadn’t finished flushing its own cache the last time I checked.) But if an 8 gigabyte flush was mostly done, shouldn’t the cache be almost 8 gigs cleaner? What ever can be going on?

The answer, of course, is a teachable moment.

I’ve been building storage arrays of one type or another for pretty much my whole career. The most important aspect of any array’s firmware is its consistency model, by which I mean: how does it ensure that the data that it stores accurately represents the data that the host applications wrote? If an application writes “AB” to the disk, how does the firmware ensure that the next time it reads from that disk, it gets back “AB”? That is absolutely the most fundamental requirement of a storage system: everything else is just icing.

That might not sound like that hard of a problem, but the nuances of storing data in a complex, shared, networked controller can be subtle. For example, if my application writes “A”, then “B”, then “C” to different locations, I always want to return A, B, C for those locations. But if you add in a cache to the controller, and assume that the cache will fail (you always assume that every hardware component will eventually fail), then it’s not enough to just store the data in the cache. If you are implementing a write-back cache, you have to store information about the order in which those writes occur, so that the underlying backing store (a physical disk, say) gets those writes in the same order. Otherwise, when that cache fails, your application might read back A and C, but not B.

Why is that a problem? What if your application is a database, (A, B) is a credit card transaction, and C is the database checkpoint? In that case, your database will correctly read A, read corrupted data in place of B, and C will tell it that the corrupted junk is just fine. That’s bad.

If your cache firmware is well implemented, though, and only gets the chance to write two blocks before the cache hardware fails, then it will write A and B. Now, when your database tries to reread the data, it’ll find (A, B), but without that crucial C, it’ll do a proper rollback of the transaction.

In CloudArray, we’ve got an added complication: our backing store is not a local physical drive. It’s a massively scalable set of redundant data centers probably located a thousand miles away from our cache. The performance difference between our local devices and the cloud is several orders of magnitude. So how can we maintain consistency?

The answer lies in our rather complex representation of block devices as objects. First, we notice that strict write ordering is not an absolute necessity. We simply need to ensure that our data in the cloud represents some state that existed in our virtual volume, so that if C is present in the cloud, then (A, B) is there, too, but we don’t need to represent each of the intermediate states (A), (A, B), (A, B, C). Then, we have to partition our incoming data into sequences that can represent transitions between these states: these sequences are what we call a flush, and we try to design those partitions to maximize bandwidth utilization while also minimizing the temporal distance between state transitions. Finally, after we’ve transmitted a flush to the cloud, we have to perform an atomic commit on our representation, so that the new state of the cloud is entirely consistent.

And we have to do that in a way that is mindful of the architecture of cloud storage systems, which are often designed around the (not at all scary and in fact quite cool in a nerdy way, in spite of what some people say) eventual consistency model.

What’s all that got to do with my progress bar?

Well, in order to make sure that our cloud data maintains consistency, especially in the presence of sometimes quite flaky networks, we can’t clean out our cache until we’ve successfully committed and verified the most recent state transition, i.e. the last flush. So my progress bar is not really indicating the amount of data that’s been emptied out of the cache: it only tells me how much of the most recent state has been transmitted to the cloud. The data can’t be marked clean in the cache until the actual, final commit has been completed.

So what happens when the flush completes? Let’s see:

flush3

Huh. There it is. The cache now has only 27.1 gigabytes of dirty pages left. Mission accomplished.

And if my CloudArray were to experience some kind of catastrophe right now, like some dastardly CTO yanking out a cache storage device, what would happen? Once I restored it to operation, then Time Machine would pull the nice, consistent image out of the cloud, notice the missing 27.1 gigs, and pick right up from there. Like I said, it’s a nice piece of software, but it does rely on consistent storage.

¹It’s pretty easy to set up a Time Machine backup using the the Studio Network Solutions globalSAN iSCSI initiator for OS X: just install it, point it at a CloudArray, and voila! Up pops whatever capacity I need. Launch Time Machine, set the CloudArray volume as the target disk, and I’ve got a whole bunch of progress bars to stare at.

How to Back Up Your Cloud Storage

Protecting your data in the cloud with a perspective on the Gmail outage

Recently, Howard Marks posted an article on Network Computing entitled “Can Cloud Snapshots Replace Backup?” Howard’s key implication in the article is that cloud storage gateways, onramps or enablers that reside on-premise can replace conventional backup with snapshots for primary data stored in the cloud. However, the article leaves open a point of exposure around the reliability of the cloud storage provider if both the primary copy of data and snapshots are in the cloud. In that case, the storage provider is responsible for the safekeeping of both your primary and backup data.

Let’s take a closer look at what may possibly go wrong with a cloud storage provider and your primary data in the cloud.

  1. For starters, a provider could experience data loss. Better providers offer redundancy and multi-datacenter replication that reduce or eliminate risk of disk or data center failures. For instance, Amazon claims 99.999999999% durability of data stored on S3.  Assuming the cloud provider you choose maintains best practices around redundancy and data protection, the odds of data loss due to hardware of site failures are rather low.
  2. The more likely scenario is that a provider suffers an outage which prevents data access. To be fair, better providers offer 99.9% or even 99.99% availability, but none offer a 100% guarantee against outages, so  a cloud provider outage could range from a temporary annoyance to a painful business outage. More 9′s of availability help, but also keep in mind your network availability is also a limiting factor.
  3. A third, though less likely, case is that the cloud storage provider goes out of business. Again this would depend highly on the quality of the storage provider, which may leave users a migration path, such as when EMC Atmos Online stopped supporting production customers. Whether or not there is a migration path, this case should be part of any risk analysis when it comes to storing business critical data in the cloud.
  4. Finally, the world is still buzzing from thousands of Gmail users losing access to email accounts in late February, that was blamed on a bad software update. The words of Google vice president of engineering and site reliability Ben Treynor are quite telling on how Google is recovering these customers: “To protect your information from these unusual bugs, we also back it up to tape. Since the tapes are offline, they’re protected from such software bugs.” The  sobering message is that just because data is stored in the cloud, it is not exempt from software bugs or even human error, and may still require an offline storage tier for recovery.

Although snapshots in the cloud can functionally replace backup, how can we address the paranoid system administrators who are now white-knuckled at the thought of the 4 aforementioned risks? SLAs or a refund of monthly fees may be a small consolation in the face of lost data.

Is there a foolproof way to eliminate any business continuity dependency on the cloud provider? It turns out there are in fact a few:

  • Keep a local data copy. This is almost too simple, but some cloud storage enablers allow you to keep full local copies of your data along with snapshots in the cloud. Some, like CloudArray, even allow you to grow a local cache to ensure entire data sets reside on-premise. Additional local storage tends to be cheap, while off-site storage is very expensive, making this a compelling way to continue operations even if a cloud provider suffers an outage.
  • Use multiple cloud storage providers. Some cloud storage enablers support multiple cloud storage providers. If so, replicating across providers is relatively simple, albeit at double the capacity and bandwidth costs. The additional cost may be a bit much to swallow, reducing the viability of this alternative.
  • Periodically copy data/snapshots off-cloud. This may sound familiar to the backup administrator. Recently, Marcel van den Berg published design guidelines for Veeam backup and stated “As a number one rule I would advise to store the backup on a different storage platform than the storage platform which is being protected.” The notion of where a storage platform in the cloud begins or ends is admittedly vague. It’s not clear that storing backups in the same cloud observes this fundamental rule. Enter “off-cloud” copies or backups which can reside on-premise or on a separate cloud provider and the problem can be solved. The notion of having primary data in the cloud and backups local may sound strange at first, but is actually a very viable option that is supported by cloud storage enablers like CloudArray and provides protection against both on-premise and cloud disasters.

With these alternatives, it is viable to store primary data in the cloud while minimizing risks of cloud provider failures, software reliability and even human error.

No data center is perfect, which is what makes off-site backup and disaster recovery (DR) planning a requirement for most organizations. Likewise, storing primary data in the cloud is no exemption from proper DR planning and best practices.

Bottom line: If you are considering using a cloud storage enabler, gateway or onramp to store primary data in the cloud, consider working with a product that minimizes your risk and a company that has already thought through and addressed the possible risks and DR contingency options.

Do you use cloud snapshots for backup? Let us know…

Higher Education Lunch Session: Learn How To Capitalize on Cloud Storage Today

 

Is Data Growth Getting You Down? Then Look Up To the Cloud!

  • Thursday, February 10, 2001 
  • TwinStrata Offices
  • 24 Prime Parkway, Suite 301A
  • Natick, MA 01760
  • 9:30 AM to 1:00 PM 

TwinStrata Can Help Colleges & Universities Capitalize on Cloud Storage…Simply, Securely, and Affordably

Are you looking for ways to battle shrinking budgets yet still address the need for ever-increasing storage capacity, efficient off-site data protection and affordable disaster recovery across your campuses?

Using TwinStrata CloudArray®, an affordable iSCSI data protection and disaster recovery cloud storage solution, can bring relief. It’s non-intrusive and non-disruptive, requiring no “rip and replace” of any existing application or infrastructure. 

Attend our Higher Education Lunch session on Thursday, February 10, 2011 and learn how TwinStrata can help you:

  • Turn capex into opex  
  • Balance your capacity demand and cost
  • Seamlessly scale IT using cloud storage
  • Reduce IT complexity and increase IT efficiency

The program will include:

  • Lunch
  • A $25 VISA Gift card for attending
  • An opportunity to meet our Development and Management Teams
  • Participation in a CloudArray workshop
  • A grand prize drawing (must be in attendance to win)

TwinStrata and PEER 1 Team Up to Deliver Enterprise Class Cloud Storage

 

TwinStrata conintues to broaden its partner ecosystem with leading cloud storage providers. Here is yet another example…

NATICK, Mass. and VANCOUVER, British Columbia, Jan. 31, 2011 /PRNewswire/ — TwinStrata, Inc., the leading innovator in iSCSI SAN, data protection and disaster recovery solutions using cloud storage, today announced it has achieved Bronze Partner Program status with international hosting provider, PEER 1 Hosting (TSX: PIX), further demonstrating TwinStrata’s continued commitment to supporting the industry’s key cloud storage providers along with their customers.

Read the full press release…