Posts Tagged ‘backup to cloud’

How to simplify off-site backup for virtual environments

Wednesday, September 14th, 2011

If you happen to be managing a virtual server environment, such as VMware or Hyper-V, you are probably aware of the importance of backup software to protect your valuable applications and data in case of disaster. You may also be keenly conscious of the unique needs that virtual server environments pose, such as virtual machine sprawl, contention for server resources and spiraling data growth (often duplicate data), in which case there is probably no need to spend a lot of time touting the benefits of Veeam Backup & Replication.

Over the past few years, Veeam has become the leader in backup for virtual environments offering class-leading consolidated and highly efficient backup and replication that was designed and optimized to handle the special requirements associated with ever-popular virtual machine backups.

Starting with Veeam as a foundation, is it possible to make backup and recovery even simpler? Well, one of the challenges of backup is moving backup data off site for disaster recovery (DR) purposes. You might settle for the manual and rather unreliable process of shipping tapes or media offsite with the hope of getting your data back in a reasonable period of time should a disaster strike. Alternatively, you might build out a disk-to-disk backup infrastructure that extends off site to a secondary or collocation facility, housing additional storage equipment that you own and manage (learn more in my recent article Data Protection and Backup: The Shortcomings of Disk-to-Disk). But perhaps you are looking for a faster, simpler way to achieve your off-site DR objectives, without manual and unreliable processes and without significant capital and administrative investment.

Backup to Cloud

What if you could simply add off-site storage for backups without the need for tapes and without the need for a secondary site? What if at the touch of a button you could access secure off-site cloud storage that is bandwidth optimized with local-key encryption and offers instant recoverability virtually anywhere and a choice of cloud providers. That is TwinStrata CloudArray.

Needless to say, we were recently happy to announce that every copy of Veeam Backup & Replication now comes bundled with a free CloudArray virtual appliance up to 1TB. Backing up virtual server environments just became simpler and the need for tapes and offsite facilities may quickly become a distant memory. Each CloudArray appliance can scale to Petabytes of data with an iSCSI interface that allows seamlessly storing backups or archives. CloudArray also offers additional storage capacity wherever it is needed. If you are a backup administrator considering off-site backup and DR options, consider a better, simpler option that only takes minutes to deploy. Try Veeam with your free CloudArray virtual appliance and let us know how simple off-site backup for your virtual environment can be.

Download a Free Copy of the Info-Tech Cloud Backup Vendor Landscape Report

Tuesday, August 9th, 2011

Info-Tech Research Group

Info-Tech Research Group recently published a very thorough 30+ page cloud storage vendor landscape report, the first of its kind that compares cloud storage gateways and appliances head-to-head. The report compares product offerings from a number of vendors including CoreVault, CTERA, Nasuni, Panzura, Storsimple, TwinStrata and Zmanda, highlighting the strengths and weaknesses of each respective solution. It goes on to choose a set of champions based on specific use cases that include:

  • Mounting Storage to Cloud IaaS Servers
  • Unstructured Data/NAS Replacement
  • Global Distributed Enterprises
  • Private Clouds

You can download a FREE copy of the report here.

I should mention that this report was not commissioned or sponsored by TwinStrata and highlights the strengths and weaknesses of all solutions, including ours.

If you are using or considering adding cloud storage to your IT environment, we think you will find this report very illuminating.

Clouds, Consistency, and Progress Bars

Monday, April 25th, 2011

I have the bad habit of staring at progress bars.

I was backing up my Mac laptop to a CloudArray volume.¹ With gigabit ethernet, a full backup to a CloudArray volume takes less time than my local USB drive. Apple’s Time Machine actually only does a full backup once, followed by hourly incrementals that are rolled together into dailies, weeklies, and synthetic fulls. That’s a fantastic model for the cloud, since it saves a lot on bandwidth, but I usually tear down most of my volumes and run the full backup again. It’s a good way for me to keep an eye on a number of different variables that can affect CloudArray performance.

Anyway, I set up a backup volume and sat back to watch the progress bars. Here’s a good one:

Flush the first

At this point, the backup on my laptop was mostly done. You can see that my CloudArray cache still had 35 gigs of dirty data, and it was just starting to work on flushing 8 gigs out to the cloud. Also, I’d been staring too long, and popped off to do important CTO-type stuff.

A few minutes later, important CTO-type stuff being done, I checked back in on my progress bar:

flush2

The same flush was still in progress, and it was mostly done. But wait! The cache still reports 35 gigs of dirty data! (Actually, 35.1… the operating system hadn’t finished flushing its own cache the last time I checked.) But if an 8 gigabyte flush was mostly done, shouldn’t the cache be almost 8 gigs cleaner? What ever can be going on?

The answer, of course, is a teachable moment.

I’ve been building storage arrays of one type or another for pretty much my whole career. The most important aspect of any array’s firmware is its consistency model, by which I mean: how does it ensure that the data that it stores accurately represents the data that the host applications wrote? If an application writes “AB” to the disk, how does the firmware ensure that the next time it reads from that disk, it gets back “AB”? That is absolutely the most fundamental requirement of a storage system: everything else is just icing.

That might not sound like that hard of a problem, but the nuances of storing data in a complex, shared, networked controller can be subtle. For example, if my application writes “A”, then “B”, then “C” to different locations, I always want to return A, B, C for those locations. But if you add in a cache to the controller, and assume that the cache will fail (you always assume that every hardware component will eventually fail), then it’s not enough to just store the data in the cache. If you are implementing a write-back cache, you have to store information about the order in which those writes occur, so that the underlying backing store (a physical disk, say) gets those writes in the same order. Otherwise, when that cache fails, your application might read back A and C, but not B.

Why is that a problem? What if your application is a database, (A, B) is a credit card transaction, and C is the database checkpoint? In that case, your database will correctly read A, read corrupted data in place of B, and C will tell it that the corrupted junk is just fine. That’s bad.

If your cache firmware is well implemented, though, and only gets the chance to write two blocks before the cache hardware fails, then it will write A and B. Now, when your database tries to reread the data, it’ll find (A, B), but without that crucial C, it’ll do a proper rollback of the transaction.

In CloudArray, we’ve got an added complication: our backing store is not a local physical drive. It’s a massively scalable set of redundant data centers probably located a thousand miles away from our cache. The performance difference between our local devices and the cloud is several orders of magnitude. So how can we maintain consistency?

The answer lies in our rather complex representation of block devices as objects. First, we notice that strict write ordering is not an absolute necessity. We simply need to ensure that our data in the cloud represents some state that existed in our virtual volume, so that if C is present in the cloud, then (A, B) is there, too, but we don’t need to represent each of the intermediate states (A), (A, B), (A, B, C). Then, we have to partition our incoming data into sequences that can represent transitions between these states: these sequences are what we call a flush, and we try to design those partitions to maximize bandwidth utilization while also minimizing the temporal distance between state transitions. Finally, after we’ve transmitted a flush to the cloud, we have to perform an atomic commit on our representation, so that the new state of the cloud is entirely consistent.

And we have to do that in a way that is mindful of the architecture of cloud storage systems, which are often designed around the (not at all scary and in fact quite cool in a nerdy way, in spite of what some people say) eventual consistency model.

What’s all that got to do with my progress bar?

Well, in order to make sure that our cloud data maintains consistency, especially in the presence of sometimes quite flaky networks, we can’t clean out our cache until we’ve successfully committed and verified the most recent state transition, i.e. the last flush. So my progress bar is not really indicating the amount of data that’s been emptied out of the cache: it only tells me how much of the most recent state has been transmitted to the cloud. The data can’t be marked clean in the cache until the actual, final commit has been completed.

So what happens when the flush completes? Let’s see:

flush3

Huh. There it is. The cache now has only 27.1 gigabytes of dirty pages left. Mission accomplished.

And if my CloudArray were to experience some kind of catastrophe right now, like some dastardly CTO yanking out a cache storage device, what would happen? Once I restored it to operation, then Time Machine would pull the nice, consistent image out of the cloud, notice the missing 27.1 gigs, and pick right up from there. Like I said, it’s a nice piece of software, but it does rely on consistent storage.

¹It’s pretty easy to set up a Time Machine backup using the the Studio Network Solutions globalSAN iSCSI initiator for OS X: just install it, point it at a CloudArray, and voila! Up pops whatever capacity I need. Launch Time Machine, set the CloudArray volume as the target disk, and I’ve got a whole bunch of progress bars to stare at.

Cloud Storage Arithmetic: 80% Faster Recovery

Thursday, March 31st, 2011

The topics of ROI and cost savings frequently come up when discussing moving a portion of data storage infrastructure to the cloud. While it is good practice to compare the costs of purchasing dedicated infrastructure versus the pay-as-you-go cloud model to find the “break-even point” when the initial costs of investing in the cloud are replaced by month-over-month cost savings,  the math really does not speak to the operational improvement businesses can experience from moving to cloud storage.

Let’s take our recent customer case study, where AFGE, the largest federal employee union in the United States, was able to reduce VMWare off-site restore times by 80% versus tape using a combination of CloudArray software and Veeam Backup software.  How exactly should a business value such a substantial operational improvement in off-site backup? Well, here’s how AFGE values it:

“Now that we know the backup data is successfully and securely going offsite, we can rest easily. With the deployment of CloudArray, we were able to cut our storage costs, reduce data recovery times from one week to one day, and eliminate much of the manual work of handling all those tapes resulting in a savings of one quarter of a FTE,” said Taylor Higley, IT director, AFGE.

As you can see, there is a lot to be said about improvements in operational efficiency. While cost savings is always a relevant consideration in moving to the cloud, so is peace of mind in protecting/recovering valuable data.

Find out how your business can benefit from improved IT operational efficiency. Learn more about CloudArray.

Higher Education Lunch Session: Learn How To Capitalize on Cloud Storage Today

Tuesday, February 1st, 2011

 

Is Data Growth Getting You Down? Then Look Up To the Cloud!

  • Thursday, February 10, 2001 
  • TwinStrata Offices
  • 24 Prime Parkway, Suite 301A
  • Natick, MA 01760
  • 9:30 AM to 1:00 PM 

TwinStrata Can Help Colleges & Universities Capitalize on Cloud Storage…Simply, Securely, and Affordably

Are you looking for ways to battle shrinking budgets yet still address the need for ever-increasing storage capacity, efficient off-site data protection and affordable disaster recovery across your campuses?

Using TwinStrata CloudArray®, an affordable iSCSI data protection and disaster recovery cloud storage solution, can bring relief. It’s non-intrusive and non-disruptive, requiring no “rip and replace” of any existing application or infrastructure. 

Attend our Higher Education Lunch session on Thursday, February 10, 2011 and learn how TwinStrata can help you:

  • Turn capex into opex  
  • Balance your capacity demand and cost
  • Seamlessly scale IT using cloud storage
  • Reduce IT complexity and increase IT efficiency

The program will include:

  • Lunch
  • A $25 VISA Gift card for attending
  • An opportunity to meet our Development and Management Teams
  • Participation in a CloudArray workshop
  • A grand prize drawing (must be in attendance to win)

TwinStrata and PEER 1 Team Up to Deliver Enterprise Class Cloud Storage

Monday, January 31st, 2011

 

TwinStrata conintues to broaden its partner ecosystem with leading cloud storage providers. Here is yet another example…

NATICK, Mass. and VANCOUVER, British Columbia, Jan. 31, 2011 /PRNewswire/ — TwinStrata, Inc., the leading innovator in iSCSI SAN, data protection and disaster recovery solutions using cloud storage, today announced it has achieved Bronze Partner Program status with international hosting provider, PEER 1 Hosting (TSX: PIX), further demonstrating TwinStrata’s continued commitment to supporting the industry’s key cloud storage providers along with their customers.

Read the full press release…

Cloud Storage Effect on Storage Management: Reduced Complexity, Maximized Resources, Improved Efficiency

Monday, January 24th, 2011

 

IT administrators continue to face the age-old challenges of storage management complexity and cost while the burden of managing exponential data growth has businesses of all sizes considering the best ways to store, protect, and archive their files, Exchange, and SharePoint data. The need to maximize resources and infrastructure, optimize storage requirements, and improve efficiencies remain top drivers for most of these businesses today. 

With all of these factors to consider, one of the most difficult skill sets for IT to find and retain are expert level administrators for specific storage management disciplines including storage administrators.

When you deploy an Enterprise or Mid-Range storage array, you generally need a team of people who are specialized in configuring, provisioning, and managing those storage arrays (let alone the compliance, disaster recovery, and other more advanced storage specializations).  Decisions made daily include RAID configuration, performance tuning, device management, storage pool provisioning, management of remote replication, management of consistency groups, and management of capacity and storage tiering. These are highly specialized and vendor specific skills. They will extend out to your application servers with CLI and API command sets which must be used to perform even simple client side tasks.

Most, if not all of these technology skill demands will disappear once you deploy Cloud Storage. Of course, if you deploy a Private Cloud, you will merely be moving the skill pools to a different area, but they will still largely vanish from your day-to-day data center operations.  With Public Clouds, they will go away almost immediately and entirely.

As Cloud Storage gets provisioned through CloudArray, your administrators will largely be working at the level of an average system administrator skill set when it comes to provisioning and managing storage.  Configuration requirements will be reduced to basic volume count, volume size, encryption requirements, and page size requirements.  None of this requires advanced degrees, decades of storage management experience, or high level vendor certifications.

By deploying a Cloud Storage model – especially for routine use cases such as online backup, archive, and disaster recovery operations – you can begin to free up highly skilled administrators and other IT specialists to redeploy and focus on other critical areas of your IT operations. Cloud Storage doesn’t necessarily mean direct reductions in headcount. Efficiency is in part about resource re-deployment without having to incur additional costs for people or infrastructure. Conversely, Cloud Storage might even allow growth in areas you otherwise couldn’t hire into before.

Essentially, as more leading-edge technologies begin to creep into IT shops and data centers, Cloud Storage is a direct and immediate way to reduce management complexity and costs affording IT the chance to spend more time on business applications, business continuity, and strategic IT planning and projects.

The best way to see this is to download and try it for yourself.  Visit www.TwinStrata.com for more information.