Containers Expo Blog Authors: Jamie Madison, Dalibor Siroky, PagerDuty Blog, Stackify Blog, Matt Lonstine

Related Topics: Containers Expo Blog, @CloudExpo

Containers Expo Blog: Blog Feed Post

Cloud Replication Is Not Backup, But Backup Is!

Backup requires management of multiple historic copies of a data set. Clearly, cloud storage in itself isn't backup

Early Bird Registration at Cloud Expo

Go read that headline again: W. Curtis "Mr. Backup" Preston points out on his blog that replication is not backup, and we can't disagree.

Keeping alternative copies of data in multiple locations is a great idea, reducing the risk of data loss and potentially enabling enhanced access, but it's not a historical data protection (aka, backup) strategy.

Backup requires management of multiple historic copies of a data set. Clearly, cloud storage in itself isn't backup.

Backup vs. Storage

SNIA defines "backup" thus:
  1. [Data Recovery] A collection of data stored on (usually removable) non-volatile storage media for purposes of recovery in case the original copy of data is lost or becomes inaccessible; also called a backup copy.
    To be useful for recovery, a backup must be made by copying the source data image when it is in a consistent state.
  2. [Data Recovery] The act of creating a backup. See archive.

Backup has always been a challenge for corporate IT. It's not "in the critical path", affecting the daily activities of business users and customers, so it usually gets short-shrift when it comes to financial and organizational support. Yet the ability to restore data quickly becomes job one for IT when it is lost or corrupted. I think Preston spells it out wonderfully in the first chapter of his seminal book, UNIX Backup and Recovery. Systems always fail, data is always lost, and having a good backup is the surest way to recover.

Storage industry folks have been suggesting that new technologies eliminate "traditional backups" ever since there has been an industry to speak of. Some of these technologies (RAID, replication, high availability, hash-based integrity checks) are great innovations in keeping online data alive, but they fall flat when it comes to data corruption. Others (mirroring, snapshots, versioning, CAS, CDP) are great at retaining multiple copies of data, but even these aren't true backup solutions. Good backup is much more than mere data protection: Backup must manage data, not just protect it. No basic storage technology will eliminate a real backup solution.

Skim through Preston's book (the index is online at Amazon!) and you'll see that merely creating and holding a copy of a given data set is just a small part of a real backup solution. These copies must be tracked, managed, and expired. Operating systems and applications must be integrated into the solution. Bare-metal recovery, disasters, and compliance must be considered. Storage folks ignore these hard-learned lessons at their peril, and any storage vendor who says backup is dead is revealing their ignorance or naïveté!

Cloud Storage For Data Protection
Although storage technology will never be a full answer to the data protection quandary, it has a lot to offer when it comes to assisting backup solutions. Disk technology has literally transformed the backup world in the last decade in the form of replication, snapshots, CDP, virtual tape libraries, and deduplication. These technologies give powerful new capabilities to the existing backup frameworks, overcoming the dismally-limited tape cartridge approach of the olden days. A state-of-the-art backup solution now relies much more on disk-based storage systems than tape or optical capacity, and many use disks exclusively.

Cloud storage presents new opportunities to enable more effective and efficient backup solutions. Most cloud storage platforms can be very highly utilized, reducing system cost, and can be flexibly and non-disruptively expanded as capacity needs grow. But some cloud storage systems go way beyond this:

  • One of the hallmarks of public cloud solutions is their physical distance from the systems that use them, decreasing the likelihood of data loss from a local disaster. Backing up to a site hundreds or thousands of miles away has long been a dream of IT, and cloud storage makes this possible and even cost-effective!
  • A few cloud storage platforms offer integrated policy-based replication of data (ahem, Nirvanix), and this additional geographic distribution further reduces the risk of data loss in a disaster. It can also aid in recovery, since data can be available locally at remote locations!
  • Like all disk-based backup targets, cloud storage is online and accessible, making restore operations quicker and easier. There is no need to wait for tapes to be recalled, delivered, located, and loaded when data is on random-access disk! But unlike local disk, public cloud storage can be accessible remotely as well, bringing this ease to distributed businesses and disaster recovery operations.
  • Cloud storage systems can embed metadata with stored content, further accelerating restore operations for systems that can use it since indexes no longer have to be rebuilt. This also enables new archiving and content management features, elevating backup to serve a primary business need.
  • One of the hallmarks of cloud storage platforms is their API-based programmability. Backup and archive management companies are discovering the ease and power of integrating programmable cloud storage right into their applications: Watch this space for announcements!
  • Further storage smarts are being embedded into cloud systems, too. We have seen deduplication and compression (check out Nirvanix partner, Ocarina!), data protection (Partners, Tarmin and Atempo), media transcoding, indexing, content distribution, and more.

Backup Is Backup
Simply adding basic data protection techniques like snapshots or replication to a storage system doesn't make it a backup solution. Storage isn't backup, but backup is! Backup systems can leverage storage capabilities, but a backup management solution will always be required to get complete data protection.

Clearly, cloud storage isn't a backup solution. But as this unique combination of capabilities demonstrate, it's much more than simple storage capacity. Like so many storage technologies before it, cloud storage is an enabler for advanced backup solutions.

Read the original blog entry...

More Stories By Stephen Foskett

Stephen Foskett has provided vendor-independent end user consulting on storage topics for over 10 years. He has been a storage columnist and has authored numerous articles for industry publications. Stephen is a popular presenter at industry events and recently received Microsoft’s MVP award for contributions to the enterprise storage community. As the director of consulting for Nirvanix, Foskett provides strategic consulting to assist Fortune 500 companies in developing strategies for service-based tiered and cloud storage. He holds a bachelor of science in Society/Technology Studies, from Worcester Polytechnic Institute.

@ThingsExpo Stories
In his session at 21st Cloud Expo, Carl J. Levine, Senior Technical Evangelist for NS1, will objectively discuss how DNS is used to solve Digital Transformation challenges in large SaaS applications, CDNs, AdTech platforms, and other demanding use cases. Carl J. Levine is the Senior Technical Evangelist for NS1. A veteran of the Internet Infrastructure space, he has over a decade of experience with startups, networking protocols and Internet infrastructure, combined with the unique ability to it...
"There's plenty of bandwidth out there but it's never in the right place. So what Cedexis does is uses data to work out the best pathways to get data from the origin to the person who wants to get it," explained Simon Jones, Evangelist and Head of Marketing at Cedexis, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Cloud Academy is an enterprise training platform for the cloud, specifically public clouds. We offer guided learning experiences on AWS, Azure, Google Cloud and all the surrounding methodologies and technologies that you need to know and your teams need to know in order to leverage the full benefits of the cloud," explained Alex Brower, VP of Marketing at Cloud Academy, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clar...
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
Gemini is Yahoo’s native and search advertising platform. To ensure the quality of a complex distributed system that spans multiple products and components and across various desktop websites and mobile app and web experiences – both Yahoo owned and operated and third-party syndication (supply), with complex interaction with more than a billion users and numerous advertisers globally (demand) – it becomes imperative to automate a set of end-to-end tests 24x7 to detect bugs and regression. In th...
"Akvelon is a software development company and we also provide consultancy services to folks who are looking to scale or accelerate their engineering roadmaps," explained Jeremiah Mothersell, Marketing Manager at Akvelon, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"MobiDev is a software development company and we do complex, custom software development for everybody from entrepreneurs to large enterprises," explained Alan Winters, U.S. Head of Business Development at MobiDev, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
SYS-CON Events announced today that CrowdReviews.com has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5–7, 2018, at the Javits Center in New York City, NY. CrowdReviews.com is a transparent online platform for determining which products and services are the best based on the opinion of the crowd. The crowd consists of Internet users that have experienced products and services first-hand and have an interest in letting other potential buye...
"IBM is really all in on blockchain. We take a look at sort of the history of blockchain ledger technologies. It started out with bitcoin, Ethereum, and IBM evaluated these particular blockchain technologies and found they were anonymous and permissionless and that many companies were looking for permissioned blockchain," stated René Bostic, Technical VP of the IBM Cloud Unit in North America, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventi...
SYS-CON Events announced today that Telecom Reseller has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Telecom Reseller reports on Unified Communications, UCaaS, BPaaS for enterprise and SMBs. They report extensively on both customer premises based solutions such as IP-PBX as well as cloud based and hosted platforms.
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, discussed how from store operations and ...
It is of utmost importance for the future success of WebRTC to ensure that interoperability is operational between web browsers and any WebRTC-compliant client. To be guaranteed as operational and effective, interoperability must be tested extensively by establishing WebRTC data and media connections between different web browsers running on different devices and operating systems. In his session at WebRTC Summit at @ThingsExpo, Dr. Alex Gouaillard, CEO and Founder of CoSMo Software, presented ...
WebRTC is great technology to build your own communication tools. It will be even more exciting experience it with advanced devices, such as a 360 Camera, 360 microphone, and a depth sensor camera. In his session at @ThingsExpo, Masashi Ganeko, a manager at INFOCOM Corporation, introduced two experimental projects from his team and what they learned from them. "Shotoku Tamago" uses the robot audition software HARK to track speakers in 360 video of a remote party. "Virtual Teleport" uses a multip...
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, whic...
SYS-CON Events announced today that Evatronix will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Evatronix SA offers comprehensive solutions in the design and implementation of electronic systems, in CAD / CAM deployment, and also is a designer and manufacturer of advanced 3D scanners for professional applications.
Leading companies, from the Global Fortune 500 to the smallest companies, are adopting hybrid cloud as the path to business advantage. Hybrid cloud depends on cloud services and on-premises infrastructure working in unison. Successful implementations require new levels of data mobility, enabled by an automated and seamless flow across on-premises and cloud resources. In his general session at 21st Cloud Expo, Greg Tevis, an IBM Storage Software Technical Strategist and Customer Solution Architec...
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. In his session at @BigDataExpo, Jack Norris, Senior Vice President, Data and Applications at MapR Technologies, reviewed best practices to ...
An increasing number of companies are creating products that combine data with analytical capabilities. Running interactive queries on Big Data requires complex architectures to store and query data effectively, typically involving data streams, an choosing efficient file format/database and multiple independent systems that are tied together through custom-engineered pipelines. In his session at @BigDataExpo at @ThingsExpo, Tomer Levi, a senior software engineer at Intel’s Advanced Analytics gr...
When talking IoT we often focus on the devices, the sensors, the hardware itself. The new smart appliances, the new smart or self-driving cars (which are amalgamations of many ‘things’). When we are looking at the world of IoT, we should take a step back, look at the big picture. What value are these devices providing? IoT is not about the devices, it’s about the data consumed and generated. The devices are tools, mechanisms, conduits. In his session at Internet of Things at Cloud Expo | DXWor...