Click here to close now.


Containers Expo Blog Authors: XebiaLabs Blog, Pat Romanski, Elizabeth White, Mike Kavis, Carmen Gonzalez

News Feed Item

New Digital Universe Study Reveals Big Data Gap: Less Than 1% Of World's Data Is Analyzed; Less Than 20% Is Protected

Opportunities Abound for Companies Capable of Protecting and Extracting Value from an Expanding Universe of Data; By 2020, Emerging Markets Will Supplant the Developed World as the Main Producer of the World's Data.

HOPKINTON, Mass., Dec. 11, 2012 /PRNewswire/ --

News Summary:

  • New IDC Digital Universe study, "Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East"(1) (sponsored by EMC) finds that only a tiny fraction of the world's Big Data potential is being realized, though the amount of useful data is expanding.
  • IDC projects that the digital universe will reach 40 zettabytes (ZB) by 2020, an amount that exceeds previous forecasts by 5 ZBs, resulting in a 50-fold growth from the beginning of 2010.
  • This year's study marks the first time IDC was able to capture where the information in the digital universe either originated or was first captured or consumed, revealing some dramatic shifts currently underway.
  • The amount of data that requires protection is growing faster than the digital universe itself, yet levels of protection are not keeping pace.
  • According to the study, 2.8 ZB of data will have been created and replicated in 2012.
  • Machine-generated data is a key driver in the growth of the world's data – which is projected to increase 15x by 2020.
  • By 2020, emerging markets will supplant the developed world as the main producer of the world's data.
  • The investment in spending on IT hardware, software, services, telecommunications and staff that could be considered the "infrastructure" of the digital universe will grow by 40% between 2012 and 2020.  Investment in targeted areas like storage management, security, Big Data, and cloud computing will grow considerably faster.
  • Join the #DigitalUniverse and #EMC conversations on Twitter.

Full Story:

EMC® Corporation (NYSE: EMC) today announced results of the EMC-sponsored IDC Digital Universe study, "Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East"— which found that despite the unprecedented expansion of the digital universe due to the massive amounts of data being generated daily by people and machines, IDC estimates that only 0.5% of the world's data is being analyzed. 

To view the multimedia version of this news release go to

The proliferation of devices such as PCs and smartphones worldwide, increased Internet access within emerging markets and the boost in data from machines such as surveillance cameras or smart meters has contributed to the doubling of the digital universe within the past two years alone -- to a mammoth 2.8 ZB. IDC projects that the digital universe will reach 40 ZB by 2020, an amount that exceeds previous forecasts by 14%.

In terms of sheer volume, 40 ZB of data is equivalent to:

  • There are 700,500,000,000,000,000,000 grains of sand on all the beaches on earth (or seven quintillion five quadrillion). That means 40 ZB is equal to 57 times the amount of all the grains of sand on all the beaches on earth.
  • If we could save all 40 ZB onto today's Blue-ray discs, the weight of those discs (without any sleeves or cases) would be the same as 424 Nimitz-class aircraft carriers.
  • In 2020, 40 ZB will be 5,247 GB per person worldwide.

This year's study marks the first time IDC was able to capture where the information in the digital universe either originated or was first captured or consumed, revealing some dramatic shifts currently underway.  Now in its sixth year, the study – measuring and forecasting the amount of digital information created and copied annually – includes findings around the "Big Data Gap," which is the gap between the amount of data with hidden value and the amount of value that is actually being extracted; the level of data protection required versus what is being delivered; and the geographic implications of the world's data.

Study Highlights:

  • Rapid expansion of the digital universe: IDC projects that the digital universe will reach 40 ZB by 2020, an amount that exceeds previous forecasts.
    • The digital universe will double every two years between now and 2020.
    • There will be approximately 5,247 GB of data for every man, woman and child on earth in 2020.
    • A major factor behind the expansion of the digital universe is the growth of machine generated data, increasing from 11% of the digital universe in 2005 to over 40% in 2020.
  • Large quantities of useful data are getting lost: The promise of Big Data lies within the extraction of value from large, untapped pools of data. However, the majority of new data is largely untagged file-based and unstructured data, which means little is known about it.
    • In 2012, 23% (643 exabytes) of the digital universe would be useful for Big Data if tagged and analyzed. However, currently only 3% of the potentially useful data is tagged, and even less is analyzed.
    • The amount of useful data is expanding with the growth of the digital universe. By 2020, 33% of the digital universe (13,000+ exabytes) will have Big Data value if it is tagged and analyzed.
  • Much of the digital universe is unprotected: The amount of data that requires protection is growing faster than the digital universe itself.
    • Less than a third of the digital universe required data protection in 2010, but that proportion is expected to exceed 40% by 2020.
    • In 2012, while about 35% of the information in the digital universe required some type of data protection, less than 20% of the digital universe actually has these protections.
    • The level of protection varies by region, with much less protection in the emerging markets.
    • Challenges such as advanced threats, the security skills gap and lack of adherence to security best practices among consumers and corporations will continue to compound the issue.
  • A geographic role-reversal is around the corner: Although the digital universe was a developed-world phenomenon in the early days, that is about to change as the population of the emerging markets begins to cast a longer shadow.
    • While emerging markets accounted for 23% of the digital universe as recently as 2010, their share is already up to 36% in 2012.
    • By 2020, IDC predicts that 62% of the digital universe will be attributable to emerging markets.
    • The current global breakdown of the digital universe is: U.S. – 32%, Western Europe – 19%, China – 13%, India – 4%, rest of the world – 32%.
    • By 2020, China alone is expected to generate 22% of the world's data.

Other Key Findings:

  • As cloud computing plays an even more important role in the management of Big Data, the number of servers worldwide is expected to grow tenfold and the amount of information managed directly by enterprise data centers will grow by a factor of 14.
  • The type of data stored in the cloud will also experience a radical transformation over the next few years. By 2020, IDC predicts that 46.7% of data stored in the cloud will be related to entertainment – not enterprise data. Surveillance data, embedded and medical data, and information created by computers, phones and consumer electronics will make up the remainder.
  • The amount of information stored in the digital universe about individual users exceeds the amount of data that they themselves create.
  • Western Europe is currently investing the most to manage the digital universe, spending $2.49 USD per GB. The U.S. comes in second, investing $1.77 per GB, followed by China at $1.31 per GB and India at $0.87 per GB.
  • As the infrastructure of the digital universe becomes ever more connected, information won't reside within the region where it is consumed, nor will it need to. By 2020, IDC estimates that nearly 40% of data will be "touched" by cloud computing (private and public), meaning that somewhere between a byte's origination and consumption, it will be stored or processed in a cloud.

EMC Quote:

Jeremy Burton, Executive Vice President, Product Operations and Marketing, EMC Corporation

"As the volume and complexity of data barraging businesses from all angles increases, IT organizations have a choice: they can either succumb to information-overload paralysis, or they can take steps to harness the tremendous potential teeming within all of those data streams. This year's study underscores the massive opportunity that exists for businesses that not only identify the potential benefits of the digital universe, but recognize the importance of navigating that universe with the right balance of technology, data security practices and IT skills. At EMC, we're uniquely positioned to help customers manage, protect and unlock game-changing value from data that translates directly into competitive advantage."

Additional Resources:

About EMC

EMC Corporation is a global leader in enabling businesses and service providers to transform their operations and deliver IT as a service. Fundamental to this transformation is cloud computing. Through innovative products and services, EMC accelerates the journey to cloud computing, helping IT departments to store, manage, protect and analyze their most valuable asset – information – in a more agile, trusted and cost-efficient way. Additional information about EMC can be found at

(1) IDC Digital Universe Study, sponsored by EMC, December 2012

EMC is a registered trademark of EMC Corporation. Other trademarks are the property of their respective owners.

SOURCE EMC Corporation

More Stories By PR Newswire

Copyright © 2007 PR Newswire. All rights reserved. Republication or redistribution of PRNewswire content is expressly prohibited without the prior written consent of PRNewswire. PRNewswire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.

@ThingsExpo Stories
Today’s connected world is moving from devices towards things, what this means is that by using increasingly low cost sensors embedded in devices we can create many new use cases. These span across use cases in cities, vehicles, home, offices, factories, retail environments, worksites, health, logistics, and health. These use cases rely on ubiquitous connectivity and generate massive amounts of data at scale. These technologies enable new business opportunities, ways to optimize and automate, along with new ways to engage with users.
WebRTC converts the entire network into a ubiquitous communications cloud thereby connecting anytime, anywhere through any point. In his session at WebRTC Summit,, Mark Castleman, EIR at Bell Labs and Head of Future X Labs, will discuss how the transformational nature of communications is achieved through the democratizing force of WebRTC. WebRTC is doing for voice what HTML did for web content.
This week, the team assembled in NYC for @Cloud Expo 2015 and @ThingsExpo 2015. For the past four years, this has been a must-attend event for MetraTech. We were happy to once again join industry visionaries, colleagues, customers and even competitors to share and explore the ways in which the Internet of Things (IoT) will impact our industry. Over the course of the show, we discussed the types of challenges we will collectively need to solve to capitalize on the opportunity IoT presents.
Through WebRTC, audio and video communications are being embedded more easily than ever into applications, helping carriers, enterprises and independent software vendors deliver greater functionality to their end users. With today’s business world increasingly focused on outcomes, users’ growing calls for ease of use, and businesses craving smarter, tighter integration, what’s the next step in delivering a richer, more immersive experience? That richer, more fully integrated experience comes about through a Communications Platform as a Service which allows for messaging, screen sharing, video...
SYS-CON Events announced today that Super Micro Computer, Inc., a global leader in high-performance, high-efficiency server, storage technology and green computing, will exhibit at the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. Supermicro (NASDAQ: SMCI), the leading innovator in high-performance, high-efficiency server technology is a premier provider of advanced server Building Block Solutions® for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, HPC and Embedded Systems worldwide. Supermi...
As more intelligent IoT applications shift into gear, they’re merging into the ever-increasing traffic flow of the Internet. It won’t be long before we experience bottlenecks, as IoT traffic peaks during rush hours. Organizations that are unprepared will find themselves by the side of the road unable to cross back into the fast lane. As billions of new devices begin to communicate and exchange data – will your infrastructure be scalable enough to handle this new interconnected world?
SYS-CON Events announced today that Dyn, the worldwide leader in Internet Performance, will exhibit at SYS-CON's 17th International Cloud Expo®, which will take place on November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. Dyn is a cloud-based Internet Performance company. Dyn helps companies monitor, control, and optimize online infrastructure for an exceptional end-user experience. Through a world-class network and unrivaled, objective intelligence into Internet conditions, Dyn ensures traffic gets delivered faster, safer, and more reliably than ever.
SYS-CON Events announced today that Sandy Carter, IBM General Manager Cloud Ecosystem and Developers, and a Social Business Evangelist, will keynote at the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA.
The Internet of Things (IoT) is growing rapidly by extending current technologies, products and networks. By 2020, Cisco estimates there will be 50 billion connected devices. Gartner has forecast revenues of over $300 billion, just to IoT suppliers. Now is the time to figure out how you’ll make money – not just create innovative products. With hundreds of new products and companies jumping into the IoT fray every month, there’s no shortage of innovation. Despite this, McKinsey/VisionMobile data shows "less than 10 percent of IoT developers are making enough to support a reasonably sized team....
The IoT market is on track to hit $7.1 trillion in 2020. The reality is that only a handful of companies are ready for this massive demand. There are a lot of barriers, paint points, traps, and hidden roadblocks. How can we deal with these issues and challenges? The paradigm has changed. Old-style ad-hoc trial-and-error ways will certainly lead you to the dead end. What is mandatory is an overarching and adaptive approach to effectively handle the rapid changes and exponential growth.
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo in Silicon Valley. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place Nov 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 17th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal an...
Developing software for the Internet of Things (IoT) comes with its own set of challenges. Security, privacy, and unified standards are a few key issues. In addition, each IoT product is comprised of at least three separate application components: the software embedded in the device, the backend big-data service, and the mobile application for the end user's controls. Each component is developed by a different team, using different technologies and practices, and deployed to a different stack/target - this makes the integration of these separate pipelines and the coordination of software upd...
As a company adopts a DevOps approach to software development, what are key things that both the Dev and Ops side of the business must keep in mind to ensure effective continuous delivery? In his session at DevOps Summit, Mark Hydar, Head of DevOps, Ericsson TV Platforms, will share best practices and provide helpful tips for Ops teams to adopt an open line of communication with the development side of the house to ensure success between the two sides.
The IoT is upon us, but today’s databases, built on 30-year-old math, require multiple platforms to create a single solution. Data demands of the IoT require Big Data systems that can handle ingest, transactions and analytics concurrently adapting to varied situations as they occur, with speed at scale. In his session at @ThingsExpo, Chad Jones, chief strategy officer at Deep Information Sciences, will look differently at IoT data so enterprises can fully leverage their IoT potential. He’ll share tips on how to speed up business initiatives, harness Big Data and remain one step ahead by apply...
There will be 20 billion IoT devices connected to the Internet soon. What if we could control these devices with our voice, mind, or gestures? What if we could teach these devices how to talk to each other? What if these devices could learn how to interact with us (and each other) to make our lives better? What if Jarvis was real? How can I gain these super powers? In his session at 17th Cloud Expo, Chris Matthieu, co-founder and CTO of Octoblu, will show you!
Today air travel is a minefield of delays, hassles and customer disappointment. Airlines struggle to revitalize the experience. GE and M2Mi will demonstrate practical examples of how IoT solutions are helping airlines bring back personalization, reduce trip time and improve reliability. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Dr. Sarah Cooper, M2Mi's VP Business Development and Engineering, will explore the IoT cloud-based platform technologies driving this change including privacy controls, data transparency and integration of real time context w...
The Internet of Everything is re-shaping technology trends–moving away from “request/response” architecture to an “always-on” Streaming Web where data is in constant motion and secure, reliable communication is an absolute necessity. As more and more THINGS go online, the challenges that developers will need to address will only increase exponentially. In his session at @ThingsExpo, Todd Greene, Founder & CEO of PubNub, will explore the current state of IoT connectivity and review key trends and technology requirements that will drive the Internet of Things from hype to reality.
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
Nowadays, a large number of sensors and devices are connected to the network. Leading-edge IoT technologies integrate various types of sensor data to create a new value for several business decision scenarios. The transparent cloud is a model of a new IoT emergence service platform. Many service providers store and access various types of sensor data in order to create and find out new business values by integrating such data.
There are so many tools and techniques for data analytics that even for a data scientist the choices, possible systems, and even the types of data can be daunting. In his session at @ThingsExpo, Chris Harrold, Global CTO for Big Data Solutions for EMC Corporation, will show how to perform a simple, but meaningful analysis of social sentiment data using freely available tools that take only minutes to download and install. Participants will get the download information, scripts, and complete end-to-end walkthrough of the analysis from start to finish. Participants will also be given the pract...