|By Rick Kawamura||
|March 8, 2012 06:00 AM EST||
I recently presented, "The Moneyball Approach to Big Data - Creating an Unfair Advantage," at the Wall Street Technology Association's Hot Technologies Forum in New York. Everyone is talking about Big Data, but when it comes to taking action, most are taking a "wait-and-see" approach, and that concerns me. Skepticism or "late-adopter" mentality is understandable - if you want to forego a low-risk, high-reward opportunity and let your competition gain the advantage.
My job is to create value for my customers, and I'd hate to see anyone miss out on the opportunity presented by Big Data.
What's the Problem?
The Corporate Executive Board identified three potential barriers to Big Data implementation:
- Information Attainability (the right information is available and easy to find)
- Information Usefulness (information is of good quality and is usable in format)
- Employee Capability (employees analyze information effectively to make good decisions)
While these are definitely three potential barriers to implementation, the main problem I see in the market is pre-implementation where the "wait and see" mentality is caused by either a feeling that Big Data is over-hyped ("maybe it will go away"), or paralysis by analysis (the enormity and complexity of Big Data is too confusing to take action).
What Is Big Data?
The most common definition I've seen for Big Data is summarized by the three Vs:
- Volume: It's big - terabytes and petabytes of data
- Variety: It comes in many forms - internal, external, structured, and unstructured
- Velocity: It is growing and changing rapidly - making real-time capture and action hugely important
This definition is always supported by numbers showing the vastness and enormity of Big Data:
- The New York Stock Exchange creates 1 terabyte of data per day (InformationWeek)
- 10,000 payment-card transactions are made per second worldwide (American Banker)
- 30 billion pieces of content are shared on Facebook every month (McKinsey)
- Twitter feeds generate 8 terabytes of data every day (InformationWeek)
The Internet plays a huge role in the rapid growth of Big Data, giving individuals the ability to post and upload immense amounts of pictures, text, video, and mobile data. It also gives businesses a channel to offer access to customers and partners through web-based applications (think Oracle, salesforce.com, social media, procurement, logistics, publishers, etc.). One way to visualize this explosion of applications and data is the Bessemer Venture Partners Cloudscape. And that's just in the cloud. Don't forget all of the apps and data behind the firewall of every organization, whether commercial, governmental, or charitable. Big Data truly is BIG.
Before you go out and buy massive amounts of storage, take a look at what you currently consume and utilize and start from there in easily digestible portions. Forrester estimates that enterprises currently utilize less than 5% of available data. In a survey of global executives, IBM shows that 33% have made decisions with inaccurate data or data they don't trust; half don't have sufficient information from across their organization to do their job; 75% believe more predictive information would drive better decisions; yet 87% have yet to even start taking advantage of opportunities to leverage information to their advantage. You don't need to immediately implement a solution getting you to 100% or even 50% of available data - 6-10% will do for now.
If there are 200 million tweets a day equaling 8 terabytes of data, but only 1,000 of those tweets relate to your product or company, do you need to store and analyze all 8 terabytes every day? Think of it this way: there's a huge difference between "I have terabytes of data - videos, satellite pictures, social media conversations, and research reports" and "I know where Public Enemy #1 is." It comes down to Data vs. Intelligence. Data is useless if you can't extract meaningful intelligence from it. And the quality of the intelligence is much less dependent on the volume of data than it is on the relevance of the data and your ability to access it.
The Magic Words: Relevance and Accessibility
Although Big Data lives up to its name, don't get caught up in all the massive numbers. Focus on what's relevant to your business. Consider this: Sybase published Big Data, Big Opportunity that stated "for the median Fortune 1000 company... a 10% increase in usability of data translates to an increase of $2.01 billion in total revenue per year, [and] a 10% increase in accessibility to data translates to an additional $65.67 million in net income per year." Just because your company currently may have access to only 5 percent of the relevant data that is available, don't despair. You don't have to go from 5 percent to 100 percent. You really only need to go from 5 percent to 5.5 percent to reap great rewards.
The Secret to Taming Big Data
Despite all the hype and discussion around Big Data's massiveness, I've yet to find a single article mentioning the difficulty of accessing data that is spread throughout all of the various source applications. Until recently there were no Big Data integration platforms that could deal with the exploding number of applications and all of the data they contain, as well as the speed at which both are changing. Just a glance at the daily domain statistics on www.domaintools.com gives you an idea of the volume of sites being created, deleted, and transferred every 24 hours. Not every integration solution can manage the intensity of that kind of change to give you access to the relevant data - the business intelligence - your business needs when you need it.
The whole point of gaining access to relevant data is that it must be actionable. Otherwise it's a big waste of time and effort. What's amazingly useful about Big Data, and the web-based nature of so much of it, is that with a Big Data integration platform you can access any data you can see on a website and you can just as easily transform that data, perform an operation on it, and automate a resulting action. Here's an example:
You know that consumers and even your B2B purchasers research prices online and that loyalty to any one vendor has deteriorated as buyers have more pricing knowledge just a search and mouse-click away. But you're smarter than your competitors because you're already doing the extra 10 percent. You set up automated monitoring of your competitor's pricing, and when their price drops below yours, your Big Data integration platform calculates the difference plus 10%, logs into your ecommerce site and adjusts your prices automatically, all in mere moments. The beauty is that this can all be set up in hours, if not a few days, and you don't have to bring in an army of developers or consultants to create custom code to do any of it.
If I told you I could guarantee any application or data you can see in your web browser (customer data, bank transactions, twitter, blogs, supply chain vendors, government data, competitor prices, etc.) could be automatically accessed and loaded into the application, database, or spreadsheet of your choice, how many game-changing Big Data projects could you imagine? Understanding the point-in-time cash position of billions of dollars across 300 banks? No problem. Monitoring competitor pricing on 50,000 SKUs every day? Simple. Automating a 23-step manual invoicing process to get paid millions of dollars two days faster? Done. Real-time, automated access to the relevant data you need is the key to success with Big Data.
Every company can benefit from Big Data in many ways, but most don't realize it. Hundreds of scenarios are possible using real-time application integration platforms that could save your company millions of dollars; grow revenue by double-digit percentages; create more personalized products that delight your customers; automate real-time feedback on your brand, products, and competitor prices; create your own custom research that allows you to see trends before your competitors do; and overall make your company a much more agile business that scales with your new-found vigor and growth. Don't let the size of Big Data paralyze you; get real-time access to the data that is relevant to your company's growth and take action.
Microservices are a very exciting architectural approach that many organizations are looking to as a way to accelerate innovation. Microservices promise to allow teams to move away from monolithic "ball of mud" systems, but the reality is that, in the vast majority of organizations, different projects and technologies will continue to be developed at different speeds. How to handle the dependencies between these disparate systems with different iteration cycles? Consider the "canoncial problem" in this scenario: microservice A (releases daily) depends on a couple of additions to backend B (re...
Dec. 1, 2015 09:00 AM EST Reads: 481
Container technology is shaping the future of DevOps and it’s also changing the way organizations think about application development. With the rise of mobile applications in the enterprise, businesses are abandoning year-long development cycles and embracing technologies that enable rapid development and continuous deployment of apps. In his session at DevOps Summit, Kurt Collins, Developer Evangelist at Built.io, examined how Docker has evolved into a highly effective tool for application delivery by allowing increasingly popular Mobile Backend-as-a-Service (mBaaS) platforms to quickly crea...
Dec. 1, 2015 08:00 AM EST Reads: 396
Too often with compelling new technologies market participants become overly enamored with that attractiveness of the technology and neglect underlying business drivers. This tendency, what some call the “newest shiny object syndrome” is understandable given that virtually all of us are heavily engaged in technology. But it is also mistaken. Without concrete business cases driving its deployment, IoT, like many other technologies before it, will fade into obscurity.
Dec. 1, 2015 08:00 AM EST Reads: 395
We all know that data growth is exploding and storage budgets are shrinking. Instead of showing you charts on about how much data there is, in his General Session at 17th Cloud Expo, Scott Cleland, Senior Director of Product Marketing at HGST, showed how to capture all of your data in one place. After you have your data under control, you can then analyze it in one place, saving time and resources.
Dec. 1, 2015 08:00 AM EST Reads: 251
The Internet of Things is clearly many things: data collection and analytics, wearables, Smart Grids and Smart Cities, the Industrial Internet, and more. Cool platforms like Arduino, Raspberry Pi, Intel's Galileo and Edison, and a diverse world of sensors are making the IoT a great toy box for developers in all these areas. In this Power Panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, panelists discussed what things are the most important, which will have the most profound effect on the world, and what should we expect to see over the next couple of years.
Dec. 1, 2015 06:30 AM EST Reads: 515
Growth hacking is common for startups to make unheard-of progress in building their business. Career Hacks can help Geek Girls and those who support them (yes, that's you too, Dad!) to excel in this typically male-dominated world. Get ready to learn the facts: Is there a bias against women in the tech / developer communities? Why are women 50% of the workforce, but hold only 24% of the STEM or IT positions? Some beginnings of what to do about it! In her Day 2 Keynote at 17th Cloud Expo, Sandy Carter, IBM General Manager Cloud Ecosystem and Developers, and a Social Business Evangelist, wil...
Dec. 1, 2015 05:00 AM EST Reads: 620
PubNub has announced the release of BLOCKS, a set of customizable microservices that give developers a simple way to add code and deploy features for realtime apps.PubNub BLOCKS executes business logic directly on the data streaming through PubNub’s network without splitting it off to an intermediary server controlled by the customer. This revolutionary approach streamlines app development, reduces endpoint-to-endpoint latency, and allows apps to better leverage the enormous scalability of PubNub’s Data Stream Network.
Dec. 1, 2015 05:00 AM EST Reads: 359
Apps and devices shouldn't stop working when there's limited or no network connectivity. Learn how to bring data stored in a cloud database to the edge of the network (and back again) whenever an Internet connection is available. In his session at 17th Cloud Expo, Ben Perlmutter, a Sales Engineer with IBM Cloudant, demonstrated techniques for replicating cloud databases with devices in order to build offline-first mobile or Internet of Things (IoT) apps that can provide a better, faster user experience, both offline and online. The focus of this talk was on IBM Cloudant, Apache CouchDB, and ...
Dec. 1, 2015 04:45 AM EST Reads: 458
I recently attended and was a speaker at the 4th International Internet of @ThingsExpo at the Santa Clara Convention Center. I also had the opportunity to attend this event last year and I wrote a blog from that show talking about how the “Enterprise Impact of IoT” was a key theme of last year’s show. I was curious to see if the same theme would still resonate 365 days later and what, if any, changes I would see in the content presented.
Dec. 1, 2015 03:00 AM EST Reads: 470
Cloud computing delivers on-demand resources that provide businesses with flexibility and cost-savings. The challenge in moving workloads to the cloud has been the cost and complexity of ensuring the initial and ongoing security and regulatory (PCI, HIPAA, FFIEC) compliance across private and public clouds. Manual security compliance is slow, prone to human error, and represents over 50% of the cost of managing cloud applications. Determining how to automate cloud security compliance is critical to maintaining positive ROI. Raxak Protect is an automated security compliance SaaS platform and ma...
Dec. 1, 2015 03:00 AM EST Reads: 469
Most of the IoT Gateway scenarios involve collecting data from machines/processing and pushing data upstream to cloud for further analytics. The gateway hardware varies from Raspberry Pi to Industrial PCs. The document states the process of allowing deploying polyglot data pipelining software with the clear notion of supporting immutability. In his session at @ThingsExpo, Shashank Jain, a development architect for SAP Labs, discussed the objective, which is to automate the IoT deployment process from development to production scenarios using Docker containers.
Dec. 1, 2015 01:15 AM EST Reads: 127
Countless business models have spawned from the IaaS industry – resell Web hosting, blogs, public cloud, and on and on. With the overwhelming amount of tools available to us, it's sometimes easy to overlook that many of them are just new skins of resources we've had for a long time. In his general session at 17th Cloud Expo, Harold Hannon, Sr. Software Architect at SoftLayer, an IBM Company, broke down what we have to work with, discussed the benefits and pitfalls and how we can best use them to design hosted applications.
Nov. 30, 2015 03:45 PM EST Reads: 116
The Internet of Things (IoT) is growing rapidly by extending current technologies, products and networks. By 2020, Cisco estimates there will be 50 billion connected devices. Gartner has forecast revenues of over $300 billion, just to IoT suppliers. Now is the time to figure out how you’ll make money – not just create innovative products. With hundreds of new products and companies jumping into the IoT fray every month, there’s no shortage of innovation. Despite this, McKinsey/VisionMobile data shows "less than 10 percent of IoT developers are making enough to support a reasonably sized team....
Nov. 30, 2015 03:00 PM EST Reads: 497
Just over a week ago I received a long and loud sustained applause for a presentation I delivered at this year’s Cloud Expo in Santa Clara. I was extremely pleased with the turnout and had some very good conversations with many of the attendees. Over the next few days I had many more meaningful conversations and was not only happy with the results but also learned a few new things. Here is everything I learned in those three days distilled into three short points.
Nov. 30, 2015 02:00 PM EST Reads: 375
DevOps is about increasing efficiency, but nothing is more inefficient than building the same application twice. However, this is a routine occurrence with enterprise applications that need both a rich desktop web interface and strong mobile support. With recent technological advances from Isomorphic Software and others, rich desktop and tuned mobile experiences can now be created with a single codebase – without compromising functionality, performance or usability. In his session at DevOps Summit, Charles Kendrick, CTO and Chief Architect at Isomorphic Software, demonstrated examples of com...
Nov. 30, 2015 01:45 PM EST Reads: 439
As organizations realize the scope of the Internet of Things, gaining key insights from Big Data, through the use of advanced analytics, becomes crucial. However, IoT also creates the need for petabyte scale storage of data from millions of devices. A new type of Storage is required which seamlessly integrates robust data analytics with massive scale. These storage systems will act as “smart systems” provide in-place analytics that speed discovery and enable businesses to quickly derive meaningful and actionable insights. In his session at @ThingsExpo, Paul Turner, Chief Marketing Officer at...
Nov. 30, 2015 01:45 PM EST Reads: 442
In his keynote at @ThingsExpo, Chris Matthieu, Director of IoT Engineering at Citrix and co-founder and CTO of Octoblu, focused on building an IoT platform and company. He provided a behind-the-scenes look at Octoblu’s platform, business, and pivots along the way (including the Citrix acquisition of Octoblu).
Nov. 30, 2015 01:00 PM EST Reads: 542
In his General Session at 17th Cloud Expo, Bruce Swann, Senior Product Marketing Manager for Adobe Campaign, explored the key ingredients of cross-channel marketing in a digital world. Learn how the Adobe Marketing Cloud can help marketers embrace opportunities for personalized, relevant and real-time customer engagement across offline (direct mail, point of sale, call center) and digital (email, website, SMS, mobile apps, social networks, connected objects).
Nov. 30, 2015 12:45 PM EST Reads: 347
The Internet of Everything is re-shaping technology trends–moving away from “request/response” architecture to an “always-on” Streaming Web where data is in constant motion and secure, reliable communication is an absolute necessity. As more and more THINGS go online, the challenges that developers will need to address will only increase exponentially. In his session at @ThingsExpo, Todd Greene, Founder & CEO of PubNub, exploreed the current state of IoT connectivity and review key trends and technology requirements that will drive the Internet of Things from hype to reality.
Nov. 30, 2015 10:45 AM EST Reads: 469
Two weeks ago (November 3-5), I attended the Cloud Expo Silicon Valley as a speaker, where I presented on the security and privacy due diligence requirements for cloud solutions. Cloud security is a topical issue for every CIO, CISO, and technology buyer. Decision-makers are always looking for insights on how to mitigate the security risks of implementing and using cloud solutions. Based on the presentation topics covered at the conference, as well as the general discussions heard between sessions, I wanted to share some of my observations on emerging trends. As cyber security serves as a fou...
Nov. 30, 2015 10:30 AM EST Reads: 361