Welcome!

Containers Expo Blog Authors: Pat Romanski, Liz McMillan, Elizabeth White, Craig Lowell, Scott Allen

Related Topics: Containers Expo Blog, Java IoT, Industrial IoT, Microservices Expo

Containers Expo Blog: Article

Data Virtualization for BI Agility – a One-Trick Pony Won’t Cut It

Data virtualization thus needs to be built-on data integration to truly enable BI agility

In a recent article, CIO.com said that analytics and BI will be the top technology priorities for CIOs in 2012, based on a Gartner Inc. survey of IT executives. However, if you look back in time, reports show that BI was a top priority even then. Although we have fast-forwarded many years, the priorities haven't really changed. BI is still top of mind.

Granted, the amount of data that needs to be processed is growing by the day, and the need for businesses to have timely insight into things that matter is becoming more immediate. But wasn't this the case earlier as well? Businesses have always had this mindset - hence the reason for growth and continuous innovation.

What's new? Nothing, on the face of it. Except that with all things being equal, the fundamental problem, or shall I say problems, seem to have taken a backseat, yet again. We seem to keep talking about the symptoms instead of treating the issue at hand. In a recent report by Gleanster, LLC, the biggest challenges for enabling BI agility, are:

  • Breaking down data / departmental silos
  • Integrating with applications (e.g., CRM), operations and other platforms
  • Achieving acceptable data quality

The report also points out that the most commonly used metrics by businesses are time-to-decision or time-to-response to information requests; information access (comprehensiveness, accuracy, and consistency); and volume and quality or actionable insights. These, in essence, are the fundamental requirements that need to be fulfilled to the hilt in order to enable BI agility.

For those in the know, this is not something BI tools can address on their own. A recent blog by Forrester Research, Inc., states that traditional BI approaches often fall short because BI hasn't fully empowered information workers, who still largely depend on IT, and because BI platforms, tools, and applications aren't agile enough. Now that we have this background in place, I can start my analysis.

Based on what we are seeing in some ongoing polls, without the underpinnings of a self-service driven agile data integration strategy in place, BI agility will continue to remain a pipedream. Yes, of late, data virtualization has emerged as an agile data integration approach that can enable BI agility. But as all solutions are not created equal, let's try to address the challenges we discussed with the proposed solution.

As I always say, the devil's in the details. Data virtualization built on data federation does one thing and only one thing very well - it accesses and merges data from several different data sources, in real-time, without physical data movement. It can turn many data silos into one and integrate with applications. But how about data quality? Is federated data truly ready for consumption? All I hear is silence.

A BI tool won't do anything to improve data quality as it simply assumes the availability of the most current and accurate data. What happens if there are inaccuracies and inconsistencies after federating data across various systems in real time? A more fundamental question - what if you cannot effectively analyze and profile the federated data in the first place? Well, you need further processing.

Did you read the fine print? I think it just said, deal with it. Or worse yet, I have also heard the excuse - BI tools do not expect consistent and accurate data. Very convenient wouldn't you say? Bottom line, you not only lose the time advantage that you gained in not moving the data physically, but you now have to deal with quality and consistency on a reactive basis. So much for an agile data integration approach.

We discussed quality and consistency. Now, how about the role of business users? Shouldn't the analyst define business entities, analyze and identify issues with the data, create rules to correct inaccuracies and inconsistencies, and then play a key part in making sure the federated data is as requested? Ask any BI professional, business users know the data the best. Data federation does little to get them involved.

Next, let's talk about the role of IT. Is it just about prioritizing a backlog of growing requests, building out the solution, testing and then deploying it? Shouldn't IT interact with the analyst instantly and throughout the process? This is critical to IT building exactly what the business wanted. Without self-service, agility can't be ensured. However, data federation has been typically a coding-heavy IT tool.

Although data federation has been around for a long time, it hasn't gone too far. Data virtualization built on data federation seems to be a case of doing the same thing again, and expecting a different answer. Federating data across many diverse data sources, in real time, without physical data movement, is what I call, par for the course. To enable BI agility, you need to go beyond looking under the hood.

Since data virtualization built on data federation cannot profile both data sources and logic, apply complex data quality rules and advanced data transformations on federated data as it is in flight, involve the business user early and often, and reuse the virtual views not just for BI tools, portals and composite applications, but also for batch - it looks like we have a choice to make?

The choices are - manual coding, further processing using other tools, and custom solutions. Really! Is this truly a choice you have the luxury or the extra budget to make? Are you going to sign-up for a solution that promises agility and then leaves a major portion of the task to you or to another tool? What's even more dangerous is that lack of critical functionality is simply passed off as good-to-have.

The Gartner Magic Quadrant for Data Integration Tools, October 27, 2011, says it well - it's "the ability to switch seamlessly and transparently between delivery modes (bulk/batch vs. granular real-time vs. federation) with minimal rework." Data virtualization thus needs to be built-on data integration to truly enable BI agility. Having said that, I believe the days of a one-trick pony are numbered.

•   •   •

Don't forget to join me at Informatica World 2012, May 15-18 in Las Vegas, to learn the tips, tricks and best practices for using the Informatica Platform to maximize your return on big data, and get the scoop on the R&D innovations in our next release, Informatica 9.5. For more information and to register, visit www.informaticaworld.com.

More Stories By Ash Parikh

Ash Parikh is responsible for driving Informatica’s product strategy around real-time data integration and SOA. He has over 17 years of industry experience in driving product innovation and strategy at technology leaders such as Raining Data, Iopsis Software, BEA, Sun and PeopleSoft. Ash is a well-published industry expert in the field of SOA and distributed computing and is a regular presenter at leading industry technology events like XMLConference, OASIS Symposium, Delphi, AJAXWorld, and JavaOne. He has authored several technical articles in leading journals including DMReview, AlignJournal, XML Journal, JavaWorld, JavaPro, Web Services Journal, and ADT Magazine. He is the co-chair of the SDForum Web services SIG.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@ThingsExpo Stories
IoT generates lots of temporal data. But how do you unlock its value? You need to discover patterns that are repeatable in vast quantities of data, understand their meaning, and implement scalable monitoring across multiple data streams in order to monetize the discoveries and insights. Motif discovery and deep learning platforms are emerging to visualize sensor data, to search for patterns and to build application that can monitor real time streams efficiently. In his session at @ThingsExpo, ...
Verizon Communications Inc. (NYSE, Nasdaq: VZ) and Yahoo! Inc. (Nasdaq: YHOO) have entered into a definitive agreement under which Verizon will acquire Yahoo's operating business for approximately $4.83 billion in cash, subject to customary closing adjustments. Yahoo informs, connects and entertains a global audience of more than 1 billion monthly active users** -- including 600 million monthly active mobile users*** through its search, communications and digital content products. Yahoo also co...
"There's a growing demand from users for things to be faster. When you think about all the transactions or interactions users will have with your product and everything that is between those transactions and interactions - what drives us at Catchpoint Systems is the idea to measure that and to analyze it," explained Leo Vasiliou, Director of Web Performance Engineering at Catchpoint Systems, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York Ci...
"Tintri was started in 2008 with the express purpose of building a storage appliance that is ideal for virtualized environments. We support a lot of different hypervisor platforms from VMware to OpenStack to Hyper-V," explained Dan Florea, Director of Product Management at Tintri, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
The best-practices for building IoT applications with Go Code that attendees can use to build their own IoT applications. In his session at @ThingsExpo, Indraneel Mitra, Senior Solutions Architect & Technology Evangelist at Cognizant, provided valuable information and resources for both novice and experienced developers on how to get started with IoT and Golang in a day. He also provided information on how to use Intel Arduino Kit, Go Robotics API and AWS IoT stack to build an application tha...
SYS-CON Events announced today that LeaseWeb USA, a cloud Infrastructure-as-a-Service (IaaS) provider, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. LeaseWeb is one of the world's largest hosting brands. The company helps customers define, develop and deploy IT infrastructure tailored to their exact business needs, by combining various kinds cloud solutions.
Whether your IoT service is connecting cars, homes, appliances, wearable, cameras or other devices, one question hangs in the balance – how do you actually make money from this service? The ability to turn your IoT service into profit requires the ability to create a monetization strategy that is flexible, scalable and working for you in real-time. It must be a transparent, smoothly implemented strategy that all stakeholders – from customers to the board – will be able to understand and comprehe...
The cloud market growth today is largely in public clouds. While there is a lot of spend in IT departments in virtualization, these aren’t yet translating into a true “cloud” experience within the enterprise. What is stopping the growth of the “private cloud” market? In his general session at 18th Cloud Expo, Nara Rajagopalan, CEO of Accelerite, explored the challenges in deploying, managing, and getting adoption for a private cloud within an enterprise. What are the key differences between wh...
SYS-CON Events announced today that 910Telecom will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Housed in the classic Denver Gas & Electric Building, 910 15th St., 910Telecom is a carrier-neutral telecom hotel located in the heart of Denver. Adjacent to CenturyLink, AT&T, and Denver Main, 910Telecom offers connectivity to all major carriers, Internet service providers, Internet backbones and ...
SYS-CON Events announced today that Venafi, the Immune System for the Internet™ and the leading provider of Next Generation Trust Protection, will exhibit at @DevOpsSummit at 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Venafi is the Immune System for the Internet™ that protects the foundation of all cybersecurity – cryptographic keys and digital certificates – so they can’t be misused by bad guys in attacks...
Large scale deployments present unique planning challenges, system commissioning hurdles between IT and OT and demand careful system hand-off orchestration. In his session at @ThingsExpo, Jeff Smith, Senior Director and a founding member of Incenergy, will discuss some of the key tactics to ensure delivery success based on his experience of the last two years deploying Industrial IoT systems across four continents.
There will be new vendors providing applications, middleware, and connected devices to support the thriving IoT ecosystem. This essentially means that electronic device manufacturers will also be in the software business. Many will be new to building embedded software or robust software. This creates an increased importance on software quality, particularly within the Industrial Internet of Things where business-critical applications are becoming dependent on products controlled by software. Qua...
SYS-CON Events has announced today that Roger Strukhoff has been named conference chair of Cloud Expo and @ThingsExpo 2016 Silicon Valley. The 19th Cloud Expo and 6th @ThingsExpo will take place on November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. "The Internet of Things brings trillions of dollars of opportunity to developers and enterprise IT, no matter how you measure it," stated Roger Strukhoff. "More importantly, it leverages the power of devices and the Interne...
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to imp...
In addition to all the benefits, IoT is also bringing new kind of customer experience challenges - cars that unlock themselves, thermostats turning houses into saunas and baby video monitors broadcasting over the internet. This list can only increase because while IoT services should be intuitive and simple to use, the delivery ecosystem is a myriad of potential problems as IoT explodes complexity. So finding a performance issue is like finding the proverbial needle in the haystack.
The Internet of Things will challenge the status quo of how IT and development organizations operate. Or will it? Certainly the fog layer of IoT requires special insights about data ontology, security and transactional integrity. But the developmental challenges are the same: People, Process and Platform. In his session at @ThingsExpo, Craig Sproule, CEO of Metavine, demonstrated how to move beyond today's coding paradigm and shared the must-have mindsets for removing complexity from the develo...
SYS-CON Events announced today that MangoApps will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. MangoApps provides modern company intranets and team collaboration software, allowing workers to stay connected and productive from anywhere in the world and from any device.
Basho Technologies has announced the latest release of Basho Riak TS, version 1.3. Riak TS is an enterprise-grade NoSQL database optimized for Internet of Things (IoT). The open source version enables developers to download the software for free and use it in production as well as make contributions to the code and develop applications around Riak TS. Enhancements to Riak TS make it quick, easy and cost-effective to spin up an instance to test new ideas and build IoT applications. In addition to...
The 19th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Digital Transformation, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportuni...
"We've discovered that after shows 80% if leads that people get, 80% of the conversations end up on the show floor, meaning people forget about it, people forget who they talk to, people forget that there are actual business opportunities to be had here so we try to help out and keep the conversations going," explained Jeff Mesnik, Founder and President of ContentMX, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.