Welcome!

Containers Expo Blog Authors: Liz McMillan, Pat Romanski, Yeshim Deniz, Elizabeth White, Zakia Bouachraoui

Related Topics: @CloudExpo, Java IoT, Microservices Expo, Containers Expo Blog, Government Cloud, @DXWorldExpo, SDN Journal

@CloudExpo: Article

Lightning in the Clouds, Big Data

Venture into the clouds, but prepare for lightning

Cloud computing has been marketed as one of the key advances in technology and every day we hear about new areas where cloud services are being utilized. Cloud is the bright shining star that is being leveraged for it's elastic, on-demand, resource pooling capabilities. However there have been Cloud outages recently that have adversely impacted businesses. These outages highlight the risks of the Cloud and bring into focus that such risks need to be effectively managed. Cloud outages are like lightning in the Clouds, lightning can cause problems where it strikes and preparation is important to avoid damage.

This year Amazon, Salesforce, Google, Gmail, Google App Engine, Microsoft Office 365, Azure had outages that impacted businesses. During this holiday season, some Netflix subscribers were hit with an outage on Christmas Eve that was caused by Amazon cloud servers.  Amazon tracked the issue to Elastic Load Balancing, that enables spreading traffic across many servers.The wasn't good timing for the outage as subscribers were looking forward to watching movies during this period. Microsoft Azure storage had an outage during the holidays that impacted the management portal. Big Data services and  providers have also reported access issues.

There are many ways of minimizing risks associated with such outages.  The key is to distribute the load that comes in and to have redundancy, so that failure in one area can be picked up by another area. There are many approaches to achieving such redundancy. One way to do this is to distribute systems across locations so that if one location is down, the other one can pick up the load. Multiple location failover can be more expensive, hence the costs have to be weighed against the benefits and availability requirements. Another approach is to have back up clouds for applications with requirements for high availability, to which traffic can be diverted to reduce the risks of downtime in a specific cloud. Again, this can be more costly compared to having the regular clouds. A similar approach is to have the applications spread out across many clouds, so that if there are downtime issues, not all applications go down at once.   If applications are located in many clouds, this can enhance availability if one of the clouds goes down.

It is important to have adequate monitoring for applications and cloud services to be informed when the services are down and to take necessary actions. Both availability and performance monitoring should be conducted to identify any problems. Availability monitoring tracks if the applications and services are up and running, performance monitoring looks into  performance metrics such as response time for applications and services. Availability and performance should be specified for services and applications based on the requirements.  Key events have to be identified for monitoring and specific alerts have to be set up and as soon as these alerts send notifications, specific actions should be taken.  All these aspects should be defined in a plan that lays out all the details of events and related actions so that any outages can be handled effectively. Preparation is key to ensure the proper actions are taken to address and recover from any such issues. The risks related to Cloud outages can be managed by having the proper approaches for availability and performance, regularly monitoring and taking appropriate actions in a timely manner.

(This has been extracted from and is reference to Ajay Budhraja's blog)

More Stories By Ajay Budhraja

Ajay Budhraja has over 24 years in Information Technology with experience in areas such as Executive leadership, management, strategic planning, enterprise architecture, system architecture, software engineering, training, methodologies, networks, and databases. He has provided Senior Executive leadership for nationwide and global programs and has implemented integrated Enterprise Information Technology solutions.

Ajay has a Masters in Engineering (Computer Science), and a Masters in Management and Bachelors in Engineering. He is a Project Management Professional certified by the PMI and is also CICM, CSM, ECM (AIIM) Master, SOA, RUP, SEI-CMMI, ITIL-F, Security + certified.

Ajay has led large-scale projects for big organizations and has extensive IT experience related to telecom, business, manufacturing, airlines, finance and government. He has delivered internet based technology solutions and strategies for e-business platforms, portals, mobile e-business, collaboration and content management. He has worked extensively in the areas of application development, infrastructure development, networks, security and has contributed significantly in the areas of Enterprise and Business Transformation, Strategic Planning, Change Management, Technology innovation, Performance management, Agile management and development, Service Oriented Architecture, Cloud.

Ajay has been leading organizations as Senior Executive, he is the Chair for the Federal SOA COP, Chair Cloud Solutions, MidTech Leadership Steering Committee member and has served as President DOL-APAC, AEA-DC, Co-Chair Executive Forum Federal Executive Institute SES Program. As Adjunct Faculty, he has taught courses for several universities. He has received many awards, authored articles and presented papers at worldwide conferences.

IoT & Smart Cities Stories
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-c...
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.
Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...
Enterprises have taken advantage of IoT to achieve important revenue and cost advantages. What is less apparent is how incumbent enterprises operating at scale have, following success with IoT, built analytic, operations management and software development capabilities - ranging from autonomous vehicles to manageable robotics installations. They have embraced these capabilities as if they were Silicon Valley startups.
As IoT continues to increase momentum, so does the associated risk. Secure Device Lifecycle Management (DLM) is ranked as one of the most important technology areas of IoT. Driving this trend is the realization that secure support for IoT devices provides companies the ability to deliver high-quality, reliable, secure offerings faster, create new revenue streams, and reduce support costs, all while building a competitive advantage in their markets. In this session, we will use customer use cases...