@DevOpsSummit: Blog Feed Post

Five Reasons to Ditch Email Alerts By @PagerDuty | @DevOpsSummit [#DevOps]

Want to improve your email alerts? Think again

Monitoring systems can help you manage your uptime, but no matter how much time you spend configuring checks and thresholds to catch problems early, your alerts are only as good as your incident response process. One of the biggest challenges we hear about from customers is getting bogged down in email alerts. Our inboxes get messier every year, yet many monitoring systems and IT Operations teams still rely on email for alerting, even though most agree it’s messy and too easy to miss. Looking to improve email alerts? Look again. Here are five reasons to ditch email alerts if you’re still using them:

1. Email alerts are too easy to miss

“Hey did you see this latest cat video my friend emailed to me?”

Even if you’re staring at your email inbox constantly, it’s not hard to imagine a critical alert getting buried under other alerts or work-related emails. For this reason, top Operations teams typically use at least two notification channels, one of which is a phone call or SMS message. An alert that makes a sound is much harder to overlook.

2. You can’t assign an email to someone

“Um, is someone on this?”

Time is critical during a severe incident and you don’t want your team wondering about who’s on point for addressing it. If your alerts are getting emailed to multiple people, there’s no way to know for sure who on the team should respond first. Has someone else already seen the email and are they already working on it? Am I really the best person to respond, or should I wait for someone with more experience to take it? Top Operations teams with a strong culture of response make sure each incident is automatically assigned to the person responsible for fixing it. Incident management tools and ticketing systems can enforce this workflow by automatically assigning an incident to the engineer on-call and by tracking assignee status for each open incident.

In PagerDuty, we use your on-call schedules to determine who’s on point right now, and assign the incident accordingly.
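
To make the idea concrete, here is a minimal sketch of schedule-based assignment. It is not PagerDuty’s implementation or API: the `Shift` class, the `on_call_at` and `assign_incident` functions, and the two-shift schedule are all hypothetical names invented for illustration. The point is only that every incident ends up with exactly one owner, looked up from the schedule at the moment the incident opens.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import List, Optional


@dataclass
class Shift:
    engineer: str
    start: datetime  # inclusive
    end: datetime    # exclusive


def on_call_at(schedule: List[Shift], when: datetime) -> Optional[str]:
    """Return the engineer whose shift covers `when`, or None if there is a gap."""
    for shift in schedule:
        if shift.start <= when < shift.end:
            return shift.engineer
    return None


def assign_incident(incident: dict, schedule: List[Shift], when: datetime) -> dict:
    """Return a copy of the incident with exactly one assignee."""
    assignee = on_call_at(schedule, when)
    if assignee is None:
        # A gap in the schedule is a process problem; fail loudly, don't guess.
        raise LookupError("schedule gap: nobody is on call")
    return {**incident, "assignee": assignee}


# Hypothetical two-shift day, for illustration only.
day = datetime(2024, 1, 1, tzinfo=timezone.utc)
schedule = [
    Shift("alice", day, day + timedelta(hours=12)),
    Shift("bob", day + timedelta(hours=12), day + timedelta(hours=24)),
]
incident = assign_incident({"title": "db latency"}, schedule, day + timedelta(hours=15))
print(incident["assignee"])  # prints "bob"
```

Compare that with an email sent to a distribution list, where the “assignee” is whoever happens to read it first.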

3. You can’t aggregate or bundle emails

“Will it ever stop?”

Alert storms suck. When stuff really goes wrong, all of your monitoring systems will be sending alerts, multiple times per minute. Those alerts can quickly flood your inbox, making it virtually unusable. PagerDuty will aggregate alerts for a single incident and will bundle alerts for multiple incidents (after the first notification for each), so repeated alerts notify you only once. Dashboards are helpful here too, giving you a quick picture of how many incidents are open and where they’re coming from.
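
Here is a rough sketch of the aggregation half of that idea, under stated assumptions: alerts are plain dictionaries, and two alerts belong to the same incident when they share a service and a check name. The `dedup_key` and `aggregate` functions and the sample alert fields are hypothetical, not PagerDuty’s actual grouping rules.

```python
from typing import Dict, List, Tuple

Alert = Dict[str, str]
Key = Tuple[str, str]


def dedup_key(alert: Alert) -> Key:
    """Assumed grouping rule: same service + same check = same incident."""
    return (alert["service"], alert["check"])


def aggregate(alerts: List[Alert]) -> Tuple[Dict[Key, List[Alert]], List[Key]]:
    """Group alerts into incidents and list which incidents need a notification.

    Only the first alert for a given key triggers a notification; later
    alerts are attached to the open incident silently.
    """
    incidents: Dict[Key, List[Alert]] = {}
    to_notify: List[Key] = []
    for alert in alerts:
        key = dedup_key(alert)
        if key not in incidents:
            incidents[key] = []
            to_notify.append(key)
        incidents[key].append(alert)
    return incidents, to_notify


# A small, made-up alert storm.
storm = [
    {"service": "db", "check": "latency", "msg": "p99 > 2s"},
    {"service": "db", "check": "latency", "msg": "p99 > 3s"},
    {"service": "web", "check": "http_5xx", "msg": "error rate 12%"},
    {"service": "db", "check": "latency", "msg": "p99 > 5s"},
]
incidents, to_notify = aggregate(storm)
print(len(storm), "alerts ->", len(to_notify), "notifications")
# prints "4 alerts -> 2 notifications"
```

An inbox has no equivalent of `dedup_key`: each of those four alerts would arrive as its own message.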

4. Email doesn’t offer visibility for the team

“What’s the latest status?”

It’s hard to tell from email who’s working on an incident, how long it has been open, and what the latest status is. This information is useful not only to your team, but also to your management and other business stakeholders. It’s annoying to be pinged constantly by people wanting an update on the issue when you’re trying to fix it. By moving your incidents into a system like PagerDuty, you can get all of this information in a single dashboard view that’s accessible to management as well as everyone on your team. We can’t promise that the CEO and CTO won’t still ask, but at least there’s a place you can direct them to where they can get the information for themselves.

5. You can’t create metrics with email alerts

“How are we doing?”

Top Operations teams track metrics to continually measure, evaluate, and improve their performance. We’ve blogged before about what metrics you should track, and all of them would be incredibly difficult to measure from emails. Knowing when an incident was opened, how long it took for the first person to notice and respond, and ultimately how long it took your team to resolve it is critical for proactively managing your uptime. With this data, you can create dashboards on team performance and weekly reports to facilitate conversations within your team and company.
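
As an illustration of why structured incident records matter, the sketch below computes mean time to acknowledge and mean time to resolve from three timestamps per incident. The field names (`opened_at`, `acknowledged_at`, `resolved_at`), the `response_metrics` function, and the sample data are assumptions made for this example; they are not taken from any PagerDuty report or API.

```python
from datetime import datetime, timedelta
from statistics import mean
from typing import Dict, List


def response_metrics(incidents: List[Dict[str, datetime]]) -> Dict[str, float]:
    """Return mean time to acknowledge and to resolve, both in minutes."""
    ack_seconds = [
        (i["acknowledged_at"] - i["opened_at"]).total_seconds() for i in incidents
    ]
    resolve_seconds = [
        (i["resolved_at"] - i["opened_at"]).total_seconds() for i in incidents
    ]
    return {
        "mtta_minutes": mean(ack_seconds) / 60,
        "mttr_minutes": mean(resolve_seconds) / 60,
    }


# Two made-up incidents: acknowledged after 2 and 4 minutes,
# resolved after 30 and 60 minutes.
start = datetime(2024, 1, 1, 9, 0)
history = [
    {
        "opened_at": start,
        "acknowledged_at": start + timedelta(minutes=2),
        "resolved_at": start + timedelta(minutes=30),
    },
    {
        "opened_at": start + timedelta(hours=3),
        "acknowledged_at": start + timedelta(hours=3, minutes=4),
        "resolved_at": start + timedelta(hours=4),
    },
]
metrics = response_metrics(history)
print(metrics)  # prints {'mtta_minutes': 3.0, 'mttr_minutes': 45.0}
```

None of these three timestamps exists in an email thread unless someone reconstructs them by hand after the fact.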

Want to learn more about incident resolution best practices and how IT stacks up today? Email alerts may be only one challenge you’re facing, but you’re not alone. Learn more about the key facets of an intelligent incident resolution strategy and common challenges in a commissioned study conducted by Forrester Consulting on behalf of PagerDuty. Download the study to read more.

The post 5 Reasons to Ditch Email Alerts appeared first on PagerDuty.

More Stories By PagerDuty Blog

PagerDuty’s operations performance platform helps companies increase reliability. By connecting people, systems and data in a single view, PagerDuty delivers visibility and actionable intelligence across global operations for effective incident resolution management. PagerDuty has over 100 platform partners, and is trusted by Fortune 500 companies and startups alike, including Microsoft, National Instruments, Electronic Arts, Adobe, Rackspace, Etsy, Square and GitHub.
