Five Reasons to Ditch Email Alerts By @PagerDuty | @DevOpsSummit [#DevOps]

Want to improve your email alerts? Think again

Monitoring systems can help you manage your uptime, but no matter how much time you spend configuring checks and thresholds to catch problems early, your alerts are only as good as your incident response process. One of the biggest challenges we’ve seen when talking with customers is getting bogged down in email alerts. Despite the increasing disarray of our inboxes, many monitoring systems and IT Operations teams still rely on email for alerting, even though most agree it’s messy and too easy to miss. Looking to improve email alerts? Look again. Here are five reasons to ditch email alerts if you’re still using them:

1. Email alerts are too easy to miss

“Hey did you see this latest cat video my friend emailed to me?”

Even if you’re staring at your inbox constantly, it’s easy for a critical alert to get buried under other alerts or work-related emails. For this reason, top Operations teams typically use at least two notification channels, one of which is a phone call or SMS message. An audible sound with the alert definitely helps it get noticed.

2. You can’t assign an email to someone

“Um, is someone on this?”

Time is critical during a severe incident and you don’t want your team wondering about who’s on point for addressing it. If your alerts are getting emailed to multiple people, there’s no way to know for sure who on the team should respond first. Has someone else already seen the email and are they already working on it? Am I really the best person to respond, or should I wait for someone with more experience to take it? Top Operations teams with a strong culture of response make sure each incident is automatically assigned to the person responsible for fixing it. Incident management tools and ticketing systems can enforce this workflow by automatically assigning an incident to the engineer on-call and by tracking assignee status for each open incident.

In PagerDuty, we use your on-call schedules to determine who’s on point right now, and assign the incident accordingly.
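
To make the idea concrete, here is a minimal sketch of schedule-based assignment. It is not PagerDuty’s implementation; the `Shift` record and the `on_call_at` and `assign_incident` helpers are hypothetical names invented for this illustration, and it assumes shifts do not overlap.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List, Optional


@dataclass
class Shift:
    """One on-call shift (hypothetical structure for this sketch)."""
    engineer: str
    start: datetime  # inclusive
    end: datetime    # exclusive


def on_call_at(schedule: List[Shift], when: datetime) -> Optional[str]:
    """Return the engineer whose shift covers `when`, or None if nobody is scheduled."""
    for shift in schedule:
        if shift.start <= when < shift.end:
            return shift.engineer
    return None


def assign_incident(title: str, schedule: List[Shift], opened_at: datetime) -> dict:
    """Open an incident with exactly one assignee taken from the schedule."""
    return {
        "title": title,
        "opened_at": opened_at,
        "assignee": on_call_at(schedule, opened_at),
        "status": "triggered",
    }


utc = timezone.utc
schedule = [
    Shift("alice", datetime(2024, 1, 1, 0, tzinfo=utc), datetime(2024, 1, 1, 12, tzinfo=utc)),
    Shift("bob", datetime(2024, 1, 1, 12, tzinfo=utc), datetime(2024, 1, 2, 0, tzinfo=utc)),
]
incident = assign_incident("Database unreachable", schedule, datetime(2024, 1, 1, 13, tzinfo=utc))
print(incident["assignee"])
```

Because the assignee is recorded on the incident itself, nobody has to guess from a shared inbox who is on point. A real system would also need an escalation path for the case where `on_call_at` returns `None`.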

3. You can’t aggregate or bundle emails

“Will it ever stop?”

Alert storms suck. When stuff really goes wrong, all of your monitoring systems will be sending alerts, often multiple times per minute. Those alerts can quickly flood your inbox, making it virtually unusable. PagerDuty will aggregate alerts for a single incident and will bundle alerts for multiple incidents (after the first notification for each), so repeated alerts will notify you only once. Dashboards are helpful here too, so you can get a quick picture of how many incidents are open and where they’re coming from.
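
The aggregation half of that idea can be sketched in a few lines. This is an illustration only, not how PagerDuty does it: the `AlertAggregator` class and the idea of a per-incident `incident_key` string are assumptions made for the example. Every alert is recorded against its incident, but only the first one for a given key triggers a notification.

```python
from collections import defaultdict
from typing import Dict, List


class AlertAggregator:
    """Groups alerts by incident key (hypothetical helper for this sketch)."""

    def __init__(self) -> None:
        self.incidents: Dict[str, List[str]] = defaultdict(list)

    def ingest(self, incident_key: str, message: str) -> bool:
        """Record the alert; return True only when it opens a new incident."""
        is_new = incident_key not in self.incidents
        self.incidents[incident_key].append(message)
        return is_new


aggregator = AlertAggregator()
alert_storm = [
    ("db-01/disk", "disk at 91%"),
    ("db-01/disk", "disk at 93%"),
    ("web-02/http", "5xx rate spike"),
    ("db-01/disk", "disk at 95%"),
]
# Only alerts that open a new incident produce a notification.
notified = [key for key, message in alert_storm if aggregator.ingest(key, message)]
print(notified)
print(len(aggregator.incidents["db-01/disk"]))
```

Four alerts arrive, but only two notifications go out, and the three disk alerts stay attached to one incident for whoever picks it up.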

4. Email doesn’t offer visibility for the team

“What’s the latest status?”

It’s hard to tell from email who’s working on an incident, how long it has been open, and what the latest status is. This information is useful not only to your team, but also to your management and other business stakeholders. It’s annoying to be pinged constantly by people wanting an update on the issue when you’re trying to fix it. By moving your incidents into a system like PagerDuty, you can get all of this information in a single dashboard view that’s accessible to management as well as everyone on your team. We can’t promise that the CEO and CTO won’t still ask, but at least there’s a place you can direct them to where they can get the information for themselves.

5. You can’t create metrics with email alerts

“How are we doing?”

Top Operations teams track metrics to continually measure, evaluate, and improve their performance. We’ve blogged before about what metrics you should track, and all of them would be incredibly difficult to measure from emails. Tracking when an incident is opened, how long it takes for the first person to notice and respond, and ultimately how long it takes your team to resolve it is critical for proactively managing your uptime. With this data, you can create dashboards on team performance and weekly reports to facilitate conversations within your team and company.
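
Once each incident carries timestamps, these numbers take only a few lines to compute. The sketch below assumes a hypothetical `Incident` record with `opened_at`, `acknowledged_at`, and `resolved_at` fields; the field and function names are made up for illustration and are not taken from any particular tool.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean
from typing import List


@dataclass
class Incident:
    """Timestamps for one incident (hypothetical structure for this sketch)."""
    opened_at: datetime
    acknowledged_at: datetime
    resolved_at: datetime


def mean_time_to_acknowledge(incidents: List[Incident]) -> timedelta:
    """Average delay between an incident opening and the first response."""
    seconds = mean([(i.acknowledged_at - i.opened_at).total_seconds() for i in incidents])
    return timedelta(seconds=seconds)


def mean_time_to_resolve(incidents: List[Incident]) -> timedelta:
    """Average delay between an incident opening and its resolution."""
    seconds = mean([(i.resolved_at - i.opened_at).total_seconds() for i in incidents])
    return timedelta(seconds=seconds)


incidents = [
    Incident(datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 1, 9, 2), datetime(2024, 1, 1, 9, 30)),
    Incident(datetime(2024, 1, 2, 14, 0), datetime(2024, 1, 2, 14, 4), datetime(2024, 1, 2, 15, 0)),
]
print(mean_time_to_acknowledge(incidents))
print(mean_time_to_resolve(incidents))
```

None of these timestamps exist in an inbox: an email records when the alert was sent, not when someone took ownership or when the problem was fixed.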

Want to learn more about incident resolution best practices and how IT stacks up today? Email alerts may be only one challenge you’re facing, but you’re not alone. Learn more about the key facets of an intelligent incident resolution strategy and common challenges in a commissioned study conducted by Forrester Consulting on behalf of PagerDuty. Download the study to read more.

The post 5 Reasons to Ditch Email Alerts appeared first on PagerDuty.

PagerDuty’s operations performance platform helps companies increase reliability. By connecting people, systems and data in a single view, PagerDuty delivers visibility and actionable intelligence across global operations for effective incident resolution management. PagerDuty has over 100 platform partners, and is trusted by Fortune 500 companies and startups alike, including Microsoft, National Instruments, Electronic Arts, Adobe, Rackspace, Etsy, Square and GitHub.
