August 29, 2024

On-Call

🚨 Alert Grouping

Alert grouping reduces noise and alert fatigue by consolidating related alerts into a single notification. This improves response efficiency, enhances prioritization, simplifies communication, and ultimately leads to faster incident resolution and better overall system reliability.

Alert grouping is especially helpful for organizations with more robust observability stacks that use separate monitors for different aspects of a given service: for example, one monitor for error rates, one for latency, one for CPU, and so on, possibly spread across multiple monitoring tools as alert sources. When something goes wrong with that service, several or all of the related monitors may start sending alerts. Without automatic alert grouping, it’s up to the responder to work out whether the alerts are related and what to do about them.

With alert grouping, the responder in this example is paged only for the first alert that comes in. Because Rootly identifies the subsequent alerts as related to the initial alert, the responder is not paged for each monitor that fires. Instead, the additional alerts appear as Alert Group Members under the original alert, which Rootly calls the Alert Group Leader.

Alert Groups are shown together along with the services, teams, and incident they're related to.

Grouped alerts are also shown together in a nested view on the Alert Index page:

Alert group members are nested under the alert group leader in the Alert Index page.

To set up alert grouping, head to Alerts from the left-hand navigation bar in Rootly web, then click the Grouping tab.

Click + New Alert Group and enter a name (required) and a description (optional).

Under Destinations, you’ll select the Services, Teams, and/or Escalation Policies you’d like to include in this group.

Under Time Window, specify a time period during which the Alert Group should stay open and accept new alerts. Our default recommendation is 10 minutes. 

Optionally, you can define additional requirements for the Alert Group under Content Matching by requiring alerts in the group to have the same title, urgency, or any other payload field associated with your alerts.
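To make the interaction between Destinations, Time Window, and Content Matching concrete, here is a minimal Python sketch of one way such a rule could behave. It is an illustration only, not Rootly's implementation: the `Alert`, `AlertGroup`, and `Grouper` names are hypothetical, destinations are reduced to service names, and the time window is assumed to be measured from the leader's arrival.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class Alert:
    title: str
    service: str
    urgency: str
    received_at: datetime


@dataclass
class AlertGroup:
    leader: Alert                      # first alert; the only one that pages
    members: list = field(default_factory=list)


class Grouper:
    """Hypothetical rule: same destinations, inside the window, matching fields."""

    def __init__(self, services, window_minutes=10, match_fields=()):
        self.services = set(services)                    # "Destinations" (assumed: services only)
        self.window = timedelta(minutes=window_minutes)  # "Time Window"
        self.match_fields = tuple(match_fields)          # "Content Matching"
        self.groups = []

    def ingest(self, alert):
        """Return True if the alert should page, False if it was grouped as a member."""
        if alert.service not in self.services:
            return True  # not covered by this rule; page as usual
        for group in self.groups:
            leader = group.leader
            age = alert.received_at - leader.received_at
            in_window = timedelta(0) <= age <= self.window
            same_content = all(
                getattr(alert, name) == getattr(leader, name)
                for name in self.match_fields
            )
            if in_window and same_content:
                group.members.append(alert)
                return False  # nested under the leader, no new page
        # No open group accepted the alert, so it becomes a new leader.
        self.groups.append(AlertGroup(leader=alert))
        return True


start = datetime(2024, 8, 29, 12, 0)
grouper = Grouper(services={"checkout"}, window_minutes=10, match_fields=("urgency",))
grouper.ingest(Alert("High error rate", "checkout", "high", start))                     # pages (leader)
grouper.ingest(Alert("High latency", "checkout", "high", start + timedelta(minutes=3)))  # grouped
grouper.ingest(Alert("High CPU", "checkout", "high", start + timedelta(minutes=15)))     # pages (window closed)
```

In this sketch, a shorter window or more required matching fields produces more groups and therefore more pages, while a longer window or fewer required fields consolidates more aggressively.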

Configuring a new Alert Group is quick and easy.

Here's Alex to take you on a video tour of Alert Grouping!

🌝 New & Improved

🆕 Added a checkbox to have related alerts automatically resolved when an incident is canceled. Teams no longer have to manually resolve each related alert when canceling an incident.

💅 Custom timestamp fields will now align to each user’s preferred timezone setting. Previously, only out-of-box timestamp fields were displayed in the user’s preferred timezone.

💅 Increased the volume of Rootly mobile app push notifications on iOS devices to help responders better hear incoming pages.

💅 Optimized backend logic in fetching incident subscribers to improve overall platform responsiveness.

🐛 Fixed a text-wrapping issue in the web UI when displaying action items with long names.

🐛 The alert details page now refreshes automatically when the status is updated.

🐛 Fixed an intermittent issue with action item events failing to trigger workflows.