Workshop
Coordinated Incident Response
Ineffective incident response can affect revenue, team morale, and development velocity. There's nothing more stressful than siloed communication while debugging a production outage. Datadog's Incident Response tools—Incident Management and On-Call—enable you to manage your team's incident response in a central location. Within Datadog, you're able to declare incidents in response to alerts, page other teams, investigate issues, coordinate response efforts, and automate post-incident tasks, all without switching contexts or tools.
In this hands-on workshop, you'll work through a realistic incident scenario using Datadog On-Call to configure schedules and get paged when monitors trigger, Incident Management to coordinate response through the incident workbench and timeline, Slack for ChatOps collaboration through dedicated incident channels, Status Pages to communicate with external stakeholders, and Incident Automations to streamline repetitive tasks. You'll also generate postmortems and explore Incident Analytics to improve your response process over time.
This workshop uses a dedicated Slack workspace. A Slack account is required for hands-on labs.