Skip to main content

Command Palette

Search for a command to run...

Leadership AI Incident Response: The First Six Hours

Your AI feature is on the front page of tech media. It is 9 am on a Tuesday. Your phone is ringing. Here is the six hour playbook for the leader whose job starts when engineering incident response ends.

Updated
โ€ข18 min read
Leadership AI Incident Response: The First Six Hours

Bottom line. When an AI incident hits public attention, the first six hours decide whether your company emerges with credibility intact or with a reputational wound that takes a year to heal. Engineering's job in those six hours is to stop the bleeding. Your job as a senior leader is something different: make four specific decisions in the first hour, four more in hours 2-6, and run a specific communications cadence that tells the story with enough honesty to earn trust without enough speculation to create new problems. This briefing is the leadership-side playbook, built for the leader who has not had a public AI incident before and doesn't know the specific moves.

Course 3's P4.4 covered AI incident response from the PM and engineering perspective. L3.4 is the leadership companion: what a CEO, CTO, VP Eng, or General Counsel actually does in the first six hours of a public AI incident. Different audience, different decisions, same underlying event. You need both frameworks if your company ships AI at any scale.


Why the first six hours matter more than the next six days

Start with the asymmetry. In most incidents, the tech response takes hours to days and the comms response takes days to weeks. For a public-attention AI incident, the ratio inverts: the public impression of your company is often locked in within the first six hours, and everything that happens afterward is fighting against (or, if you're lucky, reinforcing) that impression.

Why? Because in the first six hours:

  • Social media amplification happens or it doesn't.
  • The first journalist to cover the story picks their framing.
  • Your customers either hear from you or from the press.
  • Your employees either feel led or abandoned.
  • Regulators either see a responsive company or a hiding one.

A leader who responds well in the first six hours has a week to work on the substantive response with public patience on their side. A leader who responds poorly spends the next week digging out from a narrative that's already set. The six-hour window is the leverage point.

Five windows. The leadership decisions cluster in windows 1 and 2. Everything after hour 6 is execution on what you decided in the first six hours.


Hour 1: the four leadership decisions

When the incident surfaces, the first hour is for decisions only. Not statements, not fixes, not press releases โ€” decisions. Get four of them right and everything else becomes tractable. Get them wrong and the next six hours are recovery work.

Decision 1: severity classification

The question: is this a P0 (company-level crisis, all hands on deck), a P1 (serious, but scoped to one feature or one customer segment), a P2 (real but contained), or a P3 (noisy but not material)?

Why it matters: the severity drives everything downstream โ€” who wakes up, who cancels their Tuesday, how fast you communicate, how senior the spokesperson is. Over-classifying a P2 as a P0 wastes the company's response capacity on a problem that didn't need it. Under-classifying a P0 as a P2 means the right people don't hear about it until the damage is wider.

How to classify fast: use two questions.

  1. Is there public visibility? (Social media, press coverage, a viral screenshot.) Public visibility โ†’ P1 minimum, often P0.
  2. Is there material customer, user, or financial harm? (Not "upset user" โ€” real harm: wrong financial data, PII leak, content that could create legal exposure.) Harm โ†’ P0.

Two yeses = P0. One yes = P1. Zero yeses = P2 or below.

The failure mode: defaulting to "investigate first, classify later." No. Classify now on the information you have; reclassify later if needed. The cost of over-responding for the first hour and then downgrading is low; the cost of under-responding in the first hour and then upgrading is high because the response window has already closed.

Decision 2: the incident commander

The question: who is the single named person running this incident end to end until it's resolved?

Why it matters: without a single named incident commander, the response fragments. Every senior person gets half-involved, nobody has the full picture, decisions get made by whoever is loudest in the moment, and the commander role ends up being "whoever's on the Zoom call this hour." This is how six-hour windows get wasted on coordination instead of action.

Who should be the incident commander:

  • P0: ideally the CEO or COO, or an explicit senior deputy with full authority for the duration of the incident. The commander needs to be able to make any decision without needing to escalate.
  • P1: the VP of Engineering, the head of Product, or the Chief of Staff with an explicit temporary mandate.
  • P2: the relevant product or engineering director.

The commander owns the incident. They run the war room. They make the comms call. They decide when to stand down. Everyone else โ€” including executives above them โ€” supports the commander for the duration. The single most common incident response failure I've seen in 2026 is having no named commander, which is the organisational version of nobody being in charge.

The specific thing the commander needs in hour 1: explicit authority in writing (even a Slack message from the CEO suffices) and a dedicated channel for the incident response team. Everything else follows from those two.

Decision 3: the public posture

The question: what is your company's overall public posture going to be for this incident โ€” transparent / measured-response / legal-cautious / no-comment?

Why it matters: different postures are appropriate for different incident shapes, and choosing the posture in hour 1 prevents different senior people from improvising conflicting approaches later. Specifically:

  • Transparent: you will share what you know, admit what you don't, commit to updates on a specific cadence. Works for incidents where the facts are clear, the company's role is clear, and honesty is the cheapest path to trust.
  • Measured-response: you will acknowledge the incident, confirm you're investigating, commit to a timeline for more information. Works for incidents where the facts are still emerging and over-committing to specifics would be wrong.
  • Legal-cautious: you will confirm the incident and decline further comment pending investigation. Works for incidents with potential legal liability where any specific statement could be used against you.
  • No-comment: you will not address the incident publicly. Only works for P3 incidents that haven't reached broad public attention; using this on a P1 or P0 is corporate malpractice in 2026.

The 2026 reality: transparent is almost always the right posture for AI incidents because the alternative invites speculation that fills the information vacuum with worst-case assumptions. Legal-cautious can be appropriate for actual legal exposure (e.g., ongoing regulatory investigation), but it should be an explicit choice made with legal counsel, not a default.

The specific trap: drifting into legal-cautious because "we don't want to say the wrong thing" without explicit legal counsel guidance. This is defensive posturing masquerading as prudence and it consistently produces worse outcomes than thoughtful transparency.

Decision 4: the stand-up

The question: are you standing up a war room / incident response bridge, and who's on it?

Why it matters: coordination in the first six hours happens in real time, and a dedicated sync channel (war room, Zoom bridge, Slack huddle) is enormously more effective than async messaging for fast-moving incidents. The cost is a few hours of time from everyone involved; the value is decisions made in minutes instead of hours.

Who should be in the war room for a P0/P1:

  • The incident commander (running it).
  • Engineering lead for the affected feature.
  • Product or PM lead for the affected feature.
  • General counsel or senior legal.
  • Head of communications or senior PR.
  • Customer success / account management (to reach affected customers).
  • Executive sponsor (CEO, COO, or CTO โ€” present, not running).

Seven or eight people, always on. Incident commander runs the agenda in 30-minute cycles: status check, decision items, comms update, outstanding gaps, next cycle. Very structured; very fast.

The specific trap: letting the war room grow to 20+ people. A P0 incident room of 20 people is an audience, not a decision forum. Keep it small. Anyone else who needs to know can read the incident channel.


Hours 2-6: the four decisions after the first hour

Once the classification, commander, posture, and war room are set, the first hour is essentially done. Hours 2 through 6 contain a different set of leadership decisions โ€” slower, less urgent, but more consequential for the public narrative.

Decision 5: the first public statement

The question: what do you say publicly, and when?

Why it matters: the first public statement is usually the most-read and most-quoted piece of communication in the entire incident. It sets the frame. If it's late, vague, or defensive, you spend the rest of the cycle fighting against it.

Timing: the first public statement should land within 2-4 hours of the incident becoming public. Not 30 minutes (too fast, too many unknowns), not 12 hours (too late, vacuum has been filled by others). Two to four hours is fast enough to signal responsiveness and slow enough to be accurate.

Content: 4-7 sentences maximum. The ingredients from Course 3 P4.4:

  • Acknowledge the incident happened.
  • Name what you currently know factually.
  • Name what you're investigating and what you don't yet know.
  • Commit to a specific next-update time.
  • Provide a channel for affected parties to reach you.
  • If and only if you can do it honestly, name what you are doing right now to address it.

What to exclude: apologies that aren't specific to actions the company took, promises you can't keep (like "this will never happen again"), blame directed at vendors or users, and PR-flavored language that sounds rehearsed. The single most common 2026 first-statement failure is sounding rehearsed โ€” which reads as corporate distancing even when the content is accurate.

Decision 6: the customer outreach

The question: which customers do you reach out to individually, and how fast?

Why it matters: individual customer outreach is the highest-leverage move in an AI incident. A customer who hears from you before hearing from the press experiences the incident as a company that cares; a customer who hears from the press first experiences the incident as a company that's hiding. The difference in outcome is dramatic.

Who to reach out to individually in the first 4 hours:

  • Any customer you know was directly affected (their data, their users, their account).
  • Your top 5-10 customers by ARR, regardless of whether they were affected (they'll be asked about the incident by their own teams and boards).
  • Any customer in a regulated industry (banking, healthcare, legal) who will need to answer their own regulator's questions.
  • Any customer with a named "trust" requirement in their contract.

The outreach is a direct call or personal email from a named human (CSM, AE, or executive sponsor), not a mass notification. The message is: "We've had an incident. Here's what happened, here's what we're doing, here's what it means for your specific account, here's how to reach me with questions." Five sentences.

The specific trap: batching customer outreach with a mass email and no personal follow-up. Mass emails in incident windows read as corporate defense; individual outreach reads as human responsibility. The labor cost is real (a senior CSM can make 10-20 outreach calls per hour), but the ROI on customer retention is higher than any other response investment.

Decision 7: the internal communication

The question: what do employees hear, and when, from company leadership?

Why it matters: employees are a trust proxy. If your own team doesn't hear from leadership about a public incident, they experience the incident the same way the press does โ€” from the outside โ€” and they start to speculate. Internal speculation drives external leaks, contradicts your public messaging, and erodes the trust that makes future incident responses easier.

What internal communication looks like in the first 6 hours:

  • A company-wide message (Slack, email, whatever your channel is) within 1-2 hours of the incident becoming public. Not waiting for the external comms to land first.
  • The message acknowledges the incident, names the incident commander, and asks employees to direct questions to a specific internal channel.
  • Explicit guidance on external communication: "Do not comment to press or on social media. Direct all inquiries to [comms team]."
  • A commitment to a specific internal update (e.g., "We will send a second internal update by 3 PM today regardless of external status").

The specific thing employees need: confirmation that leadership knows and is responding. They don't need details of the root cause; they need to know that someone is in charge. Seven sentences is enough.

Decision 8: the regulatory notification

The question: which regulators or compliance bodies need to be notified, and on what timeline?

Why it matters: regulatory notification deadlines are legal obligations with hard consequences for missing them. The GDPR breach notification requirement is 72 hours. Sectoral rules (HIPAA, PCI, SOX) have their own timelines. State-level rules in the US have their own. Missing a deadline is often a more serious problem than the underlying incident.

In the first 6 hours, you don't need to have notified regulators. You need to have:

  • A named person (usually GC or a senior compliance person) who is tracking which notifications apply.
  • A working timeline for each notification, with deadlines.
  • An initial draft of each notification (the regulator doesn't receive it yet, but the draft exists).
  • A plan for how the notifications will be reviewed and sent before their respective deadlines.

The specific trap: assuming regulatory notifications can wait until "we have more information." Some jurisdictions require notification of potential breaches even before full investigation. Check with legal in the first hour; don't defer. The cost of an early-and-partial notification is lower than the cost of a late-and-full one.


The public communication cadence

One more element that deserves its own treatment: the rhythm of public updates. After the first statement, the tempo matters.

The right rhythm for a P0/P1 public incident:

  • Hour 0-2: incident surfaces, decisions made, war room up.
  • Hour 2-4: first public statement published.
  • Hour 4-8: individual customer outreach, internal communication, regulatory notification planning.
  • Hour 8-12: second public statement โ€” confirming status, naming progress, committing to next update.
  • Hour 12-24: third statement at 12 or 18 hour mark if the incident is ongoing; otherwise a final statement at 24 hours.
  • Day 2-3: substantive post-mortem published if the incident is resolved.

The key principle: never go silent. The worst communication pattern in an incident is the first statement at hour 2, then nothing for 24 hours, then a surprise post-mortem. The silence is where speculation fills the gap, and by the time you return with a substantive update, the audience has already built a narrative that your update has to fight against.

The second-worst pattern: excessive detail in the first statement, so that follow-ups have nothing new to say. Keep the first statement short. Save content for the rhythm.


A worked 2026 example: the six hours of a realistic public AI incident

Hour 0 (9:07 AM Tuesday): a tech news outlet publishes a story that your AI-powered support product, deployed at a major customer, produced a response recommending a competitor's service in an internal support thread. The screenshot is on Twitter within 20 minutes. By 9:30 AM, the story has 3,000 retweets and is picked up by two more outlets.

Hour 1 (9:30-10:30 AM):

  • 9:35: Incident surfaces in leadership Slack. The Chief of Staff pings the CEO.
  • 9:40: CEO names the VP Engineering as incident commander. Dedicated Slack channel spun up.
  • 9:45: IC runs first war room bridge. Severity classified as P1 (public visibility yes; material customer harm unclear pending investigation). Seven people on the bridge.
  • 9:55: Public posture decided: transparent. Rationale: facts are clear (the response was generated, it was visible, the screenshot is real), no legal exposure yet, transparency is cheapest path to trust.
  • 10:15: Initial findings from engineering: the response was generated by a retrieval-augmented feature; the competitor mention came from a knowledge base document that was incorrectly included in the retrieval scope. Fix is simple (restrict retrieval scope) and can ship in 30 minutes.
  • 10:25: First public statement drafted.

Hour 2-4 (10:30 AM - 12:30 PM):

  • 11:00 AM: First public statement published as a blog post and linked from the company's status page and Twitter. Five sentences. Acknowledges the issue, names the root cause at a high level, confirms the fix is in progress, commits to a detailed follow-up within 4 hours. Specific acknowledgment that the affected customer has been contacted.
  • 11:15 AM: VP CSM calls the affected customer directly. The customer is already aware (they're the ones whose employee took the screenshot). Call focuses on: we know, we fixed it, we'll follow up in writing within the hour, we want to support your internal comms.
  • 11:30 AM: Internal all-hands Slack message from the CEO. Seven sentences. Acknowledges the incident, names the IC, directs employees to the incident channel for updates, explicit instruction not to comment externally.
  • 12:00 PM: Second war room cycle. Engineering has deployed the fix to staging; production deploy in 15 minutes. No new affected customers identified.
  • 12:15 PM: Production fix deployed. IC monitors for regression.

Hour 4-6 (12:30-2:30 PM):

  • 12:45 PM: Second public statement: root cause in more detail, confirmation that the fix is deployed, commitment to a post-mortem within 24 hours.
  • 1:00 PM: Individual outreach to top 5 customers (not the one directly affected) with a brief "here's what happened, here's why it won't affect you, here's who to call with questions" message from their CSMs.
  • 1:30 PM: Legal review confirms no regulatory notification is required (incident didn't involve PII or breach categories). Confirmed in war room.
  • 2:00 PM: Internal Slack update. Employees receive a factual summary with a promise of a more detailed retrospective at the next all-hands.
  • 2:30 PM: Third public statement: "the fix is deployed, the incident is contained, we'll publish a full post-mortem tomorrow."

Next 24 hours: detailed post-mortem drafted, reviewed, and published. The story mostly dies on day 2 because the response was visibly competent.

Total leadership time invested in first 6 hours: ~30 hours across 8 people (4 hours average each). Total externally visible output: 3 blog posts, ~15 individual customer conversations, 2 internal Slack messages, 1 comprehensive post-mortem (next day).

Counterfactual if the response had been poorly handled: the first statement lands at hour 8 instead of hour 2, with no named IC until hour 4, no customer outreach in the first day, and a vague "we are investigating" posture. By the time the detailed post-mortem lands at 48 hours, the Twitter narrative has already set, two more outlets have published follow-ups, one customer has sent a concerned note to their board, and the company spends the next quarter in recovery mode. Estimated cost difference: ~$2-10M in customer trust and sales momentum. The playbook exists because the cost of skipping it is enormous.


The failure mode: "let's investigate first, then communicate"

The single most expensive leadership failure in AI incident response in 2026: delaying the first public statement until "we fully understand what happened". The logic is reasonable โ€” you don't want to communicate something that turns out to be wrong โ€” but the practical outcome is almost always worse.

Why it fails: investigations take time. "Fully understand" is often 12-48 hours away, sometimes longer. During that window, the public information vacuum is filled by speculation, social media threads, and early journalism. By the time you have a complete picture, the narrative has been written by others, and your complete-picture statement now has to fight against the existing narrative instead of establishing one.

The defence: publish what you know early, explicitly naming what you don't yet know. Update as you learn more. The public accepts "we know X, we don't yet know Y, we'll tell you by Z" because it's honest. The public rejects silence because silence feels like hiding.

The specific phrase to internalise: "tell them what you know, tell them what you're finding out, tell them when they'll hear from you again." Those three things cover the first statement, and they're all honestly available in hour 2 of any incident โ€” before the investigation is complete.


What to decide on Monday

  • Run a tabletop exercise in the next 30 days. Pick a realistic incident scenario. Walk through the first hour's four decisions and the hours 2-6 four more. Identify who your named IC would be and whether they have explicit authority in writing.
  • Draft your company's standard first-statement template now. Four to seven sentences. Leave blanks for the specifics. Make sure legal, comms, and the CEO have all pre-approved the structure.
  • Identify your war room composition in advance. Who's in the P0 war room, who's in the P1 war room? Have the Slack channels and bridges pre-provisioned so standing up the war room is a 30-second task.
  • Build your customer outreach list. Top 10 by ARR, plus any customers with explicit trust requirements. Keep it current. Your CSM team should be able to start calls within 30 minutes of an incident.
  • Confirm regulatory notification ownership. Named person (usually GC), named timelines, pre-drafted templates for the regulators that matter to your business.
  • Practice the communication cadence. First statement by hour 2-4, second by hour 8-12, post-mortem by day 2-3. Publish on rhythm even when there's nothing new to say; silence is the enemy.
  • Never delay communication until investigation is complete. Publish what you know; update as you learn.

And that closes Module L3 โ€” Money, Risk, Governance. You now have the three-bucket budget model, the five-question GC briefing, the vendor DPA audit, and the six-hour leadership incident playbook. Four briefings, four decisions, ready for when your AI program runs into the hard questions.

Next up: Module L4 โ€” Horizon and Action. Three briefings to close out Course 4. What the next 18 months probably look like, what to decide now vs defer vs ignore, and the reading list plus two habits that keep you current in 10 minutes a week.


Course navigation

โฌ…๏ธ Previous๐Ÿ“ You are hereNext โžก๏ธ
โฌ…๏ธ Previous
L3.3 ยท IP, Data, and Vendor Data Handling
L3.4 of L4.3Next โžก๏ธ
L4.1 ยท The Next 18 Months

๐Ÿ“š AI for Leaders ยท Course Home โ€” 15 briefings, four modules.


Cover photo via Unsplash. This post is part of the AI for Leaders series.

More from this blog

Learn AI - Zero to Hero

111 posts