Subscribe to the Non-Human & AI Identity Journal

What do organisations get wrong about crisis tabletop exercises?

Many exercises test whether teams can follow a scenario, but not whether they can adapt when the scenario breaks. That misses the real risk. A useful exercise should force incomplete information, shifting priorities, and rapid handoffs so the organisation can see where decision-making fails in practice.

Why This Matters for Security Teams

Most crisis tabletops fail because they become rehearsals for compliance, not probes of decision quality. Teams often over-script the scenario, assign the right people, and then assume a clean run means readiness. That misses the point: real incidents introduce ambiguity, partial telemetry, conflicting priorities, and handoff failures. The result is a false sense of confidence that only surfaces when a live event breaks the plan.

For identity-heavy incidents, this is especially dangerous. Non-human identities are frequently overlooked until they are the thing that widens the blast radius, and NHI Mgmt Group notes that 80% of identity breaches involved compromised non-human identities such as service accounts and API keys in the Ultimate Guide to NHIs. That is a reminder that the exercise should test revocation, containment, and privilege reduction under pressure, not just who can read the incident script. Good exercises also map to broader governance expectations in the NIST Cybersecurity Framework 2.0, especially around response coordination and recovery. In practice, many security teams discover their weakest point only when a tabletop forces a real decision on access, scope, or business continuity.

How It Works in Practice

A useful tabletop starts with an intentionally incomplete picture. Instead of giving teams a neat timeline, facilitators should reveal logs late, change the source of compromise mid-exercise, and introduce business pressure such as a customer deadline, a regulator call, or an executive override. The goal is to see whether the organisation can prioritise containment, preserve evidence, and keep services running without assuming every answer is available at once.

This is where identity and secrets handling should be tested directly. If a service account, API key, or automation token is suspected, participants should have to decide who can revoke it, how quickly, and what breaks if it is disabled. That aligns with guidance in the Ultimate Guide to NHIs, which emphasises lifecycle control, visibility, and rotation. It also fits the NIST Cybersecurity Framework 2.0 approach of integrating response, communications, and recovery rather than treating them as isolated tasks.

  • Test whether the incident commander can change priorities when the first assumption proves wrong.
  • Force handoffs between security, infrastructure, legal, and operations so gaps become visible.
  • Make identity actions concrete: revoke, rotate, quarantine, or reissue credentials during the scenario.
  • Measure how long it takes to identify affected systems, not just how well the team narrates the incident.

When the exercise is well run, the value comes from the friction it exposes: unclear ownership, delayed approvals, missing inventories, and recovery steps that depend on tribal knowledge. These controls tend to break down when the organisation has no live inventory of service accounts or when response authority is split across too many teams, because the exercise then reveals process gaps instead of decision gaps.

Common Variations and Edge Cases

Tighter tabletop design often increases coordination overhead, requiring organisations to balance realism against the time available and the political cost of exposing weak spots. That tradeoff is unavoidable, especially when executives want a polished event and operators need a disruptive one. Current guidance suggests the best exercises are scoped tightly enough to be actionable, but messy enough to reveal whether the organisation can actually absorb surprise.

There is no universal standard for this yet, but mature programmes usually vary the exercise by objective. A ransomware tabletop should stress recovery sequencing and communications. A third-party compromise should stress trust boundaries, dependency mapping, and offboarding. An identity-focused scenario should force immediate questions about privilege, rotation, and secrets exposure. If the environment relies on automation, the exercise should also include service accounts and machine credentials, because manual playbooks often assume human approval paths that do not exist in practice.

Organisations also get stuck when they treat a tabletop as a one-time event. The better pattern is to record the decisions that failed, then retest them after remediation. If a team cannot tell who owns a credential, who can approve revocation, or how to validate business impact after shutdown, the exercise has done its job by surfacing operational debt. That is why current best practice is to use each tabletop as a controlled stress test of governance, not a ceremonial pass-fail meeting.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Exercises should test NHI rotation and revocation under pressure.
NIST CSF 2.0 RS.MA-1 Tabletops should validate incident response coordination and execution.
NIST AI RMF AI RMF emphasises governance and accountability for operational decisions.

Use tabletop outputs to verify response roles, communications, and escalation paths.