How should security teams govern AI and workload identities at runtime?

Why This Matters for Security Teams

Runtime governance is where AI and workload identities either stay contained or turn into an incident multiplier. Static roles are a poor fit for autonomous systems because the same agent can call different tools, switch context, and chain actions faster than a human approval process can keep up. Current guidance suggests treating the identity as a live control point, not just a provisioning artifact, with telemetry, policy checks, and escalation thresholds enforced continuously. That approach aligns with NIST Cybersecurity Framework 2.0 and with the workload identity model described in the SPIFFE workload identity specification.

For NHI practitioners, the main issue is not whether the agent has a token, but whether that token should still be usable for the task it is trying to perform right now. That is why the identity lifecycle discussed in the Ultimate Guide to NHIs — Lifecycle Processes for Managing NHIs matters at runtime, not just at issuance. In practice, many security teams encounter excessive privilege only after an agent has already accessed more data or tools than intended, rather than through intentional governance.

How It Works in Practice

Effective runtime governance starts with workload identity, not shared secrets. An AI agent or workload should present a cryptographic identity that can be verified at request time, then receive short-lived access only if the current action matches approved policy. That is where JIT credentials, ephemeral secrets, and intent-based authorisation intersect. The agent asks for access based on task context, and the policy engine decides whether the request fits the declared purpose, destination, and risk level. The Guide to SPIFFE and SPIRE is useful here because it shows how workload identity can be issued and verified without depending on long-lived static credentials.

Security teams should also correlate runtime events across the agent, the workload, and the protected resource. If an AI agent moves from summarising data to calling administrative APIs, that shift should trigger policy reevaluation. The Top 10 NHI Issues highlights why visibility gaps and poor ownership create blind spots that make this kind of governance difficult. In parallel, research into DeepSeek breach conditions shows how exposed secrets and overly broad access can become compounding failures once systems are operational.

Issue time-bound credentials per task, then revoke them automatically when the task ends.

Evaluate policy at request time using context such as tool, target, data sensitivity, and expected action.

Log the agent identity, the workload identity, and the decision outcome in one event stream.

Escalate to human approval when the request crosses a pre-defined blast-radius threshold.

This guidance tends to break down in highly dynamic microservice environments where service meshes, legacy IAM, and unmanaged API keys all coexist because policy decisions become fragmented across too many enforcement points.

Common Variations and Edge Cases

Tighter runtime control often increases operational overhead, so organisations have to balance containment against latency, engineering friction, and policy sprawl. There is no universal standard for every agentic workflow yet, so best practice is evolving rather than settled. For lower-risk agents, coarse-grained RBAC may still be acceptable if it is paired with short-lived credentials and strong telemetry. For higher-risk autonomous systems, RBAC alone is usually too blunt because it cannot express intent, task scope, or changing context.

Edge cases include multi-agent pipelines, delegated tool use, and agents that initiate actions based on external prompts. In those environments, the safest pattern is to separate identity from authority: prove what the agent is, then decide what it may do right now. That is consistent with the Ultimate Guide to NHIs — What are Non-Human Identities and the broader standards discussion in Ultimate Guide to NHIs — Standards. Where secrets are still used, they should be short-lived and tightly bound to workload identity rather than embedded in the agent runtime.

Teams handling regulated workloads should also align with audit expectations early, since runtime denial, approval, and revocation events may need to be explained later. The point is not to eliminate autonomy, but to make autonomy observable, bounded, and reversible.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Targets unsafe agent autonomy and excessive tool use at runtime.
CSA MAESTRO	GOV-02	Covers governance for autonomous AI workflows and approval controls.
NIST AI RMF	GOVERN	Maps to accountability, transparency, and policy for AI operations.

Define approval gates, oversight, and escalation paths for every high-risk agent action.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams govern AI and workload identities at runtime?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group