By NHI Mgmt Group Editorial TeamPublished 2026-04-06Domain: Agentic AI & NHIsSource: EnforceAuth

TL;DR: NVIDIA’s OpenShell may secure runtime execution for AI agents, but it does not answer who authorises agent actions, what data they may touch, or how those decisions are continuously governed across enterprise systems, according to EnforceAuth. The core failure is assumption-based: sandboxing does not equal authorization, and machine-speed agents break control models built on static access decisions.


At a glance

What this is: This is an analysis of why sandboxed AI agent runtimes still leave a governance gap, with the key finding that runtime controls do not determine what an agent is authorised to do.

Why it matters: It matters because IAM, PAM, and NHI programmes must govern AI agents as active identity actors, not just as isolated runtimes with restricted execution environments.

👉 Read EnforceAuth's analysis of the AI agent authorization gap at enterprise scale


Context

AI agent authorization is the decision layer that determines what an autonomous or semi-autonomous agent may do, with which data, and under which policy. The article argues that enterprise attention has shifted too far toward runtime containment and not far enough toward decision authority, which leaves a gap between what an agent can technically reach and what it is actually authorised to do.

That gap matters for NHI, agentic AI, and human governance programmes because machine-speed actions need continuous policy evaluation, not just a secure sandbox. If a control model stops at startup permissioning or infrastructure restrictions, it can miss the actual risk boundary: the actions the agent is permitted to take across business systems, records, and compliance contexts.


Key questions

Q: How should security teams govern AI agents that can act at machine speed?

A: Security teams should govern AI agents as non-human identities with explicit owners, scoped entitlements, and per-action policy checks. Session-start approval is not enough when an agent can chain multiple actions before human review occurs. The practical test is whether the programme can explain and enforce each sensitive action, not just the initial login or runtime container boundary.

Q: Why do sandboxed runtimes not solve AI agent authorization risk?

A: Sandboxed runtimes limit where an agent executes, but they do not decide what the agent is permitted to do inside enterprise systems. If the agent has broad API, workflow, or data permissions, the real risk remains. Authorization risk persists whenever identity authority is separated from runtime containment and the entitlement model is too coarse to govern actions.

Q: What breaks when AI agent permissions are only reviewed at session start?

A: What breaks is the assumption that access remains stable long enough to be reviewed after the fact. Machine-speed agents can complete multiple actions, cross tool boundaries, and touch sensitive records before a human review cycle begins. That leaves governance with incomplete evidence and no effective intervention point for the most consequential actions.

Q: Who is accountable when an AI agent exceeds its authorised scope?

A: Accountability sits with the organisation that delegated the access and failed to govern the agent’s decision authority. The key question is whether the business can trace the owner, policy, and approval chain for each action. If those elements are unclear, the governance failure is structural, not merely operational.


Technical breakdown

Why runtime sandboxing does not solve authorization

A sandbox constrains execution environment, not business authority. An AI agent can be isolated from the host OS and still retain broad access to enterprise APIs, customer records, or workflow actions if those privileges are granted elsewhere. That is why runtime safety and authorization are separate layers. The runtime answers where code may execute; authorization answers what the identity may do. For agentic systems, those two questions cannot be collapsed into one control plane without creating blind spots.

Practical implication: separate runtime containment from policy enforcement and verify that agent permissions are evaluated at the action layer, not only at session start.

Authorization governance for AI agents and non-human identities

AI agents behave like non-human identities when they are granted enterprise access, even if they are wrapped in an application or platform runtime. In practice, that means governance must track identity, entitlement, data scope, and approval context together. A policy that merely blocks files or network paths does not answer whether the agent can query a customer record, trigger a financial workflow, or call a downstream tool. For IAM teams, the key design issue is not visibility alone, but decision authority over each action path.

Practical implication: model AI agents as governed NHIs and map their entitlements to the same lifecycle, review, and privilege controls used for other machine identities.

Continuous verification across machine-speed actions

Continuous verification is the missing bridge between static approval and real-world agent behaviour. A machine-speed agent may complete many actions before a human review cycle begins, which means start-of-session checks are not enough. This is especially relevant where records, policies, or business workflows change during execution. The control problem is therefore temporal as well as technical. Identity governance has to follow the agent through the session, not just validate it before the first tool call.

Practical implication: require per-action policy evaluation and event logging for agent activity so authorisation can be enforced across the full execution chain.


Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.


NHI Mgmt Group analysis

Authorization has become the real control plane for AI agents. Runtime sandboxes reduce exposure, but they do not decide whether an agent may read a customer record, initiate a workflow, or pass data to another service. That means the critical governance question has moved above infrastructure and into decision authority, where IAM and NHI controls belong. Practitioners should treat authorization as the primary enforcement boundary for agentic systems.

Sandboxing without entitlement governance creates a false sense of containment. An agent can be boxed in at the OS level and still hold excessive application privileges through APIs, workflow tokens, or delegated service access. That separation between infrastructure restraint and identity authority is where many programmes will misread risk. The practitioner conclusion is straightforward: if entitlements are not measured and governed, the sandbox only limits where the agent runs, not what it is allowed to change.

Decision-centric authorization is now a necessary NHI design pattern. The article’s central point is not that AI agents are unsafe by default, but that machine-speed actors require runtime policy decisions tied to identity context. That aligns with OWASP-NHI and zero trust thinking more than with traditional application hardening alone. Teams should understand that identity governance now has to mediate action, not simply grant access.

Fortune 500-scale adoption turns the authorization gap into a category problem. Once AI agents are embedded across major enterprise platforms, the issue is no longer a point control failure. It becomes a repeatable governance model that either exists or does not. The implication for the field is that NHI governance must absorb agentic behaviour without pretending infrastructure guardrails are sufficient.

Policy continuity across the session is the named gap this article exposes. Static access decisions were designed for actors whose privileges remain stable long enough to be reviewed. That assumption fails when an AI agent can act repeatedly at machine speed, crossing tools and data boundaries before a review cycle can react. The implication is that authorization models must be rethought around action-time governance, not just provisioning-time grants.

From our research:

  • 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems (39%), inappropriately sharing sensitive data (31%), and revealing access credentials (23%), according to AI Agents: The New Attack Surface report.
  • Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation, according to the same report.
  • For a broader framework view, see the Ultimate Guide to NHIs for lifecycle, visibility, and privilege governance patterns that apply across machine identities.

What this signals

Authorization is becoming the differentiator between safe deployment and unmanaged exposure. As AI agents spread across enterprise platforms, teams should expect policy enforcement to move closer to the action layer rather than remain at the platform perimeter. The practical signal is whether identity governance can keep pace with machine-speed decisions without relying on human review as the last line of defence.

Policy continuity is the named concept practitioners should watch. It describes whether a control model can preserve authorization context across a session in which an agent makes multiple decisions, uses multiple tools, and changes state repeatedly. If that continuity is missing, the programme may have logs and guardrails, but not durable governance. For a standards lens, align this work with the NIST AI Risk Management Framework and the OWASP Agentic AI Top 10.


For practitioners

  • Map AI agents as governed non-human identities Inventory every agent that can access enterprise systems, then assign an identity owner, entitlement scope, and review cadence. Treat the agent as a governed executor rather than a feature embedded inside a platform.
  • Separate runtime controls from authorization controls Document which controls constrain execution environment and which controls decide whether an action is allowed. Require both layers before production use, especially where agents can reach customer data or business workflows.
  • Enforce per-action policy evaluation Move beyond session-start approval and require policy checks before each sensitive action, including data retrieval, record updates, and downstream tool calls. Keep an immutable audit trail for every decision point.
  • Review delegated access paths into business systems Trace how agents inherit permissions from service accounts, workflow tokens, or embedded app credentials. Reduce broad delegated access and require explicit approval for high-impact records, transactions, and compliance-sensitive data.
  • Test whether your controls can explain agent decisions Ask whether your IAM and GRC tooling can show who approved the access, which policy applied, and why the agent was allowed to act. If the answer is unclear, the governance model is incomplete.

Key takeaways

  • AI agents expose an authorization problem, not just a runtime safety problem, because infrastructure controls do not decide business access.
  • The scale is already material, with most organisations reporting agent behaviour beyond intended scope and limited ability to audit what the agents touch.
  • The control that matters most is per-action authorization governed as NHI policy, not one-time approval at session start.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Agentic AI Top 10AGENT-03Agent runtime decisions and tool use raise the exact governance gap discussed here.
OWASP Non-Human Identity Top 10NHI-03AI agents function as NHIs once they receive enterprise credentials and delegated access.
NIST AI RMFThe article is about governing AI behaviour and accountability across enterprise systems.

Assign owners, scope entitlements tightly, and review agent credentials on the same cadence as other machine identities.


Key terms

  • Authorization layer: The authorization layer is the control point that decides whether an identity may perform a specific action on a specific resource. For AI agents, it must operate at runtime and reflect policy context, not just the permissions granted when the session begins.
  • Decision-centric authorization: Decision-centric authorization is a governance model that evaluates each action before it happens, using identity, context, and policy together. For autonomous or machine-speed actors, it is more reliable than static access grants because it tracks behaviour as it unfolds.
  • Policy continuity: Policy continuity is the ability to preserve and enforce the same authorization logic across a full session, even as an agent takes many actions or changes tools. In agentic environments, weak continuity creates a gap between the first approval and the later actions that actually matter.
  • Non-human identity: A non-human identity is any machine-issued or machine-used identity that accesses systems without being a person. That includes service accounts, API keys, tokens, certificates, bots, workloads, and AI agents, all of which require ownership, scope, and lifecycle governance.

Deepen your knowledge

AI agent authorization and non-human identity governance are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for machine-speed actors in a similar environment, it is worth exploring.

This post draws on content published by EnforceAuth: Who Decides What an AI Agent Is Authorized to Do? Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-04-06.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org