TL;DR: Anthropic and OpenAI’s enterprise guidance shows that production AI agents succeed with simple composable patterns, layered guardrails, and explicit tool-risk controls, while enterprise teams still struggle with evaluation, security, and delegation across systems, according to WorkOS’s analysis of the two guides. The deeper issue is that traditional IAM assumes stable, reviewable access, but agentic systems can act, branch, and delegate within one session.
At a glance
What this is: This is an independent analysis of enterprise AI agent guidance that finds production readiness depends less on model sophistication and more on composable workflows, layered guardrails, and tool-aware security.
Why it matters: It matters because IAM, NHI, and human access programmes now have to govern systems that make decisions and take actions across multiple services, which changes how privilege, auditability, and control boundaries are defined.
By the numbers:
- Only 44% of developers are reported to follow security best practices for secrets management, exposing a significant developer behaviour gap.
👉 Read WorkOS’s analysis of enterprise AI agent playbooks and MCP security
Context
Enterprise AI agent programmes are no longer limited by model quality alone. The main governance problem is that agents can choose actions across systems, which means existing IAM and application security assumptions about bounded, reviewable execution do not hold cleanly once the workflow becomes agent-driven.
In practice, that shifts the security question from whether a model can answer well to whether the surrounding identity controls can constrain what the agent can touch, when it can act, and how its delegated authority is audited. That is why AI agent identity risk now sits at the intersection of NHI governance, Zero Trust Architecture, and lifecycle control.
Key questions
Q: How should security teams govern AI agents that can take actions across multiple systems?
A: Security teams should govern AI agents as delegated identities with explicit tool scope, session visibility, and audit requirements. The control question is not whether the model can reason, but whether every action path is bounded, attributable, and revocable. If the agent can send messages, update records, or trigger downstream workflows, those capabilities need separate authorisation and monitoring.
Q: Why do AI agents complicate existing IAM controls?
A: AI agents complicate IAM because they do not behave like stable human users or fixed service accounts. They can chain actions, change paths mid-session, and operate across systems that were never designed for agent-timed delegation. That makes static entitlement review and periodic certification a poor fit unless the programme adds runtime visibility and constrained tool access.
Q: What breaks when organisations treat agent workflows like ordinary automation?
A: What breaks is the assumption that the workflow is fully predetermined and therefore safe to govern with simple rules. Agentic systems can choose among tools, reorder steps, and expand their own execution path based on context. That means control design has to account for non-deterministic action selection, not only for scheduled or script-based execution.
Q: Who is accountable when an AI agent makes the wrong decision in production?
A: Accountability should remain with the organisation that authorised the delegation, not with the model itself. Practically, that means the business owner, security owner, and platform owner all need clear responsibility for scope, monitoring, and remediation. If no one can state who approved the agent’s authority, the governance model is incomplete.
Technical breakdown
Composable agent patterns reduce complexity but expand governance surfaces
Anthropic’s production patterns, such as prompt chaining, routing, parallelization, orchestrator-workers, and evaluator-optimizer, are not just architecture choices. They change how work is decomposed, which in turn changes where errors, privilege, and audit events appear. Simple compositions are easier to reason about than large multi-agent stacks, but each additional step creates another point where instructions, tool calls, and outputs can diverge from intent. For identity teams, the key issue is not model complexity. It is the growing number of decision and execution boundaries that need to be governed.
Practical implication: Map every agent workflow step to a distinct approval, logging, or containment control before expanding the design.
Tool risk classification is the real security boundary for agents
OpenAI’s guidance separates data tools, action tools, and orchestration tools because each tier carries different identity and blast-radius consequences. Read-only access can still leak sensitive data, but write-capable tools can create permanent business impact, and orchestration tools can compound permissions across multiple downstream agents. This is where NHI controls matter most: credentials, scopes, and audit trails must reflect what the tool can do, not just what the agent intends. Without that distinction, the identity layer becomes a passive wrapper around active risk.
Practical implication: Classify every tool by reversibility, data exposure, and business impact, then bind access to the lowest possible scope.
MCP turns agent identity into an enterprise integration problem
Model Context Protocol gives agents a structured way to connect to tools and data sources, but it also creates a new authentication and authorization challenge. The enterprise problem is no longer just user login. It is proving which agent is acting, what delegation authority it has, and how that authority is constrained across sessions and enterprise systems. Traditional OAuth and session assumptions were built for human-paced interaction, not for agent workflows that may span long-running, multi-system tasks.
Practical implication: Treat MCP endpoints as privileged integration points and require explicit agent identity, delegation, and audit controls before production use.
Breaches seen in the wild
- Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
- AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
Enterprise AI agents collapse the assumption that access is stable long enough to govern through normal IAM cadences. The article shows agents taking actions on behalf of users across multiple systems, which means access can be acquired, used, and re-used inside a single workflow. That breaks the governance premise behind periodic access review and static entitlement reasoning. The implication is that identity programmes must stop assuming agent access behaves like human access.
Tool-aware governance is now the decisive control plane for agentic systems. The article’s split between data tools, action tools, and orchestration tools is the right conceptual boundary because the blast radius changes with each tool class. A read operation, a write operation, and a delegated sub-agent are different identity events, not the same thing with different labels. Practitioners should treat tool classification as a first-order governance decision, not a documentation exercise.
Agent identity is becoming a cross-domain problem that sits between NHI, Zero Trust, and application security. Workflows built from prompt chaining or orchestrator-workers can look like automation, but the security burden is identity, delegation, and traceability. That means agent governance cannot live only in application teams or only in IAM teams. The programme has to connect runtime behaviour, enterprise authentication, and session control into one model.
Runtime evaluation is a governance control, not just a model-quality exercise. The article’s emphasis on iterative evaluation and layered guardrails shows that production readiness depends on observing behaviour, not trusting initial configuration. For identity teams, this is the same logic as privileged access monitoring, just applied to agents that can choose actions dynamically. The practitioner conclusion is simple: if behaviour is not continuously measurable, it is not governable.
MCP makes delegation authority visible, which also makes hidden trust decisions easier to expose. Once agents connect to enterprise systems through a protocol layer, the real question becomes who authorised the agent to act, under what constraints, and with what audit evidence. That is a lifecycle problem as much as a runtime problem. Teams should use this shift to re-evaluate how delegation is issued, reviewed, and revoked across both machine and human actors.
From our research:
- Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation, according to AI Agents: The New Attack Surface report.
- 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems, inappropriately sharing sensitive data, and revealing access credentials.
- Read OWASP Agentic AI Top 10 for the control patterns that map directly to tool misuse, prompt injection, and agent scope drift.
What this signals
Identity teams should expect agent governance to move from policy drafting to runtime containment. The practical problem is not defining intent once, but proving that the agent stays inside that intent as it chains tools and systems together. With 80% of organisations reporting AI agents already acting beyond intended scope, the gap is no longer hypothetical.
Tool-class awareness will become a standard design requirement for agent programmes. A read-only connector, a write-enabled workflow, and a delegated orchestration path cannot share the same identity assumptions. That is why agent adoption will force security architecture reviews to look more like privilege design than software feature selection.
As programmes mature, the most defensible posture will combine explicit delegation, continuous behaviour review, and hard boundaries around high-risk tools. Teams that already treat NHI as a lifecycle and privilege problem will adapt faster than those still framing agent security as a chatbot moderation issue.
For practitioners
- Inventory agent workflows by decision boundary Document where the agent makes a choice, where it calls a tool, and where it hands work to another component. Use that map to identify which steps require logging, approval, or hard stops before production deployment.
- Classify tools by blast radius Separate read-only data tools from write-capable action tools and from orchestration tools that can trigger downstream agents. Assign stronger identity controls to tools that can modify records, send messages, or fan out privilege.
- Bind agent access to explicit delegation records Require each agent session to carry a recorded delegation context that states who authorised it, what systems it may reach, and when that authority expires. Reconcile those records against audit logs and session telemetry.
- Add behavioural evaluation to agent governance Test whether the agent stays within intended scope under prompt injection, unexpected inputs, and tool chaining. Use those tests as part of access acceptance, not only as model QA.
Key takeaways
- AI agents change the identity problem because they can choose, chain, and execute actions across systems, not just answer requests.
- Production readiness depends on layered guardrails, tool-specific controls, and visible delegation, not on model sophistication alone.
- IAM teams should treat agent governance as a runtime privilege problem that spans NHI, application security, and enterprise authentication.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | A1 | Covers agent tool misuse and prompt injection risks raised by this article. |
| NIST AI RMF | Addresses governance and accountability for AI systems with runtime decisions. | |
| NIST Zero Trust (SP 800-207) | PR.AC-4 | Fits the need for continuous verification of agent access across systems. |
Apply least privilege and continuous verification to agent sessions and delegated access.
Key terms
- Agentic workflow: An agentic workflow is a task sequence where software can decide which actions to take, which tools to call, and when to execute them within a bounded environment. In identity terms, it creates delegated execution that must be governed like privilege, not just like automation.
- Tool risk profile: A tool risk profile describes the likelihood and impact of harm if an agent uses a particular function incorrectly or maliciously. Read-only tools, write tools, and orchestration tools have different blast radii, so access control and audit depth should scale with the tool’s business effect.
- Delegation authority: Delegation authority is the right for one identity to act on behalf of another within a defined scope. For agents, it must be explicit, time-bound, and auditable because the system may initiate actions independently once granted access.
- Runtime guardrail: A runtime guardrail is a control that checks behaviour while the agent is operating, rather than only before deployment. It can evaluate prompts, outputs, tool calls, and session patterns to keep the system inside its authorised boundary.
Deepen your knowledge
AI agent governance, delegation control, and runtime privilege boundaries are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for agentic systems alongside service accounts and workloads, it is worth exploring.
This post draws on content published by WorkOS: Enterprise AI Agent Playbook: What Anthropic and OpenAI Reveal About Building Production-Ready Systems. Read the original.
Published by the NHIMG editorial team on 2025-07-23.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org