What breaks when policy says one thing and the agent executes another?

The governance model breaks because approval and enforcement are no longer aligned. If the policy layer allows a constrained task but the runtime layer still lets the agent chain tools, widen data access, or propagate actions to other agents, then the control is declarative rather than operational. That creates a false sense of containment.

Why This Matters for Security Teams

When policy says one thing and an agent does another, the problem is not just a misconfiguration. It means the security model has split into two layers: declarative approval and operational execution. That gap is especially dangerous for autonomous software entities that can chain tools, expand scope, and act faster than review loops can react. Current guidance from the OWASP Agentic AI Top 10 and NIST AI Risk Management Framework treats that mismatch as a governance failure, not a minor policy bug.

For NHI programs, the issue often shows up when long-lived secrets, broad RBAC grants, or permissive tool scopes remain in place after the policy has been tightened. NHI Mgmt Group research shows that Top 10 NHI Issues includes excessive privilege as a recurring theme, and that pattern becomes far more dangerous when the workload is an agent with goal-driven behaviour rather than a fixed service. In practice, many security teams encounter the failure only after an agent has already propagated actions into systems no one intended it to touch.

How It Works in Practice

The practical fix is to make authorisation runtime-aware and identity-bound. For autonomous agents, static RBAC is usually too blunt because it assumes predictable access patterns. Instead, policy should be evaluated at the moment of action, with context about the task, target system, data sensitivity, and intended outcome. That is where intent-based authorisation and policy-as-code become important. The direction of travel in CSA MAESTRO agentic AI threat modeling framework and the OWASP Top 10 for Agentic Applications 2026 is clear: the control plane must understand what the agent is trying to do, not only who it nominally is.

That usually means three control shifts:

Issue JIT credentials and short-lived secrets per task, then revoke them automatically when the task ends.
Bind the agent to a workload identity, such as cryptographic identity with SPIFFE or OIDC-backed claims, so the system can prove what the agent is and what it may do.
Evaluate policy at request time using context-aware rules, rather than assuming pre-approved roles will remain safe throughout the workflow.

This also matters for propagation. If one agent can delegate to another, the downstream permissions must not inherit a broader trust envelope than the original task justified. NHI Mgmt Group’s OWASP NHI Top 10 material aligns with that concern, especially where chained actions and secret exposure overlap. These controls tend to break down in multi-agent pipelines with shared memory and shared tool tokens because the runtime cannot reliably distinguish delegated intent from runaway execution.

Common Variations and Edge Cases

Tighter runtime control often increases orchestration overhead, so organisations have to balance containment against operational speed. That tradeoff is most visible in systems that need frequent tool access, long-running workflows, or cross-domain actions. In those environments, best practice is evolving rather than settled: there is no universal standard for every agent pattern yet, but the direction is consistent across NIST AI Risk Management Framework and the OWASP Agentic AI Top 10: privilege must shrink as the task narrows.

There are also edge cases where policy and execution can appear aligned while still being unsafe. A low-risk policy may still fail if the agent can access a broad context window, retrieve secrets from memory, or invoke another agent with a more permissive identity. NHI Mgmt Group’s AI LLM hijack breach analysis is a reminder that hidden execution paths matter as much as written policy. The right question is not whether access was approved once, but whether each action remained justified at the moment it happened.

In hybrid human-and-agent workflows, the best practical pattern is to combine ZSP, JIT credentials, and explicit step-up approval for sensitive actions. That reduces blast radius without pretending autonomous behaviour is fully predictable.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Agent tool misuse and policy mismatch are core agentic AI risks.
CSA MAESTRO		MAESTRO focuses on modelling and constraining agentic execution paths.
NIST AI RMF	GOVERN	Govern function covers accountability for autonomous AI behaviour.

Model agent workflows, then enforce controls at each decision and tool boundary.

What breaks when policy says one thing and the agent executes another?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group