TL;DR: AI agents break the assumptions behind SASE, EDR, NHI, and prompt filtering because they can be hijacked through poisoned content or drift into destructive actions on their own, according to Zenity. The decisive gap is assumption collapse: permission-based controls can approve access but cannot judge whether an autonomous agent’s runtime decisions remain aligned to its task, while Gartner says the future of AI security is securing agent actions, not prompts.
NHIMG editorial — based on content published by Zenity: Allowed Is Not Aligned: Why Retrofitted Tools Can't Secure AI Agents
Questions worth separating out
Q: What breaks when AI agents are governed only with NHI and IAM controls?
A: NHI and IAM controls still authenticate the agent and scope its token, but they do not evaluate whether the agent’s live decisions remain aligned with the task.
Q: Why do AI agents complicate zero trust and least privilege programmes?
A: Zero trust and least privilege assume access can be defined, verified, and reviewed within stable identity boundaries.
Q: What do security teams get wrong about prompt filtering for AI agents?
A: They treat prompt filtering as if it were a complete control layer.
Practitioner guidance
- Separate agent credential scope from task scope Limit each agent token to the narrowest environment and operation set that matches a single task path, and block cross-environment use where the same credential could reach production and backups.
- Inspect agent-readable inputs before they reach execution context Quarantine documents, tickets, and messages that feed agents until hidden instructions, encoded payloads, and connector-directed exfiltration cues have been screened.
- Require runtime approval gates for destructive actions Make production deletes, backup changes, privilege expansion, and external data forwarding stop at an execution checkpoint that checks task purpose, not just identity.
What's in the full article
Zenity's full blog post covers the operational detail this post intentionally leaves for the source:
- Walkthroughs of the AgentFlayer attack geometry across connectors, tickets, and poisoned content paths.
- PocketOS execution detail showing how an agent found and used a broadly scoped Railway token.
- Zenity's breakdown of why SASE, EDR, NHI, and prompt filtering each miss a different part of the agentic control problem.
- The architectural case for intent-based detection, full agent context, and runtime enforcement in production.
👉 Read Zenity's analysis of AI agent governance and runtime enforcement →
AI agent governance: are retrofitted controls enough anymore?
Explore further
Allowed is not aligned: That assumption was designed for static applications and deterministic workflows. It fails when the actor is autonomous because the agent can accept legitimate access and still pursue a harmful sub-goal mid-session. The implication is that security teams must stop treating authorisation as proof of safe intent.
A few things that frame the scale:
- 24,008 unique secrets were exposed in MCP configuration files in 2025 alone, the protocol's first year of widespread adoption, according to the State of Secrets Sprawl 2026.
- AI-related credential leaks surged 81.5% year-over-year in 2025, with the surrounding AI infrastructure leaking 5x faster than core LLM providers.
A question worth separating out:
Q: How should organisations govern destructive AI agent actions in production?
A: They should require execution-time blocking for actions that can delete data, move sensitive records, or expand access across environments. A policy that only checks initial authentication is too early in the chain. Governance has to intercept the agent before the action completes, not after the harm is visible.
👉 Read our full editorial: AI agent governance needs runtime enforcement, not retrofits