Subscribe to the Non-Human & AI Identity Journal

Why do AI agents change the way organisations think about zero trust?

AI agents can operate continuously, act at machine speed, and influence multiple systems without waiting for a human decision at each step. That breaks static trust assumptions. Zero trust for AI therefore needs continuous verification, traceable identity, and least privilege for the agent itself, not just the human who requested it.

Why This Matters for Security Teams

AI agents do not fit neatly into human-centric zero trust assumptions because they can execute continuously, chain tools, and make decisions faster than a review cycle can keep up. The practical shift is from trusting a user session to governing an autonomous workload with its own identity, runtime context, and revocation path. That is why current guidance increasingly treats agent identity, not just human identity, as the security boundary, as reflected in the OWASP Agentic AI Top 10 and the NIST AI Risk Management Framework.

Security teams often underestimate how quickly an agent can move from a narrow task to broader system impact once it has tool access. NHIMG research on AI LLM hijack breach cases shows that compromise is rarely confined to the original prompt or workflow; it tends to spread through credentials, connectors, and automation paths. In practice, many security teams encounter lateral movement and privilege escalation only after an agent has already reused access in ways no approval chain anticipated.

How It Works in Practice

zero trust for ai agents starts by treating the agent as a distinct workload identity, not a proxy for the human requester. That means proving what the agent is at runtime through cryptographic identity, then evaluating what it is trying to do against policy in the moment. In mature designs, the agent receives short-lived, task-scoped access, and each tool call is checked against context such as dataset sensitivity, destination system, time window, and prior actions. This is where standards such as NIST SP 800-207 Zero Trust Architecture and implementation approaches like the Guide to SPIFFE and SPIRE become operationally useful.

For AI agents, the control stack usually includes:

  • Workload identity for the agent, separate from human IAM and service accounts.
  • Just-in-time credentials with short TTLs and automatic revocation after task completion.
  • Policy-as-code for real-time authorization decisions, rather than static role grants.
  • Detailed logs that preserve prompt, tool, and decision provenance for investigation.

This model matters because long-lived secrets are especially risky in agentic environments. NHIMG’s State of Secrets in AppSec research highlights how fragmented secrets management already creates operational exposure, and agent workflows amplify that problem by accelerating reuse. The emerging best practice is to minimise standing access, issue ephemeral secrets only when a task requires them, and revoke them automatically when the task ends. These controls tend to break down when agents are given broad connector access in shared production environments because the policy context becomes too coarse to distinguish one safe action from the next.

Common Variations and Edge Cases

Tighter agent controls often increase orchestration overhead, so organisations have to balance safety against latency, developer friction, and operational complexity. There is no universal standard for every deployment pattern yet, especially where agents collaborate across internal and third-party tools, but the direction of travel is clear: static RBAC is usually too blunt for goal-driven systems, while context-aware controls are better aligned to real runtime risk. The CSA MAESTRO agentic AI threat modeling framework is useful here because it frames agent risk as a system property rather than a single control failure.

Edge cases matter most when agents operate across multiple tenants, inherit legacy service accounts, or sit behind RAG pipelines and API gateways that were never designed for autonomous action. In those environments, zero trust has to include containment boundaries, explicit tool allowlists, and runtime guardrails that can interrupt unsafe behavior mid-execution. That is also why NHIMG’s OWASP NHI Top 10 and Analysis of Claude Code Security both emphasise runtime visibility over static assumptions. Best practice is evolving, but the consistent lesson is that an agent should never be trusted simply because the human who launched it was trusted.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A03 Agentic access risks demand runtime controls beyond static roles.
CSA MAESTRO MAESTRO models agentic systems as dynamic threat surfaces needing containment.
NIST AI RMF AI RMF supports governance, monitoring, and accountability for autonomous systems.

Apply AI RMF governance to own agent behavior, logging, and response decisions.