Subscribe to the Non-Human & AI Identity Journal

How should security teams govern agent-to-agent traffic in AI workflows?

Security teams should treat agent-to-agent traffic as a governed identity path, not as ordinary internal API chatter. Apply authentication, rate limits, logging, and authorization at the hop boundary so each agent call is evaluated independently. That approach preserves visibility when workflows split into specialized agents and prevents trust from silently carrying forward across the chain.

Why This Matters for Security Teams

Agent-to-agent traffic is not ordinary east-west traffic. Each hop can change the task, context, and privilege surface, so a trusted upstream agent should not automatically imply trusted downstream access. That matters because workflows often fan out into specialist agents that can chain tools, call other services, or hand off secrets in ways a static IAM model never anticipated. Guidance from the OWASP Agentic AI Top 10 and NHI research such as OWASP NHI Top 10 both point to the same operational risk: trust propagates too easily unless it is re-evaluated at every boundary.

Security teams also need observability that can distinguish normal orchestration from unsafe escalation. The NIST AI Risk Management Framework emphasizes governance and monitoring, while NHIMG research shows how quickly visibility gaps turn into exposure, especially where identity sprawl and third-party connections are involved. In practice, many security teams encounter agent-to-agent misuse only after a workflow has already overreached, rather than through intentional design.

How It Works in Practice

Governing agent-to-agent traffic starts at the hop boundary. Each request should be authenticated, authorized, logged, and rate-limited as a distinct event, even if it comes from an earlier trusted agent in the same workflow. The practical goal is to avoid inheriting privilege across the chain. Current best practice is to treat every agent as a workload identity, then evaluate its intent and context at runtime using policy-as-code and short-lived credentials.

That means the control plane should know four things for every hop: who the calling agent is, what task it is attempting, what data or tool it is requesting, and whether the destination agent is allowed to accept that request in this context. Standards and implementation guidance from CSA MAESTRO agentic AI threat modeling framework and MITRE ATLAS adversarial AI threat matrix support this runtime view of risk. For workload identity, teams increasingly use cryptographic proof such as OIDC-based assertions or SPIFFE-style identities, because the identity must represent the agent itself, not just the container or the API key it borrowed.

  • Issue per-task, short-lived credentials instead of static shared secrets.
  • Log hop-by-hop identity, intent, payload classification, and policy decision.
  • Apply rate limits and circuit breakers to prevent runaway recursive calls.
  • Revoke access automatically when the task completes or the context changes.

NHIMG research on The State of Non-Human Identity Security highlights the visibility and logging gaps that make this difficult in real deployments, especially when agent chains cross teams, tools, or vendors. These controls tend to break down when agents can dynamically delegate to other agents without a central policy checkpoint because the system loses sight of the effective caller.

Common Variations and Edge Cases

Tighter hop-level control often increases orchestration overhead, so organisations have to balance stronger containment against latency, developer friction, and policy maintenance. There is no universal standard for agent handoff semantics yet, so current guidance suggests starting with the most sensitive flows first: agents that can trigger financial actions, touch customer data, or call privileged administrative tools.

One common edge case is delegated tool use. If an agent passes a subtask to another agent, the downstream agent should not inherit the upstream agent’s entire authority. Another is cross-domain workflows, where one agent runs inside a business app and another in infrastructure automation. In those cases, policy must account for data sensitivity, request origin, and whether the receiving agent is permitted to act on behalf of the original workflow. For broader operational context, NHIMG’s Ultimate Guide to NHIs — Lifecycle Processes for Managing NHIs is useful, and the NIST Cybersecurity Framework 2.0 remains a practical anchor for governance, detection, and response.

Teams should also watch for recursive or self-referential agent loops, where one agent repeatedly calls another to evade rate limits or widen access. Those patterns require explicit policy bounds, not just authentication. The control model works best when each hop is independently reviewable and every exception is time-bound, because multi-agent systems fail fastest when trust becomes implicit rather than explicit.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A2 Agent-to-agent trust propagation is a core agentic authorization risk.
CSA MAESTRO T3 MAESTRO covers agent handoffs, orchestration, and runtime policy control.
NIST AI RMF AI RMF governance and monitoring apply to dynamic multi-agent workflows.

Assign owners, monitor agent chains, and review runtime controls for unsafe escalation.