What breaks when an agent can delegate work to another agent?

When one agent delegates to another, the accountability chain becomes part of the security problem. If the downstream service cannot see the original authority, it cannot tell whether the action was properly inherited, expanded, or misused. That makes delegation governance necessary for review, audit, and incident investigation.

Why This Matters for Security Teams

Delegation changes the trust model. Once an agent can hand work to another agent, the security question is no longer only “who called this API?” It becomes “who authorised the chain, what context survived the handoff, and did the downstream agent inherit only the minimum authority needed?” That is where static IAM, broad service accounts, and human-style approval paths start to fail.

Current guidance suggests treating agent-to-agent delegation as a distinct control problem, not a variant of human delegation. The issue shows up when an upstream agent can expand scope through tool chaining, swap context across sessions, or invoke a downstream agent that cannot verify the original intent. That is why the control surface must include runtime policy, workload identity, and traceable delegation metadata, as discussed in the OWASP Agentic AI Top 10 and NHI governance research from Ultimate Guide to NHIs — 2025 Outlook and Predictions. In practice, many security teams encounter delegation abuse only after an incident review reveals that no one could reconstruct the original authority chain.

How It Works in Practice

Agent delegation should be governed as a runtime authorization event, not as a one-time role assignment. The upstream agent needs a narrow, task-specific authority that can be passed as a verifiable claim, then re-evaluated by the downstream agent or policy layer before any action is taken. That means the downstream service should see both the agent’s workload identity and the delegated context, not just a generic token with broad scope.

Practitioners usually combine four building blocks:

Workload identity for each agent, so the system can prove what the agent is, not just what secrets it holds.
Just-in-time, short-lived credentials that expire with the task and limit replay after delegation ends.
Context-aware authorization that checks purpose, data sensitivity, target system, and chain depth at request time.
End-to-end audit trails that preserve the originating agent, the delegating agent, the downstream recipient, and the policy decision.

That model aligns with the runtime policy direction reflected in the CSA MAESTRO agentic AI threat modeling framework and the governance expectations in the NIST AI Risk Management Framework. The practical goal is to make delegation inspectable at every hop, so a downstream agent can reject inherited authority that is too broad, stale, or unrelated to the current task. These controls tend to break down in multi-agent pipelines that use shared service identities, because the chain of custody disappears the moment one generic credential is reused across several autonomous steps.

Common Variations and Edge Cases

Tighter delegation controls often increase orchestration overhead, requiring organisations to balance traceability against latency and developer friction. That tradeoff becomes more visible when agents operate across teams, clouds, or vendors, where policy consistency is harder to maintain and there is no universal standard for delegation semantics yet.

One common edge case is delegated delegation, where Agent A hands work to Agent B, which then hands part of it to Agent C. Without explicit limits on chain depth, scope reduction, and session TTL, the original authority can drift far beyond the initiating intent. Another edge case is partial inheritance, where the downstream agent receives only a subset of context and cannot tell whether missing data is deliberate or a control failure.

Another practical issue is incident response. If logs record only the final action and not the delegation path, investigators lose the ability to distinguish malicious abuse from legitimate escalation. That is why the best practice is evolving toward signed delegation tokens, policy-as-code checks, and minimal claim propagation, as reinforced by the Analysis of Claude Code Security and the NHI breach patterns in the Moltbook AI agent keys breach. Guidance breaks down fastest in environments that still rely on long-lived shared credentials because no downstream agent can reliably prove which authority was actually delegated.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A1	Agent-to-agent delegation can expand authority beyond intended scope.
CSA MAESTRO	TRUST	MAESTRO addresses trust decisions across autonomous agent chains.
NIST AI RMF		AI RMF governs accountability, traceability, and risk in autonomous systems.

Map delegation flows to AI RMF controls and preserve auditable accountability for every hop.

What breaks when an agent can delegate work to another agent?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group