What should organisations do when an AI agent delegates work to other agents?

Organisations should require a traceable delegation chain that records the initiating identity, each handoff, each tool call, and the business purpose of downstream access. Without that chain, teams cannot determine whether the final action still matched the original authorization or whether privilege expanded silently as the workflow progressed.

Why This Matters for Security Teams

Delegation is where agentic risk stops looking like a simple access-control problem and starts behaving like a supply-chain problem. Once an AI agent can spawn sub-agents, call tools, or pass tasks across models, the original request can drift away from the original permission set. Static RBAC is usually too coarse for that reality, which is why current guidance increasingly points to NIST AI Risk Management Framework style governance and agent-specific controls in OWASP NHI Top 10. The practical issue is not only who started the workflow, but whether each downstream actor still has a legitimate business purpose.

That matters because AI agents are autonomous and goal-driven: they can select tools, chain actions, and continue operating after the human user has moved on. NHI teams should treat each delegation hop as a new trust decision, not as an extension of the first one. The 2026 OWASP Agentic AI Top 10 and CSA MAESTRO agentic AI threat modeling framework both reinforce this shift toward runtime control rather than trust by design. In practice, many security teams discover delegation creep only after a downstream agent has already touched sensitive systems, rather than through intentional design.

How It Works in Practice

The safest pattern is to make delegation explicit, short-lived, and auditable. Each agent should present a workload identity, not just a reusable secret, and every handoff should mint a fresh, task-scoped credential with a narrow TTL. That is the closest practical analogue to JIT access for autonomous systems. Where possible, authorisation should be intent-based and evaluated at runtime, so the policy engine checks what the agent is trying to do, the dataset or system it is targeting, and whether the downstream action still matches the approved business purpose.

A useful operating model is:

Bind the initiating user, the primary agent, and each sub-agent to distinct workload identities.
Require a delegation token or signed context bundle for every hop.
Record the tool name, scope, input, output, and justification at each step.
Re-evaluate policy before every sensitive tool call, rather than trusting the original session.
Revoke credentials automatically when the task ends or the purpose changes.

This approach aligns with the direction in AI LLM hijack breach analysis and the broader DeepSeek breach lessons: static secrets and overbroad access are a liability when software can act on its own. It also fits the operational logic of NIST AI Risk Management Framework because accountability, traceability, and monitoring must exist at decision time, not only in post-incident review. These controls tend to break down in multi-tenant orchestration platforms where delegated work is fanned out asynchronously and the original purpose context is lost between queues.

Common Variations and Edge Cases

Tighter delegation controls often increase orchestration overhead, requiring organisations to balance security against latency, developer friction, and incident-response complexity. That tradeoff is real, and best practice is still evolving for multi-agent systems that dynamically recruit sub-agents across vendors or MCP-connected tools. The key question is whether every downstream step can prove both identity and purpose without relying on a long-lived shared secret.

There is no universal standard for this yet, but current guidance suggests three common exceptions. First, low-risk internal tasks may justify simpler logging, but not the reuse of standing privileges. Second, agents that only summarise or classify data may not need the same depth of delegation tracing as agents that can write, purchase, delete, or exfiltrate. Third, some environments will prefer policy-as-code gates at the gateway layer, while others will enforce them in the tool itself. In all cases, the delegation chain should survive audits and incident response, especially when a sub-agent inherits access to secrets, certificates, or API keys. The Moltbook AI agent keys breach shows why that matters in real deployments, while MITRE ATLAS adversarial AI threat matrix remains useful for mapping how delegated agents can be turned into lateral-movement tools. The current standard is not perfect, but the operational baseline is clear: if a delegated agent cannot explain why it had access, the organisation probably should not have granted it.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Agent delegation can expand authority beyond intended scope.
CSA MAESTRO		MAESTRO models agent-to-agent trust, delegation, and tool abuse.
NIST AI RMF		AI RMF governance covers accountability, traceability, and monitoring.

Assign ownership for delegated agent actions and retain auditable decision logs end to end.

What should organisations do when an AI agent delegates work to other agents?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group