Subscribe to the Non-Human & AI Identity Journal

How should security teams govern AI agent orchestration across multiple systems?

Security teams should govern AI agent orchestration by mapping every agent, connector, and handoff to a clear owner, entitlement scope, and approval boundary. The key is to manage delegated action paths, not just sign-in events. That means tying observability, access reviews, and lifecycle controls to the full workflow, including APIs, memory, and downstream tool use.

Why This Matters for Security Teams

Agent orchestration changes the control problem from “who signed in” to “what actions were delegated across which systems, and under whose authority.” That shift matters because a single agent can chain connectors, memory, and API calls into a workflow that crosses trust boundaries without a human present at each step. Security teams that keep treating orchestration as a normal access-management problem miss the real risk: delegated execution can spread faster than approvals or reviews can keep up.

Current guidance from the OWASP Agentic AI Top 10 and the NIST AI Risk Management Framework both point toward runtime governance, context-aware policy, and explicit accountability for agent behaviour. In NHI practice, that means mapping every connector, token, and workflow handoff to a business owner and an expiry boundary. It also means knowing where secrets live, because the average remediation time for a leaked secret is 27 days, according to The State of Secrets in AppSec from GitGuardian & CyberArk.

In practice, many security teams discover orchestration risk only after an agent has already moved through several systems and generated a trail that is hard to unwind.

How It Works in Practice

Governance starts by treating the agent as a workload identity, not a user. That identity should be bound to a specific orchestration path, a defined purpose, and short-lived credentials that expire when the task ends. Where possible, use workload identity primitives such as OIDC tokens or SPIFFE/SPIRE-style attestations so policy decisions can verify what the agent is and what it is allowed to do at request time. For agentic systems, static RBAC is usually too blunt because the access pattern is dynamic; current best practice is moving toward intent-based authorization and policy-as-code that evaluates each tool call in context.

That runtime model should extend across the full chain: prompt input, planning, tool selection, API invocation, memory access, and downstream system writes. A practical control stack often includes:

  • ownership for each agent, connector, and external system integration
  • JIT credential issuance with short TTLs and automatic revocation
  • approval gates for high-impact actions such as payouts, deletions, or privileged configuration changes
  • central logging for tool calls, token minting, and cross-system handoffs
  • continuous review of agent permissions as workflows change

The State of Non-Human Identity Security shows why this matters: 85% of organisations lack full visibility into third-party vendors connected via OAuth apps, which is exactly the kind of blind spot orchestration can amplify. The CSA MAESTRO agentic AI threat modelling framework and the NIST Cybersecurity Framework 2.0 both reinforce the need for lifecycle control, monitoring, and governance tied to the operating context.

These controls tend to break down in highly federated environments where each system uses different token formats, inconsistent logs, and separate approval chains.

Common Variations and Edge Cases

Tighter orchestration control often increases latency and operator overhead, so organisations have to balance speed against the risk of delegated misuse. That tradeoff is especially visible in multi-agent pipelines, where one agent can pass context to another and expand the blast radius if boundaries are not explicit. There is no universal standard for this yet, so guidance remains evolutionary rather than settled.

One common edge case is retrieval-heavy or memory-driven agents. If long-lived conversation memory can trigger future actions, then memory becomes part of the access surface and must be governed like a privileged data store. Another is cross-domain orchestration, where an agent can read from one system and write to another. In those cases, the safest pattern is to separate read, reason, and act functions, and require fresh authorisation for each write path. Security teams should also expect that vendor-managed connectors may obscure the true handoff path, which is why NHIMG’s Top 10 NHI Issues remains a useful reference for prioritising lifecycle, monitoring, and rotation controls.

For higher-risk environments, the question is not whether agents can be trusted, but whether every delegated action can be traced, bounded, and revoked before it becomes a multi-system incident.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A2 Agentic workflows need runtime action control, not static sign-in checks.
CSA MAESTRO M2 MAESTRO focuses on threat modelling and governance for agentic orchestration.
NIST AI RMF GOVERN AI RMF governance supports accountability for autonomous, cross-system agent behavior.

Evaluate each agent tool call at runtime and block actions outside the approved intent.