What breaks when an AI system can choose tools and actions on its own?

What breaks is the assumption that access can be safely provisioned once and reviewed later. When the system selects tools dynamically, the effective privilege set can expand during the session, and the original approval no longer reflects actual behaviour. Governance has to move from pre-authorisation alone to continuous containment and audit.

Why This Matters for Security Teams

When an AI system can choose tools and actions on its own, the failure is not just “more automation.” The failure is that static approvals stop matching live behaviour. A system that can call APIs, search internal data, open tickets, or trigger workflows may accumulate effective privilege far beyond the original task. That creates a moving target for governance, especially when secrets, tokens, and service accounts are reused across sessions instead of being issued per action.

This is why NHI and agentic ai guidance treats runtime authorisation as the control point, not onboarding alone. The NIST Cybersecurity Framework 2.0 reinforces continuous risk management, but autonomous systems add a sharper problem: the agent can change its own path while still appearing “in policy” at the start. NHIMG research on the LLMjacking threat shows how quickly exposed credentials are abused in the wild, which matters because an agent that can reach tools through broad credentials can be hijacked just as fast.

In practice, many security teams encounter overprivileged agent behaviour only after the agent has already chained tools, touched systems it was never meant to reach, or exposed data through an unintended workflow.

How It Works in Practice

For autonomous systems, the practical shift is from static role assignment to context-aware control at request time. An agent should not receive a long-lived identity that can do everything its orchestration layer might ever decide to attempt. Instead, current guidance suggests using workload identity to prove what the agent is, then issuing short-lived credentials only for the specific task it is about to perform. That means JIT access, tight TTLs, and automatic revocation when the task completes or the context changes.

In agentic environments, the identity primitive is often the workload rather than the human. Cryptographic workload identity, such as SPIFFE-based identities or OIDC-backed service tokens, helps distinguish one agent instance from another and supports policy decisions based on execution context. The policy engine should evaluate each tool invocation at runtime, using policy-as-code patterns such as OPA or Cedar, rather than assuming a fixed role map will remain valid.

Issue per-task credentials, not standing credentials.
Bind tool access to the current goal, data scope, and environment.
Log every tool call with input, output, and policy decision context.
Revoke access automatically when the task ends or confidence drops.

NHIMG’s research on the DeepSeek breach is a reminder that exposed data and embedded secrets can become operational fuel for an agent or attacker once the boundary is lost. This aligns with the broader direction of the NIST Cybersecurity Framework 2.0: continuous monitoring, containment, and recovery matter more when the system can choose its next move. These controls tend to break down when multiple agents share one service account and the orchestration layer cannot distinguish which sub-action created which privilege request.

Common Variations and Edge Cases

Tighter runtime control often increases operational overhead, requiring organisations to balance autonomy against latency, policy complexity, and developer friction. That tradeoff is real, especially when agents must complete multi-step workflows across internal and third-party systems. Best practice is evolving, and there is no universal standard for every agent pattern yet.

One common edge case is delegation chains. An agent may call another agent, which then calls tools on its behalf. If each hop inherits broad permissions, the original least-privilege design collapses. Another edge case is event-driven automation, where a tool is triggered by a queue or webhook and the identity context is weak or missing. In those systems, teams often need stronger containment boundaries, narrow broker permissions, and explicit approval gates for sensitive actions.

There is also a difference between read-only reasoning and action-capable execution. A model that merely drafts recommendations is not the same risk as one that can write files, transfer funds, or modify production systems. The policy should reflect that difference, and continuous audit should record when an agent crosses from suggestion to action. Current guidance suggests that agentic AI controls work best when identity, authorisation, and observability are designed together rather than bolted on after deployment.

In practice, the hardest failures appear in high-churn environments where agents are updated frequently, tool inventories change weekly, and no single owner can explain which runtime permission was needed for which action.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A05	Covers unsafe agent tool use and uncontrolled actions at runtime.
CSA MAESTRO	TRM	Addresses runtime trust and control for autonomous agent workflows.
NIST AI RMF	GOVERN	Governing function fits accountability for autonomous AI decision-making.

Enforce continuous trust evaluation and revoke access when agent context changes.

What breaks when an AI system can choose tools and actions on its own?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group