Subscribe to the Non-Human & AI Identity Journal

How should security teams govern agent tool calls in production?

Treat each tool invocation as a separate identity decision, not as a one-time permission granted to the agent. The control model should validate current context, the specific tool being called, and whether the action matches the intended task. This is the only way to keep delegated agent activity auditable and bounded.

Why This Matters for Security Teams

Agent tool calls are not just API requests. In production, each call can move data, trigger workflows, or change infrastructure, which means the security decision has to be made at the moment of execution. Static RBAC is too blunt for autonomous software that can chain tasks, switch context, and pursue goals in ways the original prompt did not anticipate. Current guidance from OWASP Top 10 for Agentic Applications 2026 and NIST AI Risk Management Framework both point toward runtime controls, not trust-by-default. That aligns with OWASP NHI Top 10, which treats overbroad identity authority as a core risk for agentic systems.

The practical issue is that a tool call can be technically valid and still be operationally wrong. A file read may be fine for summarisation, but unsafe if it exposes secrets, and a ticket update may be permitted for one workflow but harmful in another. In practice, many security teams encounter excessive agent privilege only after a tool chain has already crossed an environment boundary or touched a sensitive secret.

How It Works in Practice

Security teams should govern tool calls as a sequence of ephemeral authorisation decisions. The agent first proves workload identity, then requests a short-lived capability for a specific action, and finally submits the call with context that describes intent, target resource, and risk sensitivity. That is closer to CSA MAESTRO agentic AI threat modeling framework than classic IAM, because the control point is the action itself, not a one-time login.

Strong implementations usually combine three layers. First, workload identity establishes what the agent is, ideally with cryptographic identity for the runtime rather than shared secrets. Second, policy-as-code evaluates the request in real time, using task context, tool sensitivity, data classification, and environment. Third, JIT credentials or scoped tokens expire quickly and are revoked as soon as the task ends. This reduces the blast radius of compromised prompts, malformed plans, or tool misuse. Where organisations still rely on long-lived keys, agent behaviour becomes harder to bound and impossible to inspect cleanly.

A useful operational pattern is to classify tools by risk: read-only, bounded write, and privileged. Read-only tools can be broadly available within context, while privileged tools should require explicit approval, step-up policy, or human confirmation. NHI teams should also log the prompt fragment, policy decision, token issuance, and tool result so audits can reconstruct why a call was allowed. This is also where Ultimate Guide to NHIs — Lifecycle Processes for Managing NHIs becomes practical, because lifecycle control is what keeps agent permissions from becoming standing access.

These controls tend to break down when agents are allowed to broker their own credentials across multiple tools because the policy boundary gets blurred after the first delegation.

Common Variations and Edge Cases

Tighter tool governance often increases latency and operational overhead, so organisations have to balance safety against workflow friction. That tradeoff is real, especially when agents support customer-facing automations or developer productivity use cases that cannot tolerate long approval chains. Best practice is evolving here, and there is no universal standard for how much human approval is required at each risk tier.

One edge case is tool chaining across multiple services. A single safe-looking call can become unsafe when the agent uses the output to drive a second or third action, which is why action-by-action controls matter more than initial prompt filtering. Another edge case is secret handling: if an agent can retrieve secrets from a vault, it should receive only short-lived, narrowly scoped access and never persistent credentials. The AI LLM hijack breach and Moltbook AI agent keys breach both reinforce the operational cost of treating agent access as static.

For high-risk environments, current guidance suggests combining NIST AI Risk Management Framework with Top 10 NHI Issues to decide where JIT, approval gates, and stronger logging are mandatory. This is especially important when the agent can act autonomously, because the more goal-driven the workload becomes, the less useful static role design remains.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A1 Agent tool calls create prompt and action abuse paths the framework targets.
CSA MAESTRO GO-02 MAESTRO models agent governance as a runtime trust problem, not static IAM.
NIST AI RMF AIRMF governs accountability, validity, and monitoring for AI system actions.

Bind each tool call to runtime policy checks and deny actions outside the current task.