How should security teams expose APIs to AI systems without creating unsafe access paths?

Why This Matters for Security Teams

Exposing APIs to AI systems is not the same as exposing them to a human operator or a conventional service account. An agent can chain calls, reinterpret prompts, and pursue a goal in ways that defeat static allowlists if the surface is too broad. The practical risk is not just overpermissioned access, but uncontrolled composition of otherwise legitimate endpoints into unsafe workflows. That is why the OWASP Non-Human Identity Top 10 and NHIMG guidance both emphasise bounded delegation rather than raw API exposure, especially when credentials are reusable or long-lived.

The issue is also operational, not purely architectural. In the The State of Secrets in AppSec research from GitGuardian and CyberArk, 43% of security professionals said they are concerned about AI systems learning and reproducing sensitive information patterns from codebases. That concern is justified when AI systems can browse broad endpoints, infer hidden capability paths, or reuse leaked context across tasks. Security teams should assume that an agent will explore whatever the interface makes possible, even if that path was never intended as a business workflow. In practice, many teams discover unsafe delegation only after an agent has already exercised a capability chain that no reviewer modeled in advance.

How It Works in Practice

The safest pattern is to expose AI systems to purpose-built workflows, not a general-purpose API catalog. Each workflow should represent one business outcome, such as “create a support ticket,” “retrieve order status,” or “submit a reimbursement request,” with the system enforcing what inputs are allowed, what side effects are possible, and which approval gates still apply. This aligns with emerging agentic AI guidance in the Anthropic report on an AI-orchestrated cyber espionage campaign, which shows how autonomous tool use can become a control problem when the model is able to drive its own sequence of actions.

Implementation usually includes four layers:

Intent-based authorisation at request time, so the API checks the specific action, target resource, and context before execution.

JIT credential issuance, so the agent receives only short-lived secrets for the exact task and nothing reusable beyond the workflow window.

Workload identity for the agent, using cryptographic identity rather than static shared secrets, so the platform can prove what the requester is before granting access.

Policy-as-code, so approvals, denies, and constraints are evaluated dynamically rather than encoded as static role memberships.

NHIMG research on LLMjacking highlights why this matters: when attackers obtain AI-related credentials, they move quickly from exposure to abuse. The same design pressure applies to legitimate agentic integrations, because broad APIs make it too easy for a model to escalate from one harmless call to a chain of sensitive actions. These controls tend to break down when the environment relies on a single shared service token across multiple tools, because the agent can reuse that token far beyond the original business intent.

Common Variations and Edge Cases

Tighter API scoping often increases integration overhead, requiring organisations to balance developer convenience against the risk of giving agents too much procedural freedom. That tradeoff is real, and current guidance suggests it is better to add explicit workflows than to let an AI system infer hidden paths from a broad endpoint set. There is no universal standard for how much autonomy should be delegated to an agent, but the safest boundary is usually the one that can be described, approved, and revoked per task.

Some environments need special handling. Read-only access is not automatically safe if the data can be combined into sensitive inferences, and write access is not automatically unsafe if the workflow is tightly constrained and fully audited. Internal admin APIs, billing systems, and identity stores deserve stricter treatment because an agent that reaches one of them can often fan out into many more. NHIMG’s 52 NHI Breaches Analysis is a useful reminder that identity abuse often starts with one weakly governed credential and expands into a broader control failure. For teams still maturing their design, the Ultimate Guide to NHIs is a practical reference point for separating identity, access, and workflow scope. Best practice is evolving, but the core principle is stable: expose the outcome, not the entire tool surface.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Addresses unsafe tool access and agentic abuse paths.
CSA MAESTRO	C2	Covers safe agent orchestration and delegated action boundaries.
NIST AI RMF		Supports governance for contextual risk and runtime AI controls.

Define bounded tasks, approval gates, and revocation paths before exposing tools to agents.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams expose APIs to AI systems without creating unsafe access paths?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group