Why do AI systems create identity and data risk beyond the model itself?

Why This Matters for Security Teams

AI systems expand the attack surface because they depend on identities, tokens, data pipelines, and retrieval layers that sit outside the model weights themselves. That means a prompt injection or model error is often only the visible symptom; the real compromise usually involves cloud access, API credentials, storage permissions, or logging gaps. Guidance from the NIST Cybersecurity Framework 2.0 still applies, but AI makes those trust boundaries more dynamic and harder to inspect.

NHIMG research shows how quickly these failures become operational incidents. In the LLMjacking: How Attackers Hijack AI Using Compromised NHIs analysis, exposed AWS credentials were targeted within an average of 17 minutes. That pace matters because AI services frequently chain together multiple NHIs, each with its own scope and secrets lifecycle. In practice, many security teams discover AI-driven exposure only after a credential has already been used to reach storage, retrieval, or downstream systems.

How It Works in Practice

The practical risk comes from the service path around the model. A typical AI workload may use one identity to call the model, another to fetch retrieval content, a third to write traces and telemetry, and a fourth to invoke external tools or internal APIs. If any of those identities are overprivileged, long-lived, or shared across environments, the model becomes a high-speed path into existing infrastructure rather than a standalone asset. The Ultimate Guide to NHIs — Key Challenges and Risks is useful here because it frames NHI sprawl as a governance problem, not just a secrets-management problem.

Security teams should treat the AI stack as a chain of trust:

Give each agent, service, and pipeline step its own workload identity.

Prefer short-lived tokens and JIT access over static API keys and shared service accounts.

Log tool use, retrieval access, and outbound calls separately from model prompts.

Apply least privilege to data sources, not just to the application container.

Revoke access automatically when a task, session, or workflow completes.

Standards work in this area is still evolving, but policy-as-code and runtime authorization are becoming the practical baseline. Zero Trust principles from NIST CSF 2.0 help, yet AI systems often need context-aware decisions at request time because the same agent can read data, transform it, and then call a tool in a single workflow. These controls tend to break down in legacy environments with shared service principals, broad storage buckets, and indirect tool orchestration because identity boundaries disappear across the workflow.

Common Variations and Edge Cases

Tighter identity control often increases operational overhead, requiring organisations to balance blast-radius reduction against deployment speed and observability. That tradeoff is real, especially for development teams that rely on reusable pipelines or multi-tenant platforms. Best practice is evolving, and there is no universal standard for how fine-grained AI identity segmentation should be across every workload.

Some environments need additional nuance. Batch inference jobs may tolerate broader read access if the data is pre-approved and isolated. Multi-agent systems, by contrast, can create hidden privilege chains when one agent passes context or tool outputs to another. Retrieval-augmented systems also deserve special attention because the retrieval layer often becomes the real target, not the model endpoint itself. NHIMG’s 52 NHI Breaches Analysis and Top 10 NHI Issues both reinforce the same pattern: compromised machine identities repeatedly become the shortest path to data exposure. The main edge case is regulated or air-gapped environments, where access can be narrow but logging and revocation are still incomplete, leaving limited visibility when AI workflows go wrong.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10		AI systems widen risk through tool use, retrieval, and agent chains.
CSA MAESTRO		MAESTRO addresses security for multi-step agentic workflows and trust boundaries.
NIST AI RMF		AI RMF covers governance and risk across the full AI service path.

Inventory agent workflows, isolate tool permissions, and monitor cross-agent trust links.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Why do AI systems create identity and data risk beyond the model itself?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group