Enterprise AI security risks expose identity and data control gaps

By NHI Mgmt Group Editorial TeamPublished 2026-05-22Domain: Agentic AI & NHIsSource: Orca Security

TL;DR: Enterprise AI security risks cluster around testing gaps, explainability limits, data exposure, adversarial manipulation, supply-chain weakness, and shadow AI, according to Orca Security. The core issue is that AI features inherit cloud identity, data, and governance failures faster than most programmes can inventory or control them.

At a glance

What this is: This is an analysis of recurring enterprise AI security risks, showing that models, prompts, datasets, and integrations behave like governed assets only when security teams map them to identity, data, and cloud controls.

Why it matters: It matters because AI workloads often sit inside ordinary cloud accounts, so IAM, DSPM, and governance teams must treat model access, tool use, and shadow adoption as part of the same control plane.

By the numbers:

When AWS credentials are exposed publicly, attackers attempt access within an average of 17 minutes.

👉 Read Orca Security's analysis of enterprise AI security risks and cloud governance

Context

AI security risk is not a prompt-only problem. Large language models, copilots, and agentic features run on ordinary cloud identities, data stores, and APIs, which means they inherit the same access sprawl, misconfiguration, and exposure paths as other workloads. For IAM and security leaders, the control question is not whether AI is novel, but whether the underlying identities and data paths are governed at the same standard as every other production system.

Orca Security’s framing is useful because it ties model behaviour back to cloud reality: inventories, permissions, storage, and third-party dependencies. That matters for NHI, agentic AI, and human governance programmes alike, because unmanaged access and unseen usage patterns become harder to distinguish once AI is embedded in everyday work. Shadow AI makes that gap worse by moving data outside approved control boundaries before teams can even classify the risk.

Key questions

Q: How should security teams govern AI workloads that run in ordinary cloud accounts?

A: Treat AI workloads as governed production services, not isolated model experiments. Map the workload to its cloud IAM role, storage, logging, network exposure, and downstream tool access. Then classify the data it can reach and the actions it can take. If those elements are unclear, the workload is not ready for production approval.

Q: Why do AI systems create identity and data risk beyond the model itself?

A: Because the model is only one part of the service path. The real risk sits in the identities, APIs, storage, and retrieval layers that feed it and consume its outputs. If those components are overprivileged or poorly logged, AI becomes a multiplier for existing cloud and data exposure rather than a separate problem.

Q: What breaks when shadow AI is not governed?

A: Shadow AI breaks visibility, retention, and accountability at the same time. Data can leave approved environments, legal and privacy controls may never be applied, and security teams lose the ability to review or revoke access. That makes the problem operational, not just policy-driven.

Q: How can organisations stop AI outputs from becoming unsafe actions?

A: Require structured outputs, validation, and human approval for actions that affect customers, money, or access. A model should not be able to trigger downstream systems simply because it produced a plausible answer. The control point is the interface between output and action, not the prompt alone.

Technical breakdown

AI security risk patterns in cloud-hosted model pipelines

Enterprise AI systems fail in recurring ways because the model is only one component in a wider pipeline. The risky parts are the retrieval layer, storage, orchestration, and identity permissions that let data and tools flow into and out of the model. Weak testing, poor explainability, data leakage, adversarial prompts, output abuse, supply-chain compromise, and shadow usage all arise when those surrounding controls are missing or inconsistent. The security model therefore has to cover the whole service path, not just the model endpoint.

Practical implication: Map each AI workload to its cloud identity, data stores, and downstream tools before approving production use.

Why output control and explainability matter in AI governance

Generative systems are variable, which means their outputs cannot be treated like deterministic application logic. If a model can influence a workflow, the organisation needs evidence of which prompts, retrieved sources, tools, and policy checks shaped the outcome. Explainability here is operational rather than academic: investigators need traceable artefacts, and downstream systems need validated, structured outputs. Without that, incident response and accountability both become slower and less reliable.

Practical implication: Require logs, citations, and schema validation wherever AI output can trigger business action.

Supply chain risk and shadow AI as identity problems

AI supply chain risk extends beyond model weights into packages, container images, hosted models, and vendor integrations, while shadow AI creates an unsanctioned identity layer around consumer tools and unsupervised data use. In both cases, the issue is uncontrolled trust. A model or service is only as safe as the dependencies and identities that can reach it, and hidden use often bypasses the review, logging, and retention controls applied to approved systems.

Practical implication: Track AI dependencies and discover unsanctioned tools with the same rigor you use for other production identities.

Threat narrative

Attacker objective: The attacker aims to turn AI-enabled trust paths into data exposure, workflow manipulation, or unauthorized access to internal systems.

Entry occurs when an enterprise AI feature connects to exposed cloud identities, untrusted retrieval content, or unmanaged third-party services that can influence model behaviour.
Credential access or abuse follows when overbroad API scopes, shared service accounts, or embedded secrets let the model pipeline reach sensitive data and internal tools.
Impact appears when poisoned inputs, unsafe outputs, or shadow AI usage cause data leakage, unauthorized actions, or business decisions based on untrusted model behaviour.

Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
Azure Key Vault privilege escalation exposure — Azure Key Vault Contributor role misconfiguration enabled privilege escalation.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

AI security is now an identity governance problem, not a model-only problem. Orca Security’s framing lands because enterprise AI workloads inherit the same cloud identities, storage, and permissions that govern every other service. Once a model can read data, call tools, or act on behalf of a service account, the real control surface is identity and access. Security teams should treat AI services as governed production workloads, not experimental endpoints.

Shadow AI creates an unmanaged identity layer that conventional governance cannot see. Employees will use consumer AI tools when sanctioned options are slower or less convenient, and that behavior moves data outside approved control boundaries. The governance failure is visibility first, not policy wording. Practitioners need a single inventory and approval path for AI tools, because what is unseen cannot be reviewed, risk-ranked, or retired.

Limited testing is a control failure because AI failures emerge in tool paths, not just accuracy scores. The article correctly points to edge cases, retrieval pipelines, and prompt injection, which means test plans must include authentication, authorization, and data isolation between tenants. Static validation does not prove that a model will behave safely when connected to live systems. Security teams should align AI testing with operational abuse cases, not benchmark optimism.

Ephemeral trust debt: AI systems can accumulate access faster than governance cycles can certify it. This is the named concept that best fits the article’s substance. AI features are often approved for a narrow use case, then quietly expand across datasets, APIs, and internal workflows. The implication is that access review logic must be tied to actual service paths and data flows, because entitlement drift in AI happens through integration, not through a single obvious permission grant.

Cross-functional accountability is the only defensible operating model for enterprise AI. Orca Security’s recommendation that legal, security, data, and product share one risk register is consistent with how AI risk crosses ownership lines. When data handling, model behaviour, and downstream business use all intersect, isolated team-by-team governance produces gaps. Practitioners should assume AI risk will be misowned unless the operating model makes it explicit.

From our research:
Two-thirds of enterprises have endured a successful cyberattack resulting from compromised non-human identities, with a quarter encountering multiple attacks, according to The 2024 ESG Report: Managing Non-Human Identities.
Enterprises that have experienced a compromised NHI averaged 2.7 separate incidents in the past 12 months.
For a broader breach pattern view, The 52 NHI breaches Report shows how identity exposure turns into repeated incident chains.

What this signals

Shadow AI will keep expanding until inventory and approval processes are faster than employee workarounds. That is a governance issue, not a tooling issue. Teams that cannot see unsanctioned AI usage cannot classify data residency, retention, or accountability, which means discovery has to become a first-class control rather than a quarterly exercise.

AI risk management is converging with cloud identity governance. Once models live inside ordinary AWS, Azure, or GCP accounts, the relevant controls are IAM, logging, data classification, and dependency management. Practitioners should expect AI findings to land in the same operational queues as other cloud exposures, not in a separate innovation track.

Ephemeral trust debt is the right lens for AI adoption programmes. As AI services accumulate integrations, the access they use often outgrows the access that was originally approved. Teams should prepare for more frequent entitlement reviews, tighter data scoping, and more evidence-driven exception handling across AI services and their connected identities.

For practitioners

Inventory every AI-enabled workload List models, prompts, datasets, vector stores, APIs, and the cloud identities they use. Include SaaS AI features that were purchased indirectly through productivity suites, because they often escape normal application onboarding.
Bind AI risk to existing cloud controls Map each AI service to IAM roles, network exposure, logging, and data classification so it sits in the same backlog as other production workloads. Extend DSPM to training data, embeddings, and retrieval stores.
Test AI abuse cases before release Add adversarial prompt suites, tool-call abuse tests, and tenant-isolation checks to preproduction testing. Cover authentication and authorization paths, not only model accuracy, and record the exact versions, prompts, and datasets used in each run.
Restrict output from becoming action Use schema validation, output filtering, and human approval for high-impact actions such as bulk messaging, pricing changes, or access changes. Treat unvalidated output as untrusted input until it has passed business and security controls.
Detect and reduce shadow AI Publish an approved AI tool list, offer a sanctioned alternative with logging, and use network or endpoint discovery where policy allows. Train employees on what must never be pasted into external chat interfaces.

Key takeaways

Enterprise AI risk is mainly a governance and identity problem because models inherit cloud access, data paths, and vendor dependencies.
The strongest evidence in the source points to recurring failures in testing, explainability, supply-chain control, and shadow usage rather than one-off model bugs.
Practitioners should inventory AI services, bind them to existing cloud controls, and make output validation part of the action path.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	LLM08	Excessive agency and tool abuse map directly to AI output-to-action risk.
OWASP Non-Human Identity Top 10	NHI-03	AI services often rely on service accounts and secrets that need lifecycle control.
NIST CSF 2.0	PR.AA-1	AI access and data use need explicit identity and authorisation governance.

Align AI workloads to identity governance, logging, and data protection under your access control process.

Key terms

Shadow AI: Shadow AI is the use of AI tools or features outside approved governance and inventory. In practice, it creates hidden data flows, retention risks, and access paths that security teams cannot review or revoke because the use was never formally onboarded.
AI workload identity: AI workload identity is the service account, token, or credential set that lets an AI application reach data, APIs, and tools. For governance purposes, it is the trust handle for the entire service, so scope, logging, and lifecycle controls matter as much as the model itself.
Explainability artifact: An explainability artifact is the evidence needed to understand why an AI system produced an outcome. That can include prompts, retrieved sources, tool calls, and policy checks. The goal is not perfect transparency, but enough traceability for audit, incident response, and accountable decision-making.
Ephemeral trust debt: Ephemeral trust debt is the accumulation of temporary AI access, integrations, and permissions that outlast the original approval intent. It matters because AI services can expand quickly across data and tools, leaving governance to catch up after the access pattern has already changed.

Deepen your knowledge

AI security risks in cloud-hosted model pipelines are covered in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for AI services inside existing cloud accounts, it is a relevant starting point.

This post draws on content published by Orca Security: enterprise AI security risks and how they map to cloud governance. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-22.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org