AI security tools miss cloud identity risk across ML attack phases

By NHI Mgmt Group Editorial TeamPublished 2026-05-21Domain: Agentic AI & NHIsSource: Orca Security

TL;DR: AI security tools split coverage across model robustness, prompt safety, notebook hygiene, privacy leakage, and post-exploitation testing, but they still leave cloud IAM, storage permissions, and shadow AI exposure open, according to Orca Security. The real gap is not feature breadth but whether teams can see attack paths across identities, workloads, and infrastructure before production exposure becomes operational risk.

At a glance

What this is: This is an analysis of how AI security tools map to ML attack phases, and the key finding is that most tools stop at the model layer while cloud identity and infrastructure risk remain exposed.

Why it matters: It matters because IAM, cloud security, and NHI programmes need a shared view of AI workloads, or teams will secure models while missing the credentials and permissions that make those models reachable.

By the numbers:

Shadow AI deployments affect 40% of enterprise cloud environments per the 2024 Gartner AI TRiSM market survey.
The 2026 Infrastructure Identity Survey found that 70% of organisations grant AI systems more access than they would give a human employee performing the exact same job.
Systems with least-privileged AI access had a 17% incident rate vs 76% for over-privileged systems.

👉 Read Orca Security's analysis of AI security tools and cloud identity gaps

Context

AI security is not a single control layer. It spans model behaviour, notebook hygiene, prompt safety, privacy testing, and the cloud permissions that let those workloads run. The problem in this article is that many tools only see one slice of that stack, so security teams can get a false sense of coverage while IAM, storage exposure, and shadow AI remain outside the scan boundary.

For practitioners, the governance question is whether AI security is being treated as a model-testing exercise or as an identity and infrastructure problem. The article is strongest when it shows that attack phases begin before the model is touched and continue after the model is compromised, which is exactly where IAM and NHI teams need to be in the room.

Orca Security frames the issue around production fit, which is the right lens for enterprise adoption. A tool that cannot see execution roles, endpoint exposure, or untracked AI services may still be useful, but it is not sufficient for governing AI workloads end to end.

Key questions

Q: How should security teams evaluate AI security tools for production use?

A: Security teams should evaluate AI security tools by the attack phase they cover, the cloud controls they can see, and how easily they fit into existing DevOps and SOC workflows. A tool that tests models but cannot inspect IAM roles, storage permissions, or shadow AI will leave production risk intact even if it looks comprehensive on paper.

Q: Why do AI workloads create gaps in traditional cloud security models?

A: AI workloads combine model behaviour with cloud identities, storage, and runtime exposure, so traditional tools that focus on one layer miss the full chain. The gap appears when an execution role, bucket permission, and exposed endpoint together create a reachable attack path that no single alert captures.

Q: What breaks when shadow AI is not part of the asset inventory?

A: When shadow AI is absent from inventory, security teams cannot apply policy, logging, access review, or remediation to the workload. That means the service may process sensitive data and expose credentials without ever entering the governance process, which turns discovery into a control boundary, not just a reporting issue.

Q: What is the difference between model testing and cloud AI posture management?

A: Model testing evaluates whether the AI behaves safely under adversarial input, while cloud AI posture management evaluates whether the workload is reachable, overprivileged, or exposed in infrastructure. Both matter, but only posture management can see the IAM and network conditions that make the model exploitable in production.

Technical breakdown

ML attack phases and where AI security tools stop

The article maps AI risk to an attack lifecycle that starts with reconnaissance and initial access, then moves through model manipulation, data poisoning, and post-exploitation. That structure is useful because it separates model testing from environment exposure. A tool like adversarial robustness testing may tell you whether a model is resilient to crafted inputs, but it does not tell you whether the training job can read a bucket it should not, or whether an inference endpoint is reachable from the wrong network zone. The technical mistake is treating model safety as infrastructure safety.

Practical implication: evaluate tool coverage by attack phase, not by feature count, and verify which layer each control actually reaches.

Cloud AI security posture and IAM exposure

Cloud AI security posture management looks at the permissions, storage access, network paths, and runtime settings around AI workloads. That includes execution roles for training jobs, policies on data buckets, and endpoint exposure for inference services. In practice, the attack chain often starts with identity or configuration failure, not with the model itself. If the IAM role is over-scoped, the workload inherits unnecessary reach. If the endpoint is exposed, any downstream model protection becomes less meaningful because access control has already failed at the perimeter of the workload.

Practical implication: review AI workload IAM roles and endpoint policies with the same rigor used for other high-value cloud services.

Shadow AI detection and attack-path analysis

Shadow AI is any AI service, model, or SDK running in cloud without security team awareness. That matters because undiscovered assets cannot be governed, logged, or risk-prioritised. Attack-path analysis then connects individual findings that look moderate in isolation, such as an overprivileged execution role, an open storage bucket, and an unauthenticated endpoint. The technical value is correlation. Security teams do not need more isolated alerts. They need a path view that shows how identity, storage, and runtime exposure combine into a real compromise path.

Practical implication: add asset discovery and attack-path correlation before relying on point findings from any AI security tool.

Azure Key Vault privilege escalation exposure — Azure Key Vault Contributor role misconfiguration enabled privilege escalation.
Cisco DevHub NHI breach — IntelBroker exploited exposed Cisco credentials, API tokens and keys in DevHub.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

AI security tooling is still organised around single layers, while real exposure lives in the chain between them. The article shows that model robustness, prompt safety, notebook scanning, privacy testing, and post-exploitation simulation each cover a distinct phase, but none on their own govern the full AI workload surface. That leaves IAM, storage, and endpoint exposure as the practical control plane for many incidents. The implication is that security teams should stop asking which AI tool is best and start asking which attack phase remains invisible.

Shadow AI is the named concept this market is still underestimating. Shadow AI is not just an inventory problem. It is a governance failure where AI services, model SDKs, and inference endpoints exist outside the security team’s control boundary, so no policy, review, or exception process can reach them. The article’s own data point that 40% of enterprise cloud environments contain at least one untracked AI service shows how quickly this becomes a blind spot. Practitioners should treat discovery as a prerequisite, not an add-on.

Attack-path analysis is the right editorial frame because isolated findings do not describe operational risk. An overprivileged role, exposed bucket, and reachable endpoint can each look tolerable alone, yet combine into a serious exposure path. That is exactly where NIST CSF and cloud security posture thinking intersect with NHI governance. The implication is that remediation should be prioritised by chained reach, not by the loudest single alert.

AI workload governance is becoming an identity problem as much as a model problem. The article’s core lesson is that identity, network exposure, and runtime access determine whether AI security controls matter in production. When teams only test the model, they miss the control failures that allow the model to be reached, fed, or exfiltrated from. Practitioners should therefore align AI security reviews with IAM and cloud governance operating rhythms, not leave them inside specialist ML teams alone.

Production fitness is the differentiator that separates useful AI security from interesting AI security. Tools that are hard to operationalise, slow to deploy, or detached from cloud workflows will not reduce risk at scale. The market is moving toward controls that integrate into DevOps, SOC, and compliance processes because that is where AI exposure is discovered and remediated. Teams should evaluate deployment friction as part of security effectiveness, not as an afterthought.

From our research:
Shadow AI deployments affect 40% of enterprise cloud environments per the 2024 ESG Report: Managing Non-Human Identities style research pattern, according to the 2024 ESG Report: Managing Non-Human Identities.
The 2026 Infrastructure Identity Survey found that 53% of security leaders expect AI to run major portions of their infrastructure autonomously within the next three years.
That is why OWASP NHI Top 10 is useful here for mapping AI workload exposure to identity-driven attack paths.

What this signals

Shadow AI has moved from a discovery issue to a governance issue. With 19% of organisations giving AI systems dramatically more access than human employees, per the 2026 Infrastructure Identity Survey, the governance problem is no longer whether AI exists in the environment. It is whether the identities behind those systems are being treated as production access subjects with reviewable scope and accountable ownership.

AI workload risk should be folded into cloud identity operating rhythms. The practical boundary is not the model file. It is the execution role, the bucket policy, the endpoint policy, and the discovery process that keeps untracked AI services from slipping past control coverage. Teams that keep AI security separate from IAM will continue to miss the path that matters most.

Security programmes should treat AI security posture as a cross-domain control surface rather than a tool category. That means discovery, access control, and attack-path analysis need to converge before the environment is mature enough for confidence in production AI.

For practitioners

Map each AI security tool to a single attack phase. Document whether the tool covers model behaviour, notebook hygiene, privacy leakage, post-exploitation, or cloud posture, and refuse to count overlapping claims as full coverage. Use that map to find the phase that still has no control owner, especially around cloud infrastructure and identity.
Inventory shadow AI before tuning detections. Use cloud discovery to find untracked AI services, SDKs, notebooks, and endpoints across AWS, Azure, and GCP. A workload that is not in the asset inventory cannot be attached to an IAM review, a policy exception, or a remediation path.
Correlate IAM, storage, and endpoint findings into attack paths. Prioritise the combination of execution roles, data bucket permissions, and inference exposure rather than scoring each issue in isolation. This is where medium findings become a critical chain and where the highest-risk AI workloads become visible.
Align AI security reviews with cloud and identity governance cycles. Bring AI workload posture checks into the same review rhythm as access governance, CI/CD policy checks, and cloud security operations. That reduces the chance that AI controls remain a specialist activity with no operational owner.

Key takeaways

AI security tools are fragmented by design, so no single tool covers the full ML attack chain from reconnaissance to lateral movement.
The biggest production gap is not model weakness alone but cloud identity, storage, and endpoint exposure around AI workloads.
Teams should prioritise discovery, attack-path correlation, and IAM review before they assume AI security coverage is complete.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	AGENT-03	Tool coverage and misuse map to agentic AI attack phases.
OWASP Non-Human Identity Top 10	NHI-01	AI workloads rely on non-human identities and secret-bearing access.
NIST CSF 2.0	PR.AC-4	Least-privilege access and cloud controls are central to the article.

Map AI security findings to attack phases and verify which runtime behaviours remain ungoverned.

Key terms

Shadow AI: Shadow AI is any AI service, model, or SDK operating in an environment without security team awareness or control. It is a governance problem because undiscovered systems cannot be inventoried, reviewed, logged, or assigned access boundaries, which leaves risk outside normal identity and cloud controls.
Cloud AI security posture management: Cloud AI security posture management is the continuous assessment of IAM, storage, network, and runtime settings around AI workloads. It focuses on the environment that hosts the model, not just the model itself, so teams can see whether access, exposure, and configuration create a reachable attack path.
Attack path analysis: Attack path analysis connects separate weaknesses into a realistic compromise route. In AI environments, it shows how overprivileged identities, exposed data stores, and reachable endpoints combine into an exploit chain that is more dangerous than any single finding rated in isolation.

Deepen your knowledge

AI workload identity exposure and shadow AI detection are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If your programme already spans cloud IAM and machine identities, this course is a useful fit.

This post draws on content published by Orca Security: an analysis of AI security tools and how they map to ML attack phases in production environments. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-21.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org