TL;DR: AI adoption is outpacing AI security and governance, with IBM reporting that 97% of businesses suffering an AI-related breach had no proper AI access controls in place, while Cyera argues that extending DSPM into AI workflows is the practical way to restore visibility over data, access, and usage. The underlying issue is not just data sprawl, but governance built for static environments now being asked to control dynamic AI workflows.
At a glance
What this is: DSPM for AI extends data security posture management into AI workflows to expose where sensitive data lives, who can access it, and how it is being used.
Why it matters: It matters because IAM, data governance, and security teams now have to control sensitive data flows across AI systems, shadow AI, and multi-cloud environments without losing auditability or least-privilege discipline.
By the numbers:
- 97% of businesses that suffered an AI-related breach reported they had no proper AI access controls in place.
- 72% of data breaches involved data stored in cloud environments, and 30% of breached data spanned multiple computing environments.
- 67% of executives intend to budget for protections around AI models.
👉 Read Cyera's full research on DSPM for AI and enterprise data governance
Context
DSPM for AI is about applying data security posture management to AI systems so organisations can continuously discover, classify, and govern sensitive data as it moves through training, inference, and storage. The core problem is simple: AI workflows consume data faster than traditional security programmes can map access, purpose, and exposure.
That gap matters for IAM because AI systems are now data consumers with broad access across cloud, SaaS, and hybrid environments, while shadow AI introduces unmanaged usage outside approved controls. The result is a governance problem that spans data protection, access control, and compliance evidence at the same time.
Cyera’s article treats DSPM as the control plane for AI data risk, but the broader lesson is that security teams can no longer separate data governance from identity governance in AI-enabled environments. Once prompts, outputs, and training sets become operational assets, access policy and data policy have to move together.
Key questions
Q: How should security teams govern sensitive data used in AI workflows?
A: Security teams should treat AI workflows as governed data paths, not just model activity. That means classifying training and prompt data, applying purpose-based access rules, monitoring outputs in real time, and enforcing retention and minimisation policies across sanctioned and shadow tools. The goal is to keep identity, data, and audit evidence aligned as the workflow changes.
Q: Why do shadow AI tools create such a large governance gap?
A: Shadow AI creates a gap because sensitive data can move into tools that are outside normal approval, logging, and retention controls. Once that happens, identity governance no longer matches actual usage, and security teams lose visibility into who accessed which data and why. That breaks accountability and makes compliance evidence incomplete.
Q: What breaks when DSPM only covers static data stores?
A: When DSPM stops at static repositories, it misses the highest-risk part of AI use: data in motion through prompts, outputs, and training flows. That leaves over-permissioned access, untracked reuse, and policy violations invisible until after the fact. In practice, the programme can look compliant while AI users are still exposing sensitive material.
Q: Should organisations prioritise AI data governance before scaling AI adoption?
A: Yes. Organisations that scale AI before establishing discovery, classification, monitoring, and policy enforcement are effectively expanding the attack surface faster than they can govern it. AI adoption should be matched with controls that follow the data lifecycle, otherwise compliance, exposure, and misuse risks compound as usage grows.
Technical breakdown
AI-aware data discovery and classification
Traditional discovery tools were built for structured repositories and predictable ownership patterns. DSPM for AI has to identify sensitive information across structured, semi-structured, and unstructured datasets, then classify it by sensitivity, regulatory scope, ownership, and business purpose. That matters because AI training and inference often blend data types that look harmless in isolation but become risky when combined. Classification is not just labeling. It is the foundation for deciding which datasets can feed models, which prompts deserve inspection, and which data should never leave governed boundaries.
Practical implication: map sensitive AI data sources first, then require classification rules that reflect purpose, not just file type.
Real-time monitoring of AI prompts and outputs
AI interaction monitoring is about seeing what users put into models and what the models return before sensitive content escapes approved boundaries. Unlike static applications, AI systems can expose regulated data through prompts, summaries, or generated responses in real time. That makes event-based monitoring essential, especially where employees use sanctioned tools to process confidential material. Effective monitoring also has to watch for prompt injection and other adversarial inputs that try to steer model behaviour toward unintended disclosure.
Practical implication: instrument prompt and response logging where AI systems handle sensitive data, and tie alerts to data classification and user identity.
Policy enforcement across AI workflows
DSPM for AI becomes operational when discovery and classification feed enforcement. The article’s model is to connect policy to AI workloads so that data use, access scope, and retention are controlled continuously rather than reviewed after the fact. That includes integrating with DLP, IAM, and SIEM so AI behaviour is evaluated in the same governance chain as other enterprise systems. The technical point is not just visibility. It is policy propagation across the full AI data lifecycle, from ingestion to output and retention.
Practical implication: extend existing access and retention policies into AI pipelines, then verify that enforcement reaches third-party and shadow AI tools.
Breaches seen in the wild
- DeepSeek breach — DeepSeek breach exposed 1M+ log lines and sensitive secret keys.
- Google Firebase misconfiguration breach — Firebase misconfigurations exposed 19.8M secrets across developer instances.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
DSPM for AI is becoming the missing governance layer between data security and identity control. AI systems do not just store data, they transform and redistribute it across prompts, outputs, and training paths that conventional controls were never designed to track. That makes the core problem broader than data loss prevention alone. Practitioners should treat DSPM for AI as a governance bridge, not a point control.
Shadow AI creates a measurable trust deficit because usage can outrun approval. When employees move sensitive content into unsanctioned or weakly governed AI tools, the security team loses visibility into who accessed what, for what purpose, and under which policy. The article is right to frame this as a blind spot. The operational implication is that approval state and actual usage state can diverge quickly in AI environments.
Data minimisation becomes harder, not easier, once AI enters the workflow. Training sets, prompts, and model outputs often pull together data that was never intended to be retained, repurposed, or recombined at scale. That turns minimisation into an active control problem instead of a policy statement. Practitioners need to recognise that AI can widen the reuse boundary unless the data governance model is explicit and continuously enforced.
Identity governance now has to follow data into places where the user is not the only actor. AI tooling frequently inherits human access, then amplifies reach through system-level permissions and third-party integrations. That creates a governance pattern where the data path matters as much as the login path. Security teams should re-evaluate whether their current IAM and DSPM controls are linked tightly enough to explain and constrain AI-driven data use.
Cross-domain governance is the real requirement, not another isolated AI security dashboard. The article’s strongest implication is that AI security sits at the intersection of IAM, data protection, compliance, and operational monitoring. No single team owns that boundary today. Practitioners should treat AI data governance as a programme design issue that must connect identity, data, and audit evidence from the start.
From our research:
- 85% of organisations lack full visibility into third-party vendors connected via OAuth apps, with 38% having no or low visibility and another 47% only partial visibility, according to The State of Non-Human Identity Security.
- Our research also shows that 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, compared to nearly 1 in 4 for securing human identities.
- For a broader governance lens, see Ultimate Guide to NHIs , Key Challenges and Risks for the visibility and sprawl patterns that often precede AI data exposure.
What this signals
Shadow AI creates an identity problem as much as a data problem. When employees move regulated data into AI tools outside approved workflows, the organisation loses the linkage between user, purpose, and dataset. That is why AI security programmes need to connect DSPM with access governance and audit trails, not treat it as a standalone data project.
With 85% of organisations lacking full visibility into third-party vendors connected via OAuth apps, the broader lesson is that AI governance will fail wherever delegated access is poorly mapped. AI platforms, plugins, and connected services expand the same exposure pattern from SaaS into model workflows, so practitioners should assume integration sprawl will outpace manual review unless governance is automated.
The practical signal for security teams is that AI policy cannot remain a document. It has to become an enforcement layer that tracks data purpose, access activity, and retention across sanctioned and unsanctioned tools, with evidence that auditors can test.
For practitioners
- Extend DSPM coverage into AI workflows Inventory sanctioned, shadow, and third-party AI systems, then verify that discovery reaches training data, prompts, outputs, and retention locations across cloud and SaaS environments.
- Classify AI data by purpose and regulatory scope Tag sensitive datasets with owner, purpose, and compliance context so model training and inference decisions can be enforced against policy rather than file location alone.
- Tie AI monitoring to identity context Correlate prompts, outputs, and access activity with user identity and dataset sensitivity so investigators can distinguish normal use from policy violation or data exfiltration.
- Enforce retention and minimisation rules in the pipeline Make deletion, anonymisation, and re-use restrictions part of the AI workflow so data does not remain available after the business purpose has ended.
- Use linked controls across IAM, DLP, and SIEM Connect policy decisions across access management, data loss prevention, and monitoring tools so AI governance is enforced consistently instead of in isolated silos.
Key takeaways
- DSPM for AI addresses the gap between static data security controls and fast-moving AI workflows that continuously reuse sensitive data.
- The scale of the risk is already visible in breach, cloud, and shadow AI statistics, which show that governance is lagging adoption.
- Practitioners should connect DSPM, IAM, and compliance controls so AI data use stays visible, policy-bound, and auditable.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST AI RMF and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST AI RMF | AI RMF fits the governance and accountability concerns around AI data workflows. | |
| OWASP Non-Human Identity Top 10 | NHI-03 | AI systems often rely on non-human access patterns that need lifecycle and privilege controls. |
| NIST Zero Trust (SP 800-207) | PR.AC-4 | Zero trust access control is relevant where AI tools consume sensitive data across environments. |
Require least-privilege access for AI workflows and verify every data access path continuously.
Key terms
- DSPM for AI: DSPM for AI is the application of data security posture management to AI workflows. It continuously discovers, classifies, and governs sensitive data as it moves through training, inference, prompts, and outputs so organisations can enforce policy and compliance in real time.
- Shadow AI: Shadow AI is the use of AI tools, models, or agents outside approved governance and visibility. In practice, it creates unmanaged data movement, weak accountability, and gaps in logging or retention, which makes sensitive information harder to control and audit.
- Data minimisation: Data minimisation is the discipline of limiting data collection, retention, and reuse to what is necessary for a specific purpose. In AI environments, it must be enforced continuously because models and workflows can recombine data in ways that expand exposure beyond the original business need.
- AI workflow governance: AI workflow governance is the set of controls that define how AI systems may access, transform, and share data. It combines identity, data, compliance, and monitoring controls so that model usage remains visible, policy-bound, and auditable across the full lifecycle.
Deepen your knowledge
DSPM for AI and AI workflow governance are covered in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for shared access, delegated data use, or shadow AI, it is worth exploring.
This post draws on content published by Cyera: Navigating the World of DSPM for AI and Why It Is Mission Critical for Enterprise Organizations. Read the original.
Published by the NHIMG editorial team on 2025-10-07.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org