AI systems expose sensitive data through lifecycle gaps and shadow use

By NHI Mgmt Group Editorial TeamPublished 2026-06-24Domain: Agentic AI & NHIsSource: Orca Security

TL;DR: AI systems expose sensitive data through training ingestion, stored prompts, output leakage, and shadow AI use, according to Orca Security, while nearly a third of enterprise employees admit to entering internal documents or emails into AI tools. The core problem is that data can persist, reappear, or move outside approved control points after it enters AI workflows.

At a glance

What this is: This is an analysis of why sensitive data becomes harder to govern in AI systems and which controls reduce exposure across the AI lifecycle.

Why it matters: It matters because identity, access, and data governance teams need controls that follow AI data flows, service identities, and runtime behaviour rather than relying on perimeter-only assumptions.

By the numbers:

56% of security professionals confirm employees use AI tools without formal approval.
22% suspect it is happening but cannot prove it.

👉 Read Orca Security's analysis of sensitive data protection across AI environments

Context

Sensitive data risk in AI systems is not just a data protection issue. Once information enters an AI pipeline, it can be stored, transformed, reused, or regenerated in ways that conventional file-centric controls do not fully govern, especially when service identities and shadow tools are involved.

For IAM, IGA, PAM, and NHI teams, the practical question is where accountability stops. If the identity layer does not track which workload, tool, or user path can introduce sensitive data into model training or prompt history, governance becomes partial and reactive rather than continuous.

Key questions

Q: How should security teams govern sensitive data in AI pipelines?

A: Security teams should govern AI pipelines from ingestion to output, not just at the model boundary. That means classifying sensitive data before it enters the workflow, scoping service identities to each pipeline stage, limiting prompt retention, and validating outputs before reuse. The objective is to keep data lineage, access, and deletion controls aligned with how AI actually processes information.

Q: Why do AI systems make sensitive data harder to protect than traditional applications?

A: AI systems can encode, retain, and regenerate information in ways that do not map to classic file or database controls. Once data enters prompts, training sets, or stored conversations, it may persist outside normal deletion and access revocation paths. That creates a governance problem, not just an encryption problem, because exposure can reappear through later prompts or outputs.

Q: What breaks when AI workloads share one broad service identity?

A: A shared service identity creates standing privilege across training, inference, logging, and preprocessing. That broad scope makes it easier for a mistake or compromise to expose more data than any single function needs. It also weakens accountability because access logs no longer show which stage actually required the permission. Separate identities by function and remove unused cross-stage access.

Q: Who is accountable when sensitive data is retained in a third-party AI tool?

A: Accountability sits with the organisation that allowed the data into the tool, even if the provider stores or processes it. Teams need clear ownership for prompt retention, deletion requests, and vendor data processing terms. If the provider cannot prove erasure or lineage, the organisation still carries the compliance and privacy risk.

Technical breakdown

Why AI data flows bypass traditional DLP controls

Traditional data loss prevention tools are built around fixed transfer points such as email, file upload, and endpoint copy events. AI workflows break that model because data can move through browser sessions, prompt inputs, or backend integrations without a discrete file boundary. That means the control plane must shift from perimeter inspection to context-aware policy around prompts, storage, and service identities. The real technical issue is not only exposure, but persistence: once data enters a model or retained prompt store, deletion and revocation no longer map cleanly to the exposure event.

Practical implication: extend monitoring from file movement to prompt, API, and model interaction paths.

Why least-privilege service identities matter in AI workloads

AI pipelines often reuse broad service identities across training, inference, logging, and preprocessing because teams optimise for speed of deployment. That creates a standing privilege problem: one identity can reach multiple data stores and model artefacts even when each stage needs only a narrow subset of permissions. Least privilege in this context means scoping access by pipeline stage, not by application name alone. Role-based access and just-in-time patterns reduce how much sensitive data any one workload can touch during normal operation.

Practical implication: split AI workload identities by function and remove cross-stage access that is not required.

How prompt storage and output leakage create durable exposure

Prompt histories and model outputs introduce a governance problem rather than a simple configuration issue. Stored prompts may live with third-party providers, while model outputs can echo memorised fragments of training data or policy-violating content. This means sensitive data can reappear outside the original processing context even when the source record was deleted. The defensive logic is layered: classify data before ingestion, enforce retention limits on prompt stores, and filter outputs before they are returned to users or downstream systems.

Practical implication: treat prompt retention and output review as governance controls, not only security settings.

Threat narrative

Attacker objective: The objective is to obtain sensitive enterprise data through AI workflows that bypass normal control points and then retain or reuse it outside the organisation's governance boundary.

entry via employees pasting internal documents, credentials, or regulated data into AI tools and prompts that sit outside formal approval paths.
credential or data exposure through training ingestion, prompt storage, or unsanctioned tool use that places sensitive material under third-party control.
impact through data resurfacing, policy violation, or loss of regulatory control when the model or provider retains or regenerates the information.

Reviewdog GitHub Action supply chain attack — reviewdog/action-setup GitHub Action supply chain attack exposed secrets.
CI/CD pipeline exploitation case study — full server takeover via exposed .git directory and mismanaged CI/CD pipeline secrets.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

AI data governance fails when organisations assume the control point is the file boundary. This guide shows that sensitive data now moves through prompts, retained conversations, and service-to-service paths that legacy DLP was never built to observe. The result is not just leakage, but unbounded persistence across model memory, vendor retention, and workflow reuse. Practitioners should treat AI data movement as a governance domain in its own right, not as an extension of endpoint protection.

Standing privilege becomes a data exposure multiplier when AI pipelines reuse the same identity across every stage. Training, inference, logging, and preprocessing are often collapsed into one service account for convenience. That design turns a narrow processing job into a broad trust problem because one identity can access more sensitive data than any single stage actually requires. Practitioners should re-evaluate AI workload identities as separately governed subjects, not as implementation details.

Shadow AI is a lifecycle failure, not just an awareness issue. When employees can introduce sensitive data into unsanctioned tools without discovery, approval, or ownership, governance has already broken before the model sees the data. The problem is not merely that the tool is unapproved. The deeper failure is that the organisation cannot prove where the data went, who processed it, or whether retention and deletion obligations were met. Practitioners should treat hidden AI usage as an identity and data lineage gap.

Prompt retention creates a new form of trust debt across AI programmes. Data that enters a prompt store may outlive the business purpose for which it was shared, and deletion rights become hard to enforce once a third party has copied or indexed the content. That is why compliance frameworks increasingly matter here: GDPR, HIPAA, CCPA, and NIST AI RMF all expose the same weakness in different language. Practitioners should align AI governance with retention, erasure, and provenance controls rather than relying on policy statements alone.

Sensitive data discovery is now a prerequisite for AI assurance, not an adjacent hygiene task. The first governance decision is not what model to use, but what data is allowed to reach it and under what identity path. That shifts the discipline from reactive inspection to pre-ingestion control, where classification, access scope, and monitoring are tied together. Practitioners should make discovery and lineage the starting point for every AI security review.

From our research:
57% of organisations lack a complete inventory of their machine identities, according to The Critical Gaps in Machine Identity Management report.
61% rely on spreadsheets or manual tracking for machine identity management, according to the same report.
For a broader lifecycle lens, see the NHI Lifecycle Management Guide for provisioning, rotation, offboarding, and visibility controls.

What this signals

Secret sprawl and hidden AI usage are converging into the same governance failure. With 57% of organisations lacking a complete inventory of their machine identities, according to The Critical Gaps in Machine Identity Management report, any AI programme that cannot map service identities will also struggle to prove where sensitive data moved.

Sensitive data discovery is becoming the control that makes every other AI safeguard enforceable. Classification, retention, and access scope only work when the organisation can see the identities and pipelines that touch the data. That is why AI security teams need to treat visibility as a prerequisite for policy, not a dashboard afterthought.

Shadow AI will keep expanding until identity governance reaches the prompt layer. The operational answer is not another isolated policy document. It is lineage across users, service accounts, tools, and retained prompts so that data handling obligations can be demonstrated rather than assumed.

For practitioners

Classify data before AI ingestion Map PII, PHI, financial records, source code, and credentials before they reach any model, prompt store, or retrieval layer. Use the classification result to drive downstream retention, access, and logging policy.
Separate AI workload identities by pipeline stage Assign distinct service identities to training, inference, preprocessing, and logging. Remove cross-stage permissions so a single compromise or mistake cannot expose every data store behind the pipeline.
Treat prompts as governed records Apply retention limits, access review, and deletion workflows to prompt histories in the same way you would treat regulated records. Confirm whether third-party providers can actually purge stored conversation data.
Extend monitoring to shadow AI paths Discover unsanctioned tools, browser-based AI use, and third-party integrations that can receive enterprise data outside approved controls. Feed discovery into SIEM and posture tooling so hidden usage becomes visible.
Validate outputs before reuse Scan model responses for sensitive fragments, policy violations, and anomalous content before they are sent to users or downstream systems. Treat output filtering as a release gate, not a post-incident cleanup step.

Key takeaways

AI systems expose sensitive data through prompt retention, training reuse, output leakage, and unsanctioned tool use.
The scale problem is structural because broad service identities and poor inventory make it hard to know where data flows or who can access it.
Teams should move from perimeter controls to classified data, scoped workload identities, prompt governance, and continuous monitoring.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A3	AI prompts and outputs create injection and leakage risks covered by agentic controls.
OWASP Non-Human Identity Top 10	NHI-03	Shared service identities and over-privilege are central to AI pipeline exposure.
NIST CSF 2.0	PR.DS	Sensitive data protection and monitoring align with the CSF's data-security outcomes.

Classify prompt handling, tool access, and output validation as separate control points.

Key terms

Shadow AI: Shadow AI is the use of AI tools, models, or integrations that the organisation has not approved or cannot fully see. In practice, it creates a governance blind spot because sensitive data, prompts, and outputs can move outside policy, retention, and contractual controls without leaving a clear ownership trail.
AI data lineage: AI data lineage is the ability to trace what data entered an AI system, which identities or tools touched it, where it was stored, and how it was transformed. For security teams, lineage is what turns AI governance from policy intent into auditable evidence across training, prompts, outputs, and retention paths.
Service identity: A service identity is a non-human identity used by an application, workload, or pipeline component to authenticate and access resources. In AI environments, a single service identity is often overused across multiple stages, which increases the blast radius and makes least-privilege governance harder to enforce.
Prompt retention: Prompt retention is the storage of user prompts, chat history, or model context after a conversation ends. It matters because sensitive data can persist outside the original system boundary, creating privacy, compliance, and deletion challenges when organisations cannot prove what was kept, for how long, or where it was copied.

Deepen your knowledge

NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.

This post draws on content published by Orca Security: Protecting Sensitive Data in AI Environments. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-06-24.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org