ChatGPT data leak controls expose the limits of enterprise DLP

By NHI Mgmt Group Editorial TeamPublished 2026-05-20Domain: AnnouncementsSource: Cyera

TL;DR: Most enterprise DLP tools cannot reliably stop sensitive data from being pasted into ChatGPT because they do not inspect the browser input field where the leak actually happens, according to Cyera. The practical lesson is that AI use needs context-aware policy, not just broader blocking, because productivity pressure is now outrunning approval-based security models.

At a glance

What this is: This is an analysis of how sensitive data can leak into ChatGPT through pasted prompts and why conventional DLP and browser controls miss that path.

Why it matters: It matters because IAM, NHI, and human access programmes now have to govern data movement into consumer and enterprise AI sessions, not just authentication and storage.

👉 Read Cyera's analysis of how to stop sensitive data leaks in ChatGPT

Context

The core problem is not whether employees will use ChatGPT. The real issue is that sensitive data can move from trusted systems into a browser prompt faster than conventional DLP, CASB, SWG, and endpoint controls can evaluate context or stop the paste.

For IAM and data governance teams, this is a policy problem as much as a tooling problem. Organisations now need controls that understand which data is sensitive, which account is being used, and whether the session is sanctioned before information leaves the endpoint.

Key questions

Q: How should security teams stop sensitive data from being pasted into ChatGPT?

A: Start by enforcing at the browser prompt, not just at file upload or network egress. Classify data first, then apply context-aware policy that can allow, warn, redact, or block based on the sensitivity of the content and whether the session is sanctioned. That approach is stronger than keyword matching because it follows the data, not the format.

Q: Why do traditional DLP tools fail for AI chat usage?

A: Traditional DLP tools often inspect files, email, or network flows, but pasted prompts happen inside the browser input field. If the control does not see the exact interaction where text is entered, it cannot reliably evaluate context before sensitive data leaves the endpoint. The failure is visibility, not just policy intent.

Q: What do organisations get wrong about allowing employee ChatGPT use?

A: They often treat approval as a binary yes-or-no decision instead of a data-governance problem. Employees may use both enterprise and personal accounts on the same device, so the real task is distinguishing sanctioned from unmanaged sessions and applying different rules to each. Without that separation, users route around controls.

Q: How do teams know whether AI prompt controls are actually working?

A: Look for whether the control is operating at the moment of prompt entry and whether it can distinguish data classes, account type, and destination. If users can still paste regulated content into personal AI sessions without warning or enforcement, the control is cosmetic rather than operational. Effective controls reduce silent leakage, not just alert volume.

How it works in practice

Why pasted prompts bypass traditional DLP

Most enterprise DLP tools focus on files, network traffic, or static content inspection. ChatGPT use often happens inside a browser input field, where the user pastes data directly into a prompt and the control plane never sees a conventional file transfer or egress event. That means regex-heavy detection and legacy browser inspection miss the moment that matters. The failure is architectural: the control is looking at the wrong object at the wrong layer, so context never reaches enforcement.

Practical implication: inspect browser prompt content at the point of entry, not only file uploads or network egress.

How data classification changes AI prompt enforcement

Cyera's approach relies on classifying data first, then applying policy as that data is encountered in a prompt. That matters because customer records, source code, regulated fields, and intellectual property require different treatment, even when the user action looks identical. Instead of guessing from keywords, the control uses data labels and context to decide whether to allow, warn, redact, or block. This is a data-governance pattern, not a chatbot-specific exception, and it fits the broader NHI security problem of tracking where sensitive data travels once it leaves a primary system of record.

Practical implication: tie AI prompt policy to trusted data classification rather than generic keyword rules.

Enterprise and consumer ChatGPT sessions need different policy paths

The article highlights a common deployment reality: one managed device can host both a sanctioned enterprise AI tenant and a personal consumer account. Those sessions carry different training, retention, and risk assumptions, so treating them as equivalent creates avoidable exposure. The useful control is session-aware policy that distinguishes approved enterprise use from unmanaged personal use on the same endpoint. That lets security teams apply different rules without forcing a blanket ban that users will route around.

Practical implication: distinguish sanctioned enterprise AI access from personal AI access on managed devices and enforce separate policies.

NHI Mgmt Group analysis

Browser-prompt leakage is a governance gap, not just a DLP miss. The security issue is not merely that ChatGPT is popular. It is that data leaves a trusted environment through a browser interaction that legacy controls were never built to observe. This is a classic boundary failure between data governance and access governance, and it is most visible when organisations assume the endpoint already knows enough to decide safely. Practitioners should treat prompt submission as a controlled data-exfiltration path, not a casual user action.

Context-aware classification is now the decisive control variable for AI sessions. The article shows why generic pattern matching fails when the same machine may handle regulated records, source code, and ordinary text in the same workflow. Classification-based policy is more durable because it follows the sensitivity of the data instead of the surface form of the content. That is a stronger fit for NHI-era data movement, where the question is less about who authenticated and more about what the identity is trying to move into an external model.

Shadow AI on managed devices creates an access problem that looks like a usage problem. When employees can move between enterprise and personal ChatGPT accounts on the same endpoint, security teams are no longer managing a single sanctioned service. They are governing multiple AI identities with different policy expectations, some approved and some invisible. That requires a programme view that joins device trust, account trust, and data sensitivity. Practitioners should assume the inventory problem comes before the control problem.

Data exfiltration into AI prompts is becoming a named category of identity-adjacent risk. Cyera's framing effectively names a pattern we increasingly see across the AI stack: trusted users, trusted devices, and untrusted destinations combining into an invisible leak path. The useful concept here is prompt-channel exposure, which describes sensitive data leaving approved systems through the AI interaction layer. Teams should use that lens when updating data-loss and AI-usage governance.

From our research:
98% of companies plan to deploy even more AI agents within the next 12 months, despite documented rogue behaviour in 80% of current deployments, according to AI Agents: The New Attack Surface report.
Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.
That visibility gap makes prompt-channel governance harder, which is why readers should also review Guide to the Secret Sprawl Challenge for the credential and data-flow side of the problem.

What this signals

Prompt-channel exposure is likely to become a standard governance term as organisations formalise AI usage policies. The practical shift is away from broad blocking and toward differentiated enforcement that recognizes sanctioned enterprise tenants, unmanaged consumer accounts, and sensitive data classes as separate control objects.

The next maturity step for many programmes will be joining data classification with endpoint context so that paste-time decisions can be made locally. That matters because security teams cannot depend on quarterly inventory reviews when AI use is happening inside everyday browser workflows.

As adoption spreads, the control question will move from whether employees are using AI to whether the organisation can explain where regulated data went, into which session, and under which policy. That is a compliance and identity-adjacent visibility issue, not just a content filter problem.

For practitioners

Map prompt-channel exposure paths Identify where employees can paste regulated or proprietary data into AI chat sessions from managed devices, including personal accounts and embedded AI inside SaaS apps. Prioritise browser-based use cases because that is where legacy DLP typically loses visibility.
Anchor AI policy to trusted data labels Use the same classification taxonomy your data team already maintains for customer PII, source code, financial records, and regulated fields. Apply allow, warn, redact, or block actions based on classification and destination rather than keyword matches.
Separate sanctioned and unmanaged AI access Create distinct policy paths for enterprise AI tenants and consumer AI accounts on the same device. If the session is unmanaged, tighten controls around paste actions, file upload paths, and data-bearing prompts.
Treat prompt submission as a control point Add endpoint or browser enforcement where the user actually enters content, not only where files are opened or transferred. That keeps the control close to the exfiltration moment and reduces workarounds.

Key takeaways

Sensitive data can leak into AI chat sessions through pasted prompts even when conventional DLP and CASB controls appear to be in place.
The governance gap is not simply ChatGPT usage, but unmanaged session context across enterprise and personal accounts on the same device.
Effective control depends on trusted data classification, browser-level enforcement, and separate policy paths for sanctioned and unmanaged AI use.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-01	Prompt leakage is an NHI data exposure problem across AI sessions.
NIST CSF 2.0	PR.DS-1	Protecting data in use and transit is central to prompt-channel enforcement.
NIST Zero Trust (SP 800-207)	PR.AC-4	Session and device trust must be enforced before AI data can leave the endpoint.

Classify AI session data paths and enforce controls on sensitive content leaving the browser.

Key terms

Prompt-channel exposure: Prompt-channel exposure is the movement of sensitive data into an AI system through user-entered prompts rather than file transfer or API integration. It matters because traditional DLP often watches the wrong control point, leaving browser input as a blind spot for regulated or proprietary information.
Shadow AI: Shadow AI is the use of AI tools, agents, or embedded AI features that security and governance teams have not approved, inventoried, or controlled. In practice, it creates hidden data paths, unmanaged policy exceptions, and reporting gaps that make access and exfiltration harder to explain.
Data classification: Data classification is the process of labelling information according to sensitivity, regulatory impact, or business value so controls can be applied consistently. For AI governance, it allows policy to follow the data into prompts, sessions, and destinations rather than relying on brittle text matching.

Deepen your knowledge

AI prompt governance and data classification are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for ChatGPT and similar tools, it is worth exploring.

This post draws on content published by Cyera: How Cyera stops sensitive data leaks in ChatGPT. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-20.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org