Subscribe to the Non-Human & AI Identity Journal
Home FAQ Governance, Ownership & Risk What do organisations get wrong about DLP for…
Governance, Ownership & Risk

What do organisations get wrong about DLP for AI use cases?

← Back to all FAQ
By NHI Mgmt Group Editorial Team Updated June 6, 2026 Domain: Governance, Ownership & Risk

They assume keyword matching can distinguish legitimate work from sensitive exfiltration. In practice, AI prompts are contextual, so the same text may be safe in one workflow and dangerous in another. Teams need policy that evaluates intent, destination, and action, not just strings.

Why This Matters for Security Teams

Traditional DLP is designed to catch known patterns in data leaving the organisation, but AI use cases blur the boundary between normal work and sensitive disclosure. A prompt can contain source code, customer context, or regulated data without being malicious, while the same prompt can also be an exfiltration step. That is why current guidance suggests evaluating the full request context, not just the text payload, alongside policy and destination controls as described in the NIST Cybersecurity Framework 2.0.

The deeper failure is assuming that humans are the only actors worth classifying. AI workflows often move sensitive material into external model endpoints, copilots, or embedded tools where DeepSeek breach style exposure shows how quickly secrets can be learned, copied, or resurfaced when governance is weak. This is not just a classification problem; it is a control problem across identity, intent, and action.

In practice, many security teams encounter prompt-based data loss only after the model interaction has already happened, rather than through intentional review of the workflow.

How It Works in Practice

Effective AI-era DLP starts by treating prompts, attachments, retrieved context, and tool calls as one decision surface. Policy should ask three questions at runtime: what is being sent, where is it going, and what is the user or agent trying to accomplish? That approach aligns better with NIST Cybersecurity Framework 2.0 than static keyword scanning, because the control objective is to reduce exposure, not just detect sensitive phrases.

Practitioners usually get better results when DLP is paired with identity and workflow controls:

  • Classify the destination, such as public SaaS AI, internal model, or approved private deployment.
  • Use intent-based rules, for example allowing summarisation but blocking export of source code or secrets.
  • Apply context-aware redaction so only the minimum required data reaches the model.
  • Log prompt, response, and tool activity for investigation and policy tuning.

This is where identity matters. If an AI workflow is operating as a workload with its own permissions, DLP alone cannot tell whether a request is a legitimate task or an abuse path. The exposure patterns discussed in DeepSeek breach reinforce a broader point: once sensitive material enters an AI system, downstream reuse can be difficult to contain without tight policy, short-lived access, and strong auditability. The most reliable designs combine DLP with zero standing privilege, workload identity, and approval gates for high-risk actions, rather than trusting the text filter to make the security decision.

These controls tend to break down when users route prompts through unmanaged browser plugins or shadow ai services because the security stack never sees the full request path.

Common Variations and Edge Cases

Tighter DLP often increases friction for analysts, engineers, and support teams, requiring organisations to balance productivity against confidentiality. That tradeoff is especially sharp in environments that use code assistants, retrieval-augmented generation, or multi-step agent workflows, because the same prompt may be both a legitimate task and a disclosure risk.

There is no universal standard for this yet, but best practice is evolving toward policy that understands business context. For example, a finance team may be allowed to ask a model to classify invoices, while blocking the upload of full ledger exports. An engineering team may be allowed to paste an error trace, while blocking repository secrets. The control is not “never send data to AI”; it is “send only what the current intent requires.” That distinction is consistent with emerging agentic governance guidance in NIST Cybersecurity Framework 2.0 and operational lessons surfaced in DeepSeek breach.

The edge case that catches teams most often is internal AI use over high-value data, where employees assume trusted tools make DLP unnecessary. In reality, internal deployment reduces some exposure but does not remove the need to inspect intent, destination, and post-processing, especially when model outputs can be copied into downstream systems or agents can chain actions across tools.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Agentic AI Top 10A01Covers unsafe agent behaviour and prompt-driven data exposure.
CSA MAESTROGOV-03Addresses governance for AI workflows that can move or expose sensitive data.
NIST AI RMFSupports risk-based AI controls that consider context, impact, and accountability.

Review agent actions at runtime and block sensitive tool use unless the request is clearly authorised.

NHIMG Editorial Note
Reviewed and updated by the NHIMG editorial team on June 6, 2026.
NHI Mgmt Group — the #1 independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org