How should security teams stop sensitive data from being uploaded into public AI tools?

Why This Matters for Security Teams

Public AI tools create a data-loss path that traditional perimeter controls often miss. Once a user pastes source code, customer records, or incident details into an external chatbot, the organisation may lose practical control over retention, model training exposure, and downstream sharing. The risk is not only exfiltration by attackers; it is also accidental disclosure by well-intentioned employees trying to work faster. Current guidance suggests treating AI upload destinations as a distinct trust boundary, not as just another website.

NHI Management Group’s research on the State of Non-Human Identity Security shows that only 1.5 out of 10 organisations are highly confident in securing NHIs, which matters because the same weak credential and visibility patterns often underlie data movement into unsanctioned AI services. The practical question is not whether users will copy data into AI tools, but whether the control stack can stop it at the moment of transfer. In practice, many security teams discover the issue only after sensitive content has already been pasted into a public AI tool, rather than through intentional policy enforcement.

How It Works in Practice

The most effective approach is to combine endpoint DLP, browser controls, and application-aware policy so the decision is made at send time. That means the control should inspect the content itself, the destination application, and the user’s context before allowing upload or paste operations. The NIST Cybersecurity Framework 2.0 is useful here because it frames the problem as protecting information flows, not just locking down devices. For organisations that need a deeper threat lens, the LLMjacking research shows how quickly exposed credentials and sensitive content can be abused once they leave controlled environments.

In practice, teams should build layered controls:

Classify data so the endpoint policy can distinguish confidential, regulated, and public content.

Block or warn on uploads, paste events, and drag-and-drop actions into approved public AI domains and unsanctioned AI tools.

Use browser isolation or sanctioned gateways where business use of AI is allowed under logging and inspection.

Detect secrets, API keys, source code, and regulated data patterns before transfer, then quarantine or require justification.

Log the event with user, device, destination, and content category for investigation and tuning.

For broader governance context, the Ultimate Guide to NHIs is a useful reference point for understanding how identity sprawl, credential exposure, and visibility gaps intersect with modern data handling. These controls tend to break down when users move between managed and unmanaged devices because the policy engine cannot reliably inspect content or enforce destination-aware rules across every path.

Common Variations and Edge Cases

Tighter upload blocking often increases user friction, requiring organisations to balance data protection against productivity and approved AI use cases. That tradeoff is especially sharp for engineering, legal, and security teams that legitimately need summarisation or analysis support. Best practice is evolving, and there is no universal standard for this yet, so the control model should be tuned to data sensitivity and business workflow rather than imposed as a blanket ban.

Common edge cases include screenshots, copied chat transcripts, images containing sensitive text, and files renamed to bypass simple extension filters. Some organisations also allow approved AI copilots while blocking public tools, which requires destination-based policy rather than file-only inspection. Where browser policy, CASB, and endpoint DLP overlap, teams should define a single enforcement owner to avoid gaps and conflicting user prompts. The key is to stop data at the last possible trusted control point, not rely on post-upload detection.

Public AI controls are least reliable in BYOD, contractor, and remote-work environments where the organisation does not fully manage the browser, clipboard, or local storage path.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-04	Sensitive data often moves with overexposed identities and tokens.
NIST CSF 2.0	PR.DS-1	Directly addresses protecting data at rest and in transit.
NIST AI RMF		AI RMF supports governing safe, controlled AI use and data handling.

Apply data protection controls at the endpoint and browser layer before content leaves the enterprise boundary.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams stop sensitive data from being uploaded into public AI tools?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group