TL;DR: AI assistants are now a common path for accidental secret exposure, and Entro Security says WebGuard scans prompts in the browser to block secrets and PII before they reach major LLMs. The governance gap is that current IAM and DLP assumptions do not see what users paste into AI tools in real time.
At a glance
What this is: This is an analysis of browser-side prompt scanning for AI tools, focused on stopping secrets and PII from leaving the browser before they reach external LLMs.
Why it matters: It matters because AI prompt leakage now sits at the intersection of human behaviour, NHI secret exposure, and emerging AI governance, forcing IAM teams to extend visibility beyond traditional control planes.
By the numbers:
- When AWS credentials are exposed publicly, attackers attempt access within an average of 17 minutes and as quickly as 9 minutes in some cases.
👉 Read Entro Security’s analysis of browser-side AI prompt secret scanning
Context
AI prompt leakage is a governance problem, not just a user-training problem. If employees can paste production API keys, tokens, or customer data into browser-based AI assistants without the control plane seeing it, existing DLP, logging, and access review models are already behind the behaviour they are meant to govern.
The primary issue for identity teams is that sensitive data often leaves through the user’s browser before any traditional security stack can classify it. That creates a blind spot across human identity, secrets management, and the wider NHI estate, because the same credentials that protect systems can be exposed through everyday assistant use.
Key questions
Q: How should security teams prevent secrets from being pasted into AI assistants?
A: Security teams should intercept prompts before they leave the browser, classify sensitive content in real time, and apply policy based on data type. High-risk secrets should be blocked, lower-risk content can be warned or audited, and all decisions should be recorded in a sanitised event trail for governance and incident review.
Q: Why do AI assistants create new risk for secrets management?
A: AI assistants create new risk because they turn ordinary user workflows into outbound data transfers that traditional controls do not always inspect. A secret pasted into a prompt can bypass the usual security path, enter third-party logs, and persist outside the organisation’s direct control, which changes both exposure and accountability.
Q: What breaks when prompt content is not inspected before send?
A: When prompt content is not inspected before send, organisations lose the chance to stop accidental disclosure at the browser boundary. That means secrets, credentials, and sensitive records can reach external models or logs intact, leaving security teams with evidence after exposure instead of prevention before transmission.
Q: Who is accountable when an employee leaks a secret into an AI prompt?
A: Accountability should sit with the organisation’s data, identity, and acceptable-use governance, not with the model itself. The practical question is whether the company defined prompt handling as a controlled data path, logged user decisions, and set response rules that match the sensitivity of the information involved.
How it works in practice
Browser-side prompt inspection and secret detection
Browser-side prompt inspection moves classification closer to the point of user action. Instead of waiting for network-layer inspection or post-event logging, the control examines text as it is typed and again before send. That matters because secrets are often embedded in free-form prompts, attached files, or copied configuration snippets, which means the exposure is contextual rather than simply field-based. Classification engines then label likely secrets such as API keys, access tokens, or customer data before the prompt reaches the model. Practical implication: identity and security teams need controls that operate at the browser boundary, not only in downstream analytics.
Practical implication: place detection before the model sees the prompt, not after the data has already left the browser.
Policy actions for blocked, warned, or audited prompts
A useful prompt-control system is not binary. Different data classes need different responses, because private keys, email addresses, and low-risk metadata do not justify the same operational handling. Blocking stops the request and redacts the sensitive content. Warning allows a user to cancel or continue while recording the decision. Audit allows the prompt through but creates evidence for later review. This policy split is important because security teams need room to tune controls without causing blanket disruption. Practical implication: map prompt sensitivity to response type, then align those responses with data classification and acceptable-use policy.
Practical implication: define separate response paths for high-risk secrets, moderate-risk PII, and low-risk informational prompts.
Why outbound prompt logs become a governance control
Outbound prompt logging turns a one-time exposure event into an auditable identity and data trail. The operational value is not just detection but evidence: timestamp, content class, action taken, and whether the user bypassed a warning. That creates accountability without requiring every prompt to be blocked. It also matters that logs themselves must be redacted, because the audit trail can become another leak path if raw secrets are stored. Practical implication: send only sanitised events to SIEM or governance tooling, and treat the log pipeline as part of the control surface.
Practical implication: redact sensitive values before forwarding prompt events into SIEM, case management, or compliance evidence stores.
NHI Mgmt Group analysis
Prompt leakage is now an identity governance issue because the browser has become an unofficial secrets transport layer. Traditional IAM and DLP programmes assume sensitive data moves through known systems and policy-controlled channels. When employees paste keys, tokens, or customer records into AI assistants, that assumption fails and governance loses sight of the event before model interaction even begins. Practitioners should treat browser-visible prompt handling as part of the identity and secrets control boundary.
Secret exposure through AI assistants is a standing privilege problem in a new disguise. A production API key pasted into a prompt behaves like any other long-lived credential once it escapes controlled boundaries. The difference is that the exposure can occur through normal work, not malicious exfiltration, which means recertification and access review processes will miss the moment unless the prompt boundary is monitored. Practitioners should map AI use to the same risk logic they already apply to overexposed secrets.
Browser-side prompt governance is becoming a named control category, not a feature add-on. Prompt boundary enforcement: controls that inspect and classify sensitive content before it leaves the browser define a new operational layer between human action and model ingestion. That layer matters because the model is no longer the only thing that must be secured; the outbound prompt path becomes part of the governance model. Practitioners should expect policy, detection, and audit to converge at this boundary.
AI assistant use collapses the distinction between user convenience and enterprise data handling. Employees do not stop working when they use a model, but the security programme must still decide whether the prompt is a communication, a data transfer, or a disclosure event. That ambiguity is why teams need explicit governance for assistant use across human identity, secrets, and NHI oversight. Practitioners should define who owns prompt risk before incident response has to do it for them.
From our research:
- Two-thirds of enterprises have endured a successful cyberattack resulting from compromised non-human identities, with a quarter encountering multiple attacks, according to The 2024 ESG Report: Managing Non-Human Identities.
- The average organisation believes more than 1 in 5 of their non-human identities are insufficiently secured, which shows how broad the exposure surface already is across machine credentials and access paths.
- That governance gap is why many teams now need to think beyond static secret storage and into prompt-path controls, as described in the Guide to the Secret Sprawl Challenge.
What this signals
Prompt boundary enforcement is likely to become a standard control expectation wherever employees use AI assistants for coding, support, or analysis. The programme question is no longer whether users will paste sensitive data, but whether the control surface can see and classify it before the model does.
The wider signal for identity teams is that secrets governance is expanding from vaults and rotation schedules into user interaction points. That shift aligns with the broader NHI problem: access risk now appears wherever credentials are handled, not just where they are stored.
With two-thirds of enterprises already suffering compromise linked to NHIs, browser-based AI leakage adds another route into the same underlying problem space. Teams that already struggle with secret inventory and exposure windows should expect assistant use to widen the blast radius unless prompt controls are treated as governance, not convenience.
For practitioners
- Define browser-prompt controls as a governed data path Treat prompts to AI assistants as a monitored outbound channel. Classify what may be typed, pasted, or attached, and align that policy with secrets handling, PII handling, and acceptable-use rules.
- Block high-risk secrets before model submission Use inline controls to stop production keys, tokens, and certificates at the browser boundary. Redact the sensitive value from the user field so the prompt cannot be reconstructed from the control itself.
- Separate warn, block, and audit decisions by data type Do not force one response for all prompt content. Reserve block for high-risk secrets, warn for recoverable mistakes, and audit for lower-risk patterns that still need evidence.
- Sanitise prompt logs before SIEM ingestion Forward only redacted scan events into logging and case-management systems. Preserve timestamp, action, and user decision, but remove raw secret values so the log never becomes a secondary leak.
Key takeaways
- AI assistant prompt leakage turns everyday user behaviour into a governance problem because secrets can leave the browser before traditional controls see them.
- Detection, policy response, and audit trail design matter because not every prompt needs the same action, but every sensitive prompt needs a recorded decision.
- Identity teams should extend secrets governance to the browser boundary, where prompt handling increasingly determines whether exposure is prevented or merely documented.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Non-Human Identity Top 10 | NHI-01 | Prompt leakage exposes secrets before policy can act, matching secret discovery and exposure risks. |
| NIST CSF 2.0 | PR.DS-1 | Sensitive data in prompts must be protected in transit and before external disclosure. |
| NIST Zero Trust (SP 800-207) | PR.AC-4 | Prompt controls extend least-privilege thinking to the user-to-model interaction boundary. |
Classify prompt handling as a secret-exposure control path and block high-risk credentials at the browser boundary.
Key terms
- Prompt Boundary Enforcement: Prompt boundary enforcement is the practice of inspecting AI prompts before they leave the browser or client application. It treats the prompt as a controlled data path, so secrets, PII, and other restricted content can be blocked, warned on, or logged according to policy before model ingestion.
- Outbound Prompt Logging: Outbound prompt logging is the capture of prompt events, user decisions, and detection outcomes as a governance record. The value is accountability, but the logs must be sanitised because raw prompt content can become a second exposure path if sensitive values are retained.
- Secret Exposure Window: A secret exposure window is the period between a credential entering an uncontrolled context and the moment it is revoked, rotated, or otherwise contained. In AI prompt scenarios, that window can begin when a user pastes a secret and extend into third-party systems if the prompt is not blocked.
- Prompt Path Governance: Prompt path governance is the policy and control model for how users interact with AI assistants when sensitive information may be involved. It spans classification, enforcement, logging, and accountability at the interface between human action and model access.
What's in the full announcement
Entro Security's full blog post covers the operational detail this post intentionally leaves for the source:
- How WebGuard classifies secrets and PII at prompt time across typed text, pasted code, and attached files.
- The exact block, prevent, and audit response modes and how each changes user workflow.
- Examples of what gets caught, including GitHub tokens, AWS access keys, and Stripe keys.
- How outbound events are redacted before forwarding into SIEM and audit workflows.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.
Published by the NHIMG editorial team on 2026-06-26.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org