Why do RAG systems create data exposure risk even without prompt injection?

Why This Matters for Security Teams

RAG is often deployed as a productivity feature, but it also becomes a discovery engine for sensitive content. The exposure risk is not limited to hostile prompts: broad indexing, weak document segregation, and over-permissive retrieval can let a routine question surface content that should never be visible. That is a governance and data-handling problem, not only an AI safety problem. NHI Management Group’s Ultimate Guide to NHIs notes that 97% of NHIs carry excessive privileges, which is exactly the kind of condition that makes retrieval layers overshare once access boundaries are loose.

Security teams get caught off guard because the system can behave “correctly” from a search perspective while still violating expected access boundaries. A user does not need to inject malicious text if the retriever is allowed to pull from a corpus that was never permission-filtered at ingest time. This is why the issue sits at the intersection of identity, authorization, and content governance, not just prompt hardening. The pattern is visible in incidents like the McKinsey AI platform breach and in broader industry reporting such as the OWASP Agentic AI Top 10, where data exposure emerges from system design choices more than overt attack strings.

In practice, many security teams discover the exposure only after an ordinary user has already retrieved content they should never have been able to find.

How It Works in Practice

RAG systems usually combine an ingestion pipeline, an embedding store, a retriever, and a generation layer. The exposure risk begins when those layers are treated as neutral plumbing rather than as access-controlled security boundaries. If documents are indexed broadly, chunked without sensitivity labeling, or stored in a vector database that ignores source permissions, the retriever can return legitimate matches from the wrong data domain. A user may ask a harmless question and receive a sensitive snippet because semantic similarity outranks authorization at query time.

Good practice is to apply authorization before retrieval, not after generation. That means the system should know which documents, chunks, or metadata partitions a given user is allowed to access, then constrain the retriever accordingly. For many environments, this is more reliable than trying to redact output after the fact. Current guidance suggests aligning this with least privilege, data classification, and explicit tenant boundaries, as reflected in NIST Cybersecurity Framework 2.0 and NHI governance guidance from Ultimate Guide to NHIs — Why NHI Security Matters Now.

Filter retrieval by user entitlements, tenant, and sensitivity label before any chunk is returned.

Bind each retrieved item to a source of truth, so access can be audited and revoked cleanly.

Separate public, internal, and restricted corpora rather than relying on one global index.

Log the retrieval set, not just the final answer, because the leak often happens in what was exposed to the model.

This approach is reinforced by the Guide to the Secret Sprawl Challenge, which shows how secrets and sensitive artifacts tend to spread across places that search systems can easily ingest. These controls tend to break down when legacy content repositories, shared drives, and permissive vector indexes are merged without a permission model that survives semantic search.

Common Variations and Edge Cases

Tighter retrieval controls often increase implementation overhead, requiring organisations to balance user experience against leakage prevention. That tradeoff is especially sharp in hybrid deployments, where a RAG system spans multiple business units, external knowledge bases, and fast-changing document stores. There is no universal standard for this yet, but best practice is evolving toward runtime authorization and document-level filtering rather than static “index once, trust forever” designs.

One common edge case is when the model is connected to a secure corpus but the embeddings were built from a broader source set. In that case, the index can still reveal the existence of sensitive material through semantic proximity, metadata, or answer phrasing. Another is cross-tenant retrieval in SaaS environments, where a shared backend and imperfect isolation can expose another customer’s content through normal search behavior. This is why a “no prompt injection” claim does not mean “no exposure risk.”

Practitioners should also watch for downstream leakage through citations, summaries, and cached conversations. Even if the retrieved chunk is not shown verbatim, enough context may be disclosed to expose names, projects, or operational details. The 52 NHI Breaches Analysis is a reminder that access mistakes usually compound when identities, permissions, and automation are coupled poorly. For teams assessing exposure controls, the right question is not only “can an attacker inject a prompt?” but also “what can a normal query discover that should have remained undiscoverable?”

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-03	RAG exposure often starts with overbroad identity privileges and weak secret boundaries.
NIST CSF 2.0	PR.AC-4	Retrieval should respect access permissions before content is surfaced.
NIST AI RMF		AI RMF covers data governance and misuse risk in AI-enabled systems.

Constrain retrieval identities to least privilege and rotate any secrets used by the pipeline.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Why do RAG systems create data exposure risk even without prompt injection?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group