Subscribe to the Non-Human & AI Identity Journal

How should organisations govern RAG-based AI workflows?

Organisations should govern retrieval sources, indexing pipelines, and embedding stores as part of the application’s trust chain. If malicious or low-quality content can enter retrieval, it can steer model outputs and downstream decisions. The practical test is whether the model can be influenced by content it should never have trusted in the first place.

Why This Matters for Security Teams

RAG workflows are often treated as a model feature, but the real security boundary is the retrieval path that feeds the model. If source documents, embeddings, and indexing jobs are not governed as production inputs, an attacker or careless contributor can shape what the model “knows” before a prompt is ever issued. That makes RAG a trust-chain problem, not just a model-quality problem. Current guidance from the NIST Cybersecurity Framework 2.0 supports managing this as part of asset, access, and data governance, while NHIMG research on the Top 10 NHI Issues shows how weak control over machine identities and secrets tends to widen the blast radius around AI systems.

The practical mistake is assuming retrieval is “just search.” In reality, retrieval pipelines often reach internal wiki pages, tickets, shared drives, object storage, and vector databases that are only loosely controlled. If those sources are poisoned, stale, over-permissioned, or indexed from compromised accounts, the model can produce confident but unsafe output. In practice, many security teams encounter RAG abuse only after a bad answer has already driven a user action or leaked internal data, rather than through intentional review of the retrieval chain.

How It Works in Practice

Governance should start by defining every system that can influence retrieval as part of the application’s trust boundary. That includes the source repository, the ingestion job, the chunking logic, the embedding pipeline, the vector store, and any reranking or citation layer. A useful control model is to classify each source by sensitivity and trust level, then restrict what can be indexed, who can modify it, and how often it is refreshed. NHIMG’s Ultimate Guide to NHIs — Lifecycle Processes for Managing NHIs is relevant here because the same lifecycle discipline used for machine identities should also apply to the services that fetch, transform, and store retrieval data.

Practically, the strongest RAG controls are a mix of data governance and workload identity:

  • Restrict ingestion to approved sources and signed or attested content where possible.
  • Use separate indexing pipelines for sensitive, public, and experimental corpora.
  • Apply least privilege to the retrieval service, not just the model runtime.
  • Log which documents influenced a response so reviewers can trace bad outputs back to source content.
  • Rotate secrets and service credentials used by ingestion, embedding, and retrieval jobs on short lifetimes.

For policy and control mapping, the NIST Cybersecurity Framework 2.0 is useful for asset inventory, access restriction, and monitoring, but it does not by itself solve prompt or retrieval poisoning. RAG governance also needs source validation, content review, and exception handling for untrusted data. These controls tend to break down when the retrieval layer aggregates many semi-trusted repositories because ownership is fragmented and no single team can confidently approve every upstream change.

Common Variations and Edge Cases

Tighter retrieval controls often increase operational overhead, requiring organisations to balance answer relevance against content approval latency. That tradeoff is real, especially in fast-moving environments where teams want fresh knowledge indexed quickly. Best practice is evolving, but there is no universal standard for how much retrieved content must be curated before it is safe to expose to an LLM. The right threshold depends on the business impact of a wrong answer and the sensitivity of the source corpus.

Edge cases deserve explicit treatment. Public documentation usually can be indexed with lighter controls, but internal playbooks, HR material, legal drafts, and incident notes should use stronger segregation and approval gates. If retrieval is connected to tools that can act on the answer, governance should be stricter because a poisoned document can trigger a downstream action, not just a misleading response. NHIMG’s DeepSeek breach is a useful reminder that exposed data and embedded secrets can quickly become a retrieval and model-training liability, while the Ultimate Guide to NHIs — Regulatory and Audit Perspectives helps frame evidence collection for audits.

Where organisations are still early in maturity, the safest pattern is to govern the highest-risk sources first, prove traceability, then expand coverage. That approach is usually more durable than trying to approve every document equally.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
NIST CSF 2.0 ID.AM RAG governance starts with knowing all data and service assets in the trust chain.
OWASP Non-Human Identity Top 10 NHI-03 Retrieval services rely on machine credentials that need short-lived, controlled access.
NIST AI RMF AI RMF addresses managing data quality, provenance, and operational risk in AI systems.

Issue least-privilege, short-lived credentials to indexing and retrieval workloads and rotate them regularly.