Subscribe to the Non-Human & AI Identity Journal

Vector Database Exposure

The risk created when a vector database is reachable from untrusted networks or embedded in sensitive AI workflows. Because these services often sit near prompts, embeddings, and credentials, a compromise can expose more than data, including the runtime and adjacent secrets.

Expanded Definition

vector database exposure is not just “a database left online.” In NHI and Agentic AI environments, it means the retrieval layer that stores embeddings, metadata, or similarity indexes can be reached by untrusted actors, or can be queried from a workflow that already has excessive runtime privileges. Because vector stores often sit beside prompt pipelines, tool calls, and secret-bearing services, exposure can become a bridge into broader AI infrastructure. Definitions vary across vendors, but the security meaning is consistent: if the vector layer is reachable without strong network segmentation, identity controls, and query restrictions, it becomes part of the attack surface. The risk is especially acute when embeddings encode sensitive documents or customer data, because “non-readable” does not mean non-sensitive. For identity design, this aligns with Zero Trust Architecture principles in NIST SP 800-207, where every access path must be treated as untrusted until verified.

The most common misapplication is treating the vector database as a low-risk internal datastore, which occurs when teams expose it on a broad network segment and assume its content is harmless because it contains embeddings rather than raw records.

Examples and Use Cases

Implementing vector search rigorously often introduces latency and operational friction, requiring organisations to weigh fast retrieval and developer convenience against stricter network and identity controls.

  • A customer-support assistant uses a vector database to retrieve policy snippets. If the index is publicly reachable, an attacker can enumerate sensitive support content and infer internal procedures.
  • An agentic workflow stores product documentation embeddings in a self-hosted cluster. Without service identity checks, a compromised service account can query private materials and expand access beyond the intended application path. That pattern is visible across incidents discussed in the 52 NHI Breaches Analysis.
  • A retrieval-augmented generation system uses a managed vector service and secret-backed API access. If the runtime environment is over-permissioned, exposure can cascade from the index to adjacent credentials, echoing the secret sprawl dynamics in the Guide to the Secret Sprawl Challenge.
  • A product team hosts embeddings for regulated documents and assumes encryption alone is enough. In practice, query abuse and weak authentication can still reveal sensitive relationships, similar to the access-path failures documented in the MongoBleed breach.
  • Security teams test retrieval endpoints against the Zero Trust Architecture model, requiring explicit authentication, least privilege, and continuous verification before any query is answered.

Why It Matters in NHI Security

Vector database exposure matters because the compromise path rarely stops at the index. In AI systems, retrieval layers often sit near application secrets, service accounts, and orchestration components, so a single exposed endpoint can become an entry point for data theft, prompt manipulation, or runtime abuse. NHI Management Group research shows that 79% of organisations have experienced secrets leaks, with 77% causing tangible damage, which helps explain why overexposed AI infrastructure is not a theoretical concern. The same research also shows that 96% of organisations store secrets outside of secrets managers in vulnerable locations, making adjacent compromise more likely when a vector service is poorly isolated. A mature NHI program therefore treats the vector store as a governed identity-adjacent asset, not just a data platform. This is also consistent with the Anthropic report on AI-orchestrated cyber espionage, which underscores how agentic workflows can be abused when trusted execution paths are reachable.

Organisations typically encounter vector database exposure only after an incident review shows that a prompt pipeline, embedding store, or service credential was reachable from an untrusted path, at which point the term becomes operationally unavoidable to address.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and OWASP Agentic AI Top 10 address the attack and risk surface, while NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-01 Covers exposed NHI surfaces and weak access paths around AI-adjacent services.
OWASP Agentic AI Top 10 A-03 Addresses agent tool and data access paths that can be abused through retrieval layers.
NIST Zero Trust (SP 800-207) SC-7 Zero Trust requires explicit trust decisions for every network path to the vector database.

Bind agent queries to least-privilege retrieval scopes and monitor for abnormal access patterns.