ChromaDB pre-auth RCE in AI pipelines: are your controls ready?

NHI Mgmt Group

(@nhi-mgmt-group)

Member Moderator

Joined: 1 year ago

Posts: 12212

Topic starter 09/06/2026 11:42 pm

TL;DR: ChromaDB CVE-2026-45829 allows unauthenticated remote code execution through the Python FastAPI server’s auth flow, affecting versions 1.0.0 through 1.5.8 and leaving roughly 73% of internet-exposed instances vulnerable, according to Orca Security. The issue shows why AI retrieval backends must be treated as executable infrastructure, not passive data stores.

NHIMG editorial — based on content published by Orca Security: ChromaDB CVE-2026-45829 and pre-authentication code execution in AI retrieval backends

By the numbers:

Approximately 73% of internet-exposed ChromaDB instances are running vulnerable versions according to Shodan-based scanning data.
ChromaDB is used in approximately 14 million monthly PyPI downloads, giving the flaw a substantial enterprise footprint.

Questions worth separating out

Q: What fails when a vector database can execute code before authentication?

A: The trust boundary fails because the service can run attacker-controlled code before it verifies who sent the request.

Q: Why do internet-facing AI retrieval services create outsized risk?

A: They often sit close to prompts, embeddings, API keys, and orchestration credentials, so one runtime compromise can expose more than the service’s own data.

Q: How do security teams know whether an AI backend is safe to expose publicly?

A: A safe exposure decision depends on more than patch level.

Practitioner guidance

Block pre-auth code evaluation paths Review every endpoint that accepts model references, embedding functions, or other remote artifacts, and ensure nothing is resolved or executed until authorization is complete.
Treat remote model sources as untrusted code Scan model artifacts before runtime execution, disable trust_remote_code where possible, and require explicit provenance checks for any external model reference used by a retrieval service.
Remove internet exposure from Python AI backends Restrict API ports to trusted clients only, place the service behind network controls, and avoid public exposure for workloads that can download or execute code dynamically.

What's in the full article

Orca Security's full research covers the operational detail this post intentionally leaves for the source:

Version-specific exposure checks for ChromaDB 1.0.0 through 1.5.8 in cloud and on-prem environments
Exploit workflow details for the malicious HuggingFace model reference path and FastAPI request handling
Practical mitigation options including switching to the Rust-based frontend and tightening API network access
Asset-context prioritization logic for internet reachability, runtime reachability, and criticality

👉 Read Orca Security's analysis of ChromaDB CVE-2026-45829 and AI pipeline exposure →

ChromaDB pre-auth RCE in AI pipelines: are your controls ready?

Explore further

View Full Forum → | NHI Foundation Course →

Quote

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11787

10/06/2026 2:08 am

Pre-auth execution in AI backends turns an access-control failure into an identity failure: This vulnerability works because the server evaluates untrusted model configuration before it verifies the caller. That means the auth boundary is placed after the execution boundary, which is a structural failure of trust sequencing rather than a simple missing control. For practitioners, the lesson is that AI retrieval infrastructure cannot be assumed to be inert data plumbing when it can execute remote code as part of request processing.

A few things that frame the scale:

85% of organisations lack full visibility into third-party vendors connected via OAuth apps, according to The State of Non-Human Identity Security.
Only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, compared to nearly 1 in 4 for securing human identities.

A question worth separating out:

Q: Who is accountable when a pre-authentication RCE affects an AI service?

A: Accountability usually spans application owners, platform teams, and cloud operators because the failure sits across request handling, deployment design, and network exposure. Governance frameworks should assign ownership for runtime execution paths, not just patching, because the key question is who approved an architecture that can execute untrusted input before authentication.

👉 Read our full editorial: ChromaDB pre-auth RCE exposes AI pipelines to full server compromise

ReplyQuote

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11787

11/06/2026 1:12 am

Pre-auth execution in AI backends turns an access-control failure into an identity failure: This vulnerability works because the server evaluates untrusted model configuration before it verifies the caller. That means the auth boundary is placed after the execution boundary, which is a structural failure of trust sequencing rather than a simple missing control. For practitioners, the lesson is that AI retrieval infrastructure cannot be assumed to be inert data plumbing when it can execute remote code as part of request processing.

A few things that frame the scale:

85% of organisations lack full visibility into third-party vendors connected via OAuth apps, according to The State of Non-Human Identity Security.
Only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, compared to nearly 1 in 4 for securing human identities.

A question worth separating out:

Q: Who is accountable when a pre-authentication RCE affects an AI service?

A: Accountability usually spans application owners, platform teams, and cloud operators because the failure sits across request handling, deployment design, and network exposure. Governance frameworks should assign ownership for runtime execution paths, not just patching, because the key question is who approved an architecture that can execute untrusted input before authentication.

👉 Read our full editorial: ChromaDB pre-auth RCE exposes AI pipelines to full server compromise

ReplyQuote

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11787

12/06/2026 2:47 am

Pre-auth execution in AI backends turns an access-control failure into an identity failure: This vulnerability works because the server evaluates untrusted model configuration before it verifies the caller. That means the auth boundary is placed after the execution boundary, which is a structural failure of trust sequencing rather than a simple missing control. For practitioners, the lesson is that AI retrieval infrastructure cannot be assumed to be inert data plumbing when it can execute remote code as part of request processing.

A few things that frame the scale:

85% of organisations lack full visibility into third-party vendors connected via OAuth apps, according to The State of Non-Human Identity Security.
Only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, compared to nearly 1 in 4 for securing human identities.

A question worth separating out:

Q: Who is accountable when a pre-authentication RCE affects an AI service?

A: Accountability usually spans application owners, platform teams, and cloud operators because the failure sits across request handling, deployment design, and network exposure. Governance frameworks should assign ownership for runtime execution paths, not just patching, because the key question is who approved an architecture that can execute untrusted input before authentication.

👉 Read our full editorial: ChromaDB pre-auth RCE exposes AI pipelines to full server compromise

ReplyQuote