TL;DR: Exposed AI endpoints now behave like non-human identities with direct cost, data, and lateral-movement exposure, not just application risk, as Pillar Security says its honeypots captured 35,000 attack sessions over two months while attackers scanned exposed AI infrastructure, validated endpoints, and resold access through a criminal marketplace, and MCP servers created pivot paths into internal systems.
At a glance
What this is: This is a research post on Operation Bizarre Bazaar, a commercial LLMjacking campaign that targeted exposed LLM and MCP endpoints at scale.
Why it matters: It matters because AI endpoints, MCP servers, and the identities behind them now need the same governance discipline IAM teams apply to other non-human access paths, including discovery, authentication, and exposure control.
👉 Read Pillar Security's analysis of Operation Bizarre Bazaar and exposed AI endpoint abuse
Context
LLMjacking is unauthorized use of exposed LLM infrastructure for compute theft, data access, and downstream pivoting. In this case, the primary problem is not model quality but identity and exposure control: publicly reachable AI endpoints and MCP servers were reachable without the access boundaries IAM teams expect.
For identity programmes, the important shift is that AI endpoints now behave like a non-human access surface with commercial value to attackers. If discovery, authentication, and rate limits are weak, the problem quickly moves from abuse to credential harvesting, data exposure, and internal system reachability.
Key questions
Q: How should security teams handle exposed AI endpoints in production?
A: Treat exposed AI endpoints as governed non-human identities, not convenience services. Require authentication, apply rate limiting, and confirm that the endpoint cannot reach internal systems without explicit delegation. Then continuously scan the external attack surface so any public AI service is detected before hostile discovery tools index it.
Q: Why do MCP servers increase lateral movement risk?
A: MCP servers increase lateral movement risk because they connect models to files, databases, shells, and APIs through delegated permissions. If that bridge is publicly reachable or weakly scoped, attackers can use the integration path to reach systems the model itself should never expose. The risk is trust expansion through delegation.
Q: What breaks when AI endpoints have no authentication?
A: When AI endpoints have no authentication, attackers can probe model behaviour, run unauthorized inference, and use the service as a paid compute resource at the defender's expense. The same exposure can also leak context, prompts, or internal integration details that help the attacker move deeper into the environment.
Q: Who is accountable when an exposed MCP server is used to reach internal systems?
A: Accountability sits with the team that owns the delegated access path, the network exposure, and the identity controls around the server. In practice, that means security, platform, and application owners must all understand whether the MCP trust boundary is intentionally public or accidentally exposed.
Technical breakdown
Exposed LLM endpoints and unauthenticated access
Self-hosted LLM services such as Ollama and vLLM often expose HTTP APIs on predictable ports. If those endpoints accept requests without strong authentication, attackers can submit inference jobs, enumerate model behaviour, and test for weakly protected interfaces. The risk is not limited to API cost inflation. A reachable model endpoint may also disclose system prompts, chat context, and integration details that help attackers shape follow-on activity. In practice, the security boundary is the identity layer around the model, not the model itself.
Practical implication: require authentication on every externally reachable LLM endpoint and treat open model APIs as exposed identity surfaces.
MCP servers as pivot points into internal systems
Model Context Protocol servers connect LLMs to files, databases, shell access, and external APIs. That makes them a bridge layer, not a harmless integration shim. When an MCP server is internet-facing or poorly restricted, an attacker may use it to read source code, query databases, or reach cloud and collaboration tools through the permissions already attached to the integration. The danger is delegation without containment: a single exposed control point can amplify into broad internal access if the server is trusted by design.
Practical implication: isolate MCP servers from public reach, and scope their delegated access as if they were privileged service identities.
Commercial resale turns abuse into a supply chain
The article describes a scanner, a validator, and a marketplace operating in sequence. That matters because it shows LLMjacking is no longer only opportunistic abuse. Exposed endpoints can be discovered automatically, validated quickly, and monetized through resale of unauthorized access. In identity terms, the attacker is not breaking the model so much as exploiting weakly governed access paths that already exist. Once an endpoint is listed in scan results, the window to abuse can be very short, which raises the value of continuous exposure management over periodic review.
Practical implication: pair external attack-surface monitoring with immediate exposure suppression for any AI endpoint that should not be public.
Threat narrative
Attacker objective: The attacker aims to monetise unauthorized access to AI infrastructure through resale, while also creating paths for data theft and internal system pivoting.
- Entry begins with distributed scanning of exposed AI infrastructure, including public LLM endpoints and accessible MCP servers, until a target responds to unauthenticated or weakly authenticated requests.
- Credential access and abuse follow when attackers test placeholder keys, enumerate model capabilities, and use accepted access paths to run inference, extract context, or validate resale value.
- Impact comes from unauthorized compute consumption, data exposure from context windows, and pivot opportunities through MCP integrations into internal systems.
Breaches seen in the wild
- LiteLLM PyPI package breach — LiteLLM PyPI supply chain attack, credentials stolen from users.
- Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
LLMjacking is now an identity governance problem, not just an application abuse problem. The article shows that exposed AI endpoints can be discovered, validated, and monetised at industrial scale. That means the governance question is no longer whether a model can answer a request, but whether the endpoint is a controlled non-human identity with enforceable boundaries. Practitioners should treat externally reachable AI services as governed access paths, not experimental infrastructure.
Exposed AI endpoints collapse the assumption that public reachability is harmless if the workload is non-human. That assumption was designed for low-value services with limited blast radius. It fails when the actor can generate expensive inference, leak conversation context, and bridge to internal systems through MCP integrations. The implication is that identity programmes must rethink what counts as privileged access when the interface itself can become a monetised attack surface.
MCP introduces a broader identity blast radius because delegated tool access is only as safe as the server that mediates it. Once an MCP server is trusted to reach files, databases, or cloud APIs, its compromise or exposure turns that trust into a lateral-movement path. This is exactly where NHI governance and workload segmentation intersect. Practitioners should treat MCP trust as a scoped delegation problem with clear containment requirements.
The commercialisation of AI abuse changes defender priorities from cleanup to continuous exposure reduction. The scanner, validator, marketplace sequence means attackers are building pipelines around exposed AI infrastructure. That is a structural signal for the market: discovery, authentication, and runtime guardrails are converging into one control surface for AI identity. Teams that still manage these elements separately will struggle to contain abuse once their endpoints are indexed by hostile tooling.
OWASP NHI and agentic AI guidance both become relevant when LLM platforms start acting like services with durable permissions. The article's attack chain is not about model intelligence, it is about credentials, delegated access, and tool reach. That is why the right control conversation spans exposed service identity, attack-surface reduction, and tool-abuse monitoring. Practitioners should align governance to the access path, not the marketing label on the workload.
From our research:
- 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems, according to AI Agents: The New Attack Surface.
- Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete compliance and breach-investigation blind spot, according to AI Agents: The New Attack Surface.
- That visibility gap is why teams should also review Top 10 NHI Issues when AI endpoints begin behaving like durable non-human identities.
What this signals
Identity blast radius: once an AI endpoint can authenticate, call tools, and reach internal systems, its risk profile looks closer to a privileged workload than a chatbot. That is the governance shift here, and it is why discovery and delegated-access reviews need to move together rather than sit in separate programmes.
With 92% of organisations agreeing that governing AI agents is critical but only 44% having implemented policies, the operational gap is already wider than the tooling gap, according to AI Agents: The New Attack Surface. Teams should expect similar pressure wherever models are tied to reusable credentials or public endpoints.
For readers building programme controls, the practical next step is to align AI endpoint discovery with non-human identity governance and runtime guardrails. The attack pattern described in this post is less about a single vulnerable model and more about unmanaged access paths becoming monetisable infrastructure.
For practitioners
- Inventory all externally reachable AI endpoints Scan for Ollama, vLLM, OpenAI-compatible APIs, and MCP servers that are reachable from the internet or from untrusted network zones. Classify each endpoint by authentication state, data access, and whether it can reach internal systems through delegated tools.
- Enforce authentication on every model-facing interface Require valid credentials for inference endpoints, model gateways, and administrative panels. Eliminate anonymous access, placeholder API keys, and default credentials before exposing any service to production traffic.
- Restrict MCP server trust to the minimum delegated scope Separate MCP servers from public networks, limit their file, database, and API privileges, and review every integration that allows a model to reach internal systems. Treat each server as a privileged service identity with explicit boundaries.
- Monitor for AI-specific abuse patterns Alert on placeholder key strings, multi-provider enumeration, bursty inference requests, and repeated capability probing from a single source. Pair those detections with rate limits and automated blocking for hostile scan activity.
Key takeaways
- Operation Bizarre Bazaar shows that exposed AI endpoints can be turned into a commercial attack supply chain, not just abused opportunistically.
- The reported 35,000 attack sessions underscore that public AI infrastructure is already a high-volume target, especially when authentication is weak or missing.
- Authentication, exposure control, and scoped delegation are the controls that most directly limit the blast radius of LLMjacking and MCP pivoting.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Non-Human Identity Top 10 and OWASP Agentic AI Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Non-Human Identity Top 10 | NHI-03 | Exposed AI endpoints and credential abuse map to non-human identity exposure and weak lifecycle controls. |
| OWASP Agentic AI Top 10 | MCP tool abuse and agent-adjacent access paths fit agentic AI threat modelling. | |
| NIST CSF 2.0 | PR.AC-4 | Least-privilege access and access governance are central to preventing unauthorized AI endpoint abuse. |
Inventory AI service identities, remove anonymous access, and enforce lifecycle controls on every exposed endpoint.
Key terms
- LLMjacking: LLMjacking is unauthorized use of large language model infrastructure for someone else’s compute, data, or internal access paths. The attacker does not need to compromise the model’s intelligence. They exploit exposed or weakly governed access so the model becomes a paid service for the attacker and a risk surface for the defender.
- Model Context Protocol server: A Model Context Protocol server mediates connections between a model and tools such as files, databases, APIs, or shell access. In governance terms, it is a delegated access layer, which means its exposure, authentication, and privilege scope determine whether it becomes a safe integration point or a lateral-movement bridge.
- Identity blast radius: Identity blast radius is the amount of damage that can flow from one compromised or overexposed identity. For AI and other non-human identities, the blast radius includes data access, tool reach, compute cost, and downstream system access. The more delegated the identity, the more important containment becomes.
- Exposed AI endpoint: An exposed AI endpoint is any model interface reachable without adequate network or identity controls. It may accept inference requests, administrative commands, or tool calls from outside the intended boundary. In practice, exposure turns a workload into a target for abuse, resale, and pivoting.
Deepen your knowledge
AI endpoint exposure and MCP trust boundaries are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If your team is now governing model-facing services as non-human identities, the course is a practical place to start.
This post draws on content published by Pillar Security: Operation Bizarre Bazaar, the first attributed LLMjacking campaign with commercial marketplace monetization. Read the original.
Published by the NHIMG editorial team on 2026-01-28.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org