Subscribe to the Non-Human & AI Identity Journal

What is LLMjacking in an IAM context?

LLMjacking is the abuse of AI model access through stolen non-human identities such as API keys and cloud credentials. In practice, it is an NHI governance problem because the attacker is not stealing a login session, but using valid machine access to enumerate services, invoke models, and create cost or data exposure.

Why LLMjacking Is an IAM Problem, Not Just an AI Problem

llmjacking succeeds because attackers rarely need to break the model itself. They look for the non-human identities that already have permission to call model endpoints, access orchestration layers, or reach adjacent cloud services. That makes this an IAM and NHI governance issue: long-lived API keys, over-privileged service accounts, and weak secret handling become the real entry point. Current guidance from OWASP Agentic AI Top 10 and NIST AI Risk Management Framework points to the same pattern: control the identity, the entitlement, and the runtime context, or the AI layer becomes a high-value abuse target.

The practical risk is not only unauthorized inference. Once a valid NHI is hijacked, an attacker can enumerate models, pivot into connected tools, drain quota, exfiltrate prompts or outputs, and sometimes reach data stores or CI/CD systems that were never meant to be part of the AI workflow. That is why the issue belongs alongside OWASP NHI Top 10 and AI LLM hijack breach analysis, not in a narrow application security bucket. In practice, many security teams encounter LLMjacking only after unexpected cloud spend, anomalous model calls, or data-access alerts have already surfaced.

How LLMjacking Works in Practice

Attackers usually begin with exposed secrets, stolen CI/CD credentials, compromised developer endpoints, or reused service-account tokens. The difference from classic account takeover is that the stolen credential often belongs to a machine identity with direct tool access rather than a human user with interactive controls. From there, the adversary can test model permissions, enumerate APIs, and chain adjacent services if the same identity is trusted too broadly. Research on compromised AI credentials in the Moltbook AI agent keys breach and DeepSeek breach shows how quickly secrets exposure turns into operational abuse.

Defence needs to focus on runtime identity controls, not only perimeter blocks. That usually means:

  • Using workload identity instead of shared static keys, so the agent or service proves what it is at request time.
  • Issuing JIT credentials with short TTLs, then revoking them automatically when the task ends.
  • Applying intent-based authorisation so policy is evaluated against what the workload is trying to do, not just which role it carries.
  • Separating model access from downstream system access, so a hijacked NHI cannot freely pivot.
  • Monitoring token use, quota spikes, unusual tool chaining, and high-frequency prompts as abuse indicators.

This aligns with the implementation direction in NIST AI 600-1 Generative AI Profile and the agent controls discussed in OWASP Agentic Applications Top 10. For broader context on how exposed credentials are abused, Entro Security reports that when AWS credentials are exposed publicly, attackers attempt access within an average of 17 minutes, and as quickly as 9 minutes in some cases. These controls tend to break down when legacy integrations require shared keys across multiple systems because revocation and scoping become operationally slow.

Common Variations and Edge Cases

Tighter credential controls often increase operational overhead, requiring organisations to balance faster delivery against stronger runtime discipline. That tradeoff is especially visible in AI environments where model gateways, orchestrators, plugins, and downstream APIs were added at different times and never designed around a single identity model.

One common edge case is an agentic workflow that is technically non-interactive but still highly autonomous. In that setting, role-based access control alone is too static. Best practice is evolving toward context-aware policy decisions, but there is no universal standard for this yet. That is why current guidance from NIST AI 600-1 Generative AI Profile and Anthropic — first AI-orchestrated cyber espionage campaign report emphasises runtime review, not trust by default.

Another edge case is cost abuse without direct data theft. A hijacked NHI may stay within its nominal permissions while repeatedly invoking expensive models, burning budget, or degrading service levels. The issue is still security-relevant because it exploits the same identity trust path. For teams building mature controls, the most useful references are the Ultimate Guide to NHIs — 2025 Outlook and Predictions and the LiteLLM PyPI package breach, because they show how software supply chain and identity hygiene intersect. In environments with many third-party connectors, these controls often fail because the weakest secret handling practice becomes the easiest path to model abuse.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A2 LLMjacking abuses over-scoped agent permissions and tool access.
CSA MAESTRO T1 Addresses autonomous workload identity and task-bound authorization.
NIST AI RMF GOVERN LLMjacking exposes governance gaps in AI identity and accountability.

Constrain agent tools and evaluate each action against runtime policy before execution.

Related resources from NHI Mgmt Group