What do security teams get wrong about LLM-as-C2?

Why This Matters for Security Teams

LLM-as-C2 is easy to underestimate because the prompt looks harmless while the real risk sits in the execution path. Once an LLM can influence tasking, sequencing, or tool selection, the control plane moves from a static payload to an external decision service that can adapt in real time. That changes what defenders must watch: outbound connectivity, prompt and response flows, model endpoints, and the downstream actions triggered by the agent.

This is why guidance from OWASP Agentic AI Top 10 and the NIST AI Risk Management Framework matters here: both push teams to treat autonomous decisioning as a governance and monitoring problem, not just a content-safety issue. NHIMG research on the AI Agents: The New Attack Surface report shows why this is urgent: 80% of organisations report AI agents have already performed actions beyond their intended scope. In practice, many security teams encounter LLM-as-C2 only after the tooling has already been chained into an incident response blind spot or a lateral movement path.

How It Works in Practice

Security teams often miss that LLM-as-C2 is not defined by a single malicious binary. It is an execution design pattern in which the model is used to decide what happens next, where to go next, or which tool to call next. The adversary benefits from the model’s flexibility, because the command channel can be indirect, context-sensitive, and hard to signature. That is why static IOC matching often underperforms once the model begins varying phrasing, timing, or task flow.

Operationally, defenders need to correlate the application payload with the communication path. That means inspecting:

Whether the agent or payload can reach external model endpoints, proxy services, or retrieval systems.

Whether prompts, context, or tool outputs are being used to trigger follow-on actions.

Whether credentials, tokens, or session material are available for chained actions after the first instruction.

Whether egress controls and DNS telemetry reveal repeated low-and-slow coordination to a remote service.

The identity problem is just as important. For autonomous workloads, best practice is evolving toward workload identity and short-lived authorization, not long-lived static secrets. That aligns with the direction of the CSA MAESTRO agentic AI threat modeling framework and the NIST AI Risk Management Framework, which both emphasize runtime context and lifecycle risk. NHIMG’s OWASP NHI Top 10 also reinforces the need to govern non-human execution with the same rigor as privileged infrastructure. These controls tend to break down when the agent can rotate endpoints, change tool order, and re-task itself faster than analysts can triage the trace.

Common Variations and Edge Cases

Tighter egress control often increases operational friction, requiring organisations to balance containment against the need for legitimate model access and automation throughput. That tradeoff is especially visible when security teams need to support internal copilots, external hosted models, and remote orchestration in the same environment.

There is no universal standard for this yet, but current guidance suggests a layered approach: restrict outbound destinations, use policy-as-code for tool invocation, and issue ephemeral credentials only for the task at hand. In more mature environments, teams are adding request-time evaluation so the model’s intent is checked against policy before any privileged action executes. That is a better fit for autonomous systems than RBAC alone, because the model’s next step may not be predictable at design time.

Edge cases often appear in high-change environments such as incident response, red team tooling, and developer platforms where agents legitimately need broad reach. In those cases, defenders should prefer NIST AI 600-1 Generative AI Profile guidance on governance and monitoring, and pair it with the threat patterns documented in Analysis of Claude Code Security. The main exception is offline or air-gapped deployments, where the C2 risk shifts from remote command flow to local tool abuse and delayed exfiltration.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A10	Agentic abuse patterns include remote command shaping and tool chaining.
CSA MAESTRO		MAESTRO addresses threat modeling for autonomous agent workflows and control flow.
NIST AI RMF	GOVERN	LLM-as-C2 is a governance issue because the model can direct real-world actions.

Model every agent tool path, then constrain model-to-action transitions with policy checks.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

What do security teams get wrong about LLM-as-C2?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group