What do teams get wrong about command injection in AI tooling?

Why This Matters for Security Teams

Command injection in AI tooling is not just a defect in string handling. It becomes a trust-boundary failure when a model, agent, plugin, or helper script can shape a shell command, interpreter call, or automation step while inheriting delegated user or service authority. That is why this issue belongs in identity, privilege, and runtime control discussions alongside secure coding.

The risk is amplified in agentic workflows because tool use is dynamic, context-dependent, and often loosely reviewed. A prompt, filename, retrieval result, or document title can become an execution primitive if the surrounding system treats model output as safe command material. Current guidance from the NIST Cybersecurity Framework 2.0 still applies, but teams need to apply it to AI orchestration paths, not just traditional apps.

NHIMG research shows the same pattern across identity failures: in the Ultimate Guide to NHIs, the problem is consistently over-delegation without adequate control over what the workload can actually do. In practice, many security teams encounter command injection only after an assistant or automation has already executed an unintended action, rather than through intentional design review.

How It Works in Practice

Teams usually get this wrong by focusing on payload sanitisation alone. That helps, but it does not solve the real issue: AI tooling often chains natural language, structured data, and system commands in the same flow. If the model can influence arguments, flags, file paths, environment variables, or tool selection, then the effective attack surface is the entire execution path, not just the final shell line.

The practical fix is to reduce ambient authority and make command execution explicit. Treat the model as an untrusted decision source, then enforce a narrow execution contract around it. Use allowlisted commands, fixed templates, argument separation, and policy checks before any tool invocation. Where possible, replace shell execution with direct API calls. For higher-risk automation, issue short-lived credentials only for the specific task and revoke them immediately after completion.

Bind each tool call to a specific identity and purpose, rather than a general session token.

Use workload identity for the automation layer so the process proves what it is, not just what secret it holds.

Apply runtime policy checks before execution, not after logs are reviewed.

Keep command surfaces narrow by removing shell access from workflows that do not require it.

This aligns with the direction of least privilege in NIST guidance, but AI tooling needs an even tighter version because the command source can be probabilistic and adversarial at the same time. The same operational lesson appears in the DeepSeek breach: once AI systems absorb untrusted content or over-broad secrets, the blast radius grows quickly. These controls tend to break down when agents are allowed to compose arbitrary shell commands in CI/CD, support, or data-processing environments because output can be weaponised faster than human review can catch it.

Common Variations and Edge Cases

Tighter command controls often increase engineering overhead, requiring organisations to balance safer execution against developer speed and workflow flexibility. That tradeoff becomes more visible in agentic systems, where teams want broad autonomy but still need deterministic guardrails.

There is no universal standard for this yet, but current guidance suggests treating these cases differently based on execution risk. A low-risk read-only assistant may only need strict argument validation and command allowlists. A write-capable agent that can deploy code, rotate secrets, or move data should face stronger controls such as per-task credentials, scoped tokens, approval gates, and policy-as-code enforcement at request time. The key question is not whether the prompt was clean; it is whether the runtime can safely absorb a malicious or unexpected instruction without crossing privilege boundaries.

Edge cases also show up when teams assume container isolation alone is enough. It is not, if the container has network reach, mounted secrets, or cloud credentials that can be reused elsewhere. Another common mistake is logging full commands for observability, then exposing sensitive arguments or tokens in telemetry. Better practice is to log intent, outcome, and policy decision, while redacting command material that could be replayed. For the wider identity and secrets picture, NHIMG’s NHI market guidance and the State of Secrets in AppSec both reinforce the same operational reality: broad delegation and slow secret hygiene turn a single injection path into a systemic incident.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-04	Covers over-delegated NHI privileges that make injected commands dangerous.
OWASP Agentic AI Top 10	A-03	Addresses unsafe tool use and prompt-to-action execution in agentic systems.
NIST AI RMF		Supports governance of AI-driven operational risk and human oversight.

Treat every agent tool call as untrusted input and enforce allowlists plus runtime checks before execution.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

What do teams get wrong about command injection in AI tooling?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group