DockerDash exposes a context trust gap in MCP-driven AI assistants

By NHI Mgmt Group Editorial TeamPublished 2026-02-03Domain: Agentic AI & NHIsSource: Noma Security

TL;DR: DockerDash shows that a single malicious Docker metadata label can drive Ask Gordon from interpretation to tool execution through MCP, producing critical RCE in CLI environments and data exfiltration in Docker Desktop, according to Noma Security. The flaw turns contextual trust into an attack path, so AI-assisted operations now need validation, confirmation, and least-privilege controls.

At a glance

What this is: DockerDash is a prompt-injection flaw in Docker’s Ask Gordon AI that lets malicious image metadata flow into MCP tool execution without validation.

Why it matters: For IAM and NHI teams, it shows how autonomous assistants can turn untrusted context into privileged action unless tool use is explicitly governed.

By the numbers:

When AWS credentials are exposed publicly, attackers attempt access within an average of 17 minutes and as quickly as 9 minutes in some cases.
80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems, inappropriately sharing sensitive data, and revealing access credentials.

👉 Read Noma Security’s analysis of DockerDash and AI-assisted MCP execution risk

Context

MCP-driven assistants change the trust model because they can turn language, metadata, and tool access into execution paths. In this case, Docker image labels became the delivery mechanism for malicious instructions, which means NHI governance has to treat context as part of the attack surface, not just credentials and endpoints.

That matters because AI assistants increasingly sit between users and privileged systems, where a single unverified input can become a command. For IAM and NHI practitioners, the key question is no longer whether the assistant can answer correctly, but whether it can be prevented from acting on untrusted context in the first place.

Key questions

Q: How should security teams govern MCP-enabled AI assistants that can act on tools and data?

A: Treat MCP-enabled assistants as non-human identities with scoped authority, not as passive interfaces. Put a policy decision point between interpretation and execution, require explicit confirmation for privileged actions, and restrict which context sources the assistant may trust. Governance should focus on preventing unverified input from becoming executable intent.

Q: What is the difference between prompt injection and meta-context injection?

A: Prompt injection targets the model’s instructions directly, while meta-context injection hides malicious directives inside data that the model later consumes as context. The practical risk is similar, but the delivery layer is different. Meta-context injection is harder to spot because the payload can live in metadata, labels, or other fields that appear harmless.

Q: Why do read-only AI agents still create serious security risk?

A: Read-only agents can still expose secrets, topology, environment variables, and other sensitive operational data. In practice, disclosure often creates the same downstream risk as modification because attackers can use the information for lateral movement, credential theft, or targeted follow-on attacks. Security teams should govern read-only access as a data-exposure path.

Q: Should organisations require human approval before AI agents invoke privileged tools?

A: Yes, when the tool can modify systems, access secrets, or reach sensitive infrastructure. Human approval is a control boundary that prevents the model from turning untrusted context into immediate action. The approval should happen outside the model flow, with logging that records the command, the context, and the approver.

Technical breakdown

How meta-context injection works in MCP-based assistants

Meta-context injection is a prompt-injection pattern where attacker-controlled metadata is interpreted as instructions rather than data. In DockerDash, a Docker label is not just text in a container manifest. It becomes part of the assistant’s reasoning context, then passes through the MCP Gateway into tool invocation. The architectural flaw is the absence of a validation layer that separates descriptive content from executable intent. Once the assistant trusts that boundary, the model can be tricked into constructing a legitimate-looking request that downstream tools execute with real privileges.

Practical implication: Treat every context source as untrusted until it is normalized, validated, and policy-checked before tool execution.

Why MCP gateways become a trust bottleneck

The MCP Gateway is the control point that translates assistant intent into tool calls, so it becomes the enforcement layer that either constrains or amplifies risk. If the gateway assumes that upstream output is already authorized, it inherits the assistant’s error and forwards it to tools without re-evaluating intent. That is especially dangerous in AI agent workflows because the assistant can chain reasoning, retrieval, and execution in one flow. The failure is not the protocol itself, but the misuse of the protocol as if context were equivalent to authorization.

Practical implication: Place authorization checks at the gateway boundary and require explicit policy validation before any MCP tool call.

Why read-only access still creates exfiltration risk

Read-only permissions reduce direct modification risk, but they do not eliminate reconnaissance or disclosure. In DockerDash, the Desktop path shows that an assistant with read access can still enumerate containers, environment variables, networks, and mounted volumes, then package that information for exfiltration. That is a classic NHI problem: an identity may be non-destructive and still be highly sensitive because visibility itself can expose secrets, topology, and future attack paths. Security teams should not equate read-only with low risk when AI agents can aggregate and transmit what they read.

Practical implication: Classify read-only agent access as a potential data-loss path and monitor it with the same rigor as write-capable access.

Threat narrative

Attacker objective: The attacker wants to convert trusted AI-assisted workflow execution into unauthorized command execution or data theft through the assistant’s own tool chain.

Entry occurs when an attacker embeds malicious instructions in a Docker image label that Ask Gordon later reads as context.
Escalation happens when the assistant forwards the parsed instruction to the MCP Gateway without distinguishing metadata from user-authorized intent.
Impact occurs when MCP tools run with the victim’s privileges, enabling container termination in CLI environments or sensitive data exfiltration in Desktop environments.

MongoBleed breach — MongoBleed exposed secrets across 87K MongoDB servers.
IOS app secrets leakage report — iOS apps leaking hardcoded secrets and credentials endangering user privacy.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

Meta-context injection creates a new NHI trust problem, not just a prompt-injection problem. The issue is that contextual data can now function as an access path when an assistant is allowed to interpret it and act on it. That shifts the governance burden from content filtering alone to full context validation, policy enforcement, and tool authorization. Practitioners should assume metadata can behave like code when the assistant has execution authority.

Protocol mediation is only safe when the gateway re-checks intent. A tool gateway that trusts the assistant’s output is effectively treating model inference as authorization, which is a weak control boundary. That is particularly risky in agentic workflows where the assistant can move from retrieval to action in one session. Practitioners should insist on a separate policy decision point between interpretation and execution.

Read-only AI access still belongs in the NHI risk register. The Desktop scenario shows that confidentiality loss can be the primary impact even when direct system changes are blocked. That means teams must govern AI assistants as identities with disclosure potential, not just as automation channels. Practitioners should review read-only agent permissions with the same discipline they use for service accounts that can expose secrets or topology.

Ephemeral context trust debt is the right way to describe this failure mode. Every time an assistant consumes unverified metadata, it accumulates trust it has not earned, and that debt is later paid through command execution or disclosure. The more deeply the assistant is embedded in operational workflows, the larger that debt becomes. Practitioners should reduce trust debt by narrowing context, confirming intent, and constraining tool reach.

From our research:
80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems, inappropriately sharing sensitive data, and revealing access credentials, according to AI Agents: The New Attack Surface report.
Only 44% of organisations have implemented policies to govern AI agents, even though 92% agree governance is critical to enterprise security.
That governance gap is why teams should also study OWASP NHI Top 10 alongside operational guidance on agent privilege and tool-use controls.

What this signals

Ephemeral context trust debt is now a programme-level issue. If an AI assistant can convert metadata into tool execution, then every unverified input source adds latent risk that accumulates until an operational event exposes it. The right response is not more trust in the model, but less implicit trust in the pipeline, backed by policy enforcement and tool scoping.

With 98% of companies planning to deploy more AI agents in the next 12 months, per AI Agents: The New Attack Surface report, the governance challenge is scaling faster than human review can keep up. Teams should prepare for a future where assistants touch more systems than users do, which makes identity boundaries and approval gates central to control design.

For practitioners

Validate context before tool execution Require the assistant to pass untrusted metadata through a policy layer that classifies labels, descriptions, and other fields before any MCP call is allowed. Use explicit allowlists for what the model can treat as instructions, and reject ambiguous context by default.
Require human confirmation for privileged MCP actions Break the execution chain with confirmation for state-changing commands, especially where an assistant can reach Docker, Kubernetes, or cloud control planes. Keep the approval step independent from the model so the assistant cannot self-approve.
Separate read-only access from low-risk access Inventory which MCP tools expose environment variables, volume mappings, network details, and image metadata, then classify those tools as disclosure-capable even when they cannot modify systems. Apply least privilege to information access, not only to writes.
Audit assistant-to-tool trust boundaries Map where AI output becomes executable action, then test those boundaries for injection, replay, and privilege escalation paths. Use the OWASP Non-Human Identity Top 10 to align the review with NHI identity and credential risk.
Monitor AI-assisted workflows for anomalous disclosure Log which agents query container metadata, which tools they call, and what data leaves the environment. Correlate those events with secrets exposure, since the same workflow that reads operational context can also surface credentials and topology.

Key takeaways

DockerDash shows that untrusted metadata can become executable context when AI assistants are allowed to infer and act without validation.
AI agent risk is already visible in the field, with most organisations reporting at least one instance of behaviour beyond intended scope.
The practical response is to separate interpretation from execution, then enforce policy, confirmation, and least privilege at the tool boundary.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and OWASP Agentic AI Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-01	Context-to-command abuse maps to untrusted non-human identity inputs.
OWASP Agentic AI Top 10		Agentic tool misuse and goal hijacking are central to the attack chain.
NIST CSF 2.0	PR.AC-4	Least-privilege access applies to read-only and write-capable agent actions.

Place policy checks between model output and tool execution, and require human approval for privileged actions.

Key terms

Meta-Context Injection: A prompt-injection technique that hides malicious instructions inside metadata or other contextual fields the AI later treats as meaningful input. The risk is not limited to text prompts. Any data source that the model consumes can become a delivery path for unintended actions if context is not validated.
MCP Gateway: The control layer that relays assistant intent to tools and data sources through the Model Context Protocol. In practice, it becomes a policy boundary, not just a transport layer. If it trusts model output too early, it can turn unverified reasoning into real-world execution or disclosure.
Non-Human Identity: A non-human identity is any machine account, token, service credential, workload, bot, or AI agent that can access systems or data. These identities can be just as risky as human users because they often have broad privileges, limited oversight, and weak lifecycle controls.

Deepen your knowledge

MCP trust boundaries and AI agent governance are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for agentic workflows that can read and execute across tools, it is worth exploring.

This post draws on content published by Noma Security: DockerDash and the AI assistant execution-chain flaw in Docker Ask Gordon. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-02-03.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org