Subscribe to the Non-Human & AI Identity Journal

Notifications
Clear all

AI coding agent sandboxes: are your controls keeping up?


(@nhi-mgmt-group)
Member Moderator
Joined: 1 year ago
Posts: 3789
Topic starter  

TL;DR: AI coding agents routinely process untrusted code and content, and Pillar Security’s analysis of 14 sandbox solutions shows every isolation tier has a failure mode, from containers and user-space kernels to microVMs and kernel-enforced controls. Isolation contains blast radius, but only if teams understand what they are isolating from and what credentials are mounted inside the sandbox.

NHIMG editorial — based on content published by Pillar Security: Your AI Agent Will Run Untrusted Code. Now What?

By the numbers:

Questions worth separating out

Q: How should security teams handle credentials inside AI coding agent sandboxes?

A: Security teams should assume any credential visible to an AI coding agent is usable for theft, leakage, or lateral movement.

Q: Why do AI coding agents make sandbox design an IAM issue?

A: AI coding agents are useful only because they can execute with meaningful access, which means the identity and access decisions made before runtime directly shape the blast radius.

Q: What breaks when sandboxing relies only on command allowlists?

A: Allowlists fail when context is poisoned, because a command that looks safe in isolation can become dangerous after environment variables, inherited shell state, or prior execution steps modify the session.

Practitioner guidance

  • Inventory every credential mounted into agent sandboxes List environment variables, secret files, cloud tokens, and service-account material that an AI coding agent can read during execution.
  • Choose the isolation tier from the threat model inward Select containers, user-space kernels, microVMs, or kernel-enforced controls based on the untrusted source, the value of the data inside the sandbox, and the blast radius you can tolerate.
  • Separate command control from context control Combine allowlists with restrictions on inherited environment state, filesystem read paths, and network egress.

What's in the full article

Pillar Security's full blog post covers the operational detail this post intentionally leaves for the source:

  • The four isolation tiers compared side by side, including containers, user-space kernels, microVMs, and kernel-enforced capabilities.
  • Specific product examples and configuration patterns for Claude Code, Cursor, Gemini CLI, and other coding agents.
  • The detailed failure modes behind full filesystem read access, allowlist bypass, and environment-variable poisoning.
  • Threat-model questions for choosing between latency, session limits, and stronger host isolation.

👉 Read Pillar Security's analysis of AI coding agent sandbox failures →

AI coding agent sandboxes: are your controls keeping up?

Explore further

View Full Forum →  |  NHI Foundation Course →



   
Quote
Share: