How should teams govern AI assistants that can run migrations and execute code?

Why This Matters for Security Teams

AI assistants that can run migrations or execute code are not just copilots, they are privileged automation with judgment delegated to a model. That changes the risk from “bad advice” to “bad action.” Traditional approval gates, ticketing, and developer trust do not hold up when a system can chain tool calls, modify infrastructure, or write directly into production-like environments. The right control lens is governed NHI, not generic chatbot policy.

Current guidance from NIST Cybersecurity Framework 2.0 points teams toward asset clarity, access control, logging, and recovery, but AI assistants add an operational twist: the tool invocation is the attack surface. NHIMG’s Top 10 NHI Issues research highlights how identity, secrets, and lifecycle gaps routinely appear together when automation is granted broad access without tight governance.

In practice, many security teams encounter privilege misuse only after an assistant has already executed an unsafe migration or touched data it was never intended to reach.

How It Works in Practice

Governance should start by separating three functions: code generation, change approval, and runtime execution. The assistant may propose a migration plan or generate a script, but it should not inherit the ability to execute that script automatically. Execution should occur only through a controlled runtime identity, a bounded environment, and a policy decision made at request time.

For most teams, that means treating the assistant as a workload identity with narrowly scoped permissions, not as a user account with broad standing access. Use short-lived credentials, just-in-time provisioning, and explicit environment boundaries so the assistant receives only the access needed for a specific task. Where possible, the execution layer should be tied to cryptographic workload identity such as SPIFFE or OIDC-backed tokens, because the control objective is to prove what the agent is and what it is allowed to do at that moment.

Policy enforcement should be context-aware. A migration in a dev sandbox may be acceptable, while the same action in a pre-production database may require human approval or be blocked entirely. That is why static RBAC alone is insufficient for autonomous execution flows: the same assistant can behave differently depending on prompt content, retrieved context, and tool output. Real-time policy evaluation, supported by policy-as-code, is the practical control pattern that best fits this model.

Operationally, teams should log the full execution path: prompt, approval, tool call, command output, and any resulting change. NHIMG’s Lifecycle Processes for Managing NHIs reinforces the need to manage issuance, rotation, and revocation as an end-to-end lifecycle, not a one-time setup. The same discipline applies here, because code execution without revocation and traceability becomes standing privilege by another name. These controls tend to break down when assistants are connected to shared admin shells or long-lived service accounts, because task boundaries disappear and privilege accumulates across sessions.

Common Variations and Edge Cases

Tighter execution control often increases friction, so organisations must balance developer velocity against blast-radius reduction. That tradeoff is real, especially when teams want assistants to automate routine migrations without turning every change into a manual approval queue.

There is no universal standard for this yet, but current guidance suggests a few consistent patterns. Read-only assistants can often operate with looser controls than assistants that write to databases, deploy code, or alter IAM. Similarly, a migration assistant that runs in an isolated CI job is much easier to govern than one embedded in an interactive IDE with broad network reach. For high-risk actions, current best practice is evolving toward separation of duties, ephemeral execution environments, and explicit human sign-off for sensitive targets.

NHIMG’s The State of Secrets in AppSec is also relevant here because code-executing assistants frequently interact with secrets, tokens, and API keys. If the assistant can read them, it may also leak them into logs, generated code, or downstream tools. That risk becomes sharper in environments with weak secret hygiene or fragmented controls. In short, assistant governance must cover both the code path and the credential path, because one without the other leaves a workable attack route.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Covers unsafe tool use and autonomous execution risk in agents.
CSA MAESTRO	TRM	Addresses trust boundaries and runtime governance for agentic workflows.
NIST AI RMF	GOVERN	AI RMF governance fits accountability for autonomous assistants executing code.

Restrict tool authority, require context checks, and block direct action without policy approval.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should teams govern AI assistants that can run migrations and execute code?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group