AI policy enforcement for GenAI: runtime controls that hold

By NHI Mgmt Group Editorial TeamPublished 2026-04-09Domain: Agentic AI & NHIsSource: Lasso Security

TL;DR: Enterprises are moving AI into production faster than they can govern it, and Lasso Security argues that runtime policy enforcement must inspect prompts, retrieved context, outputs, and tool calls as they happen, not after the fact, to manage data exposure, misuse, and compliance risk. The governing assumption is already broken: traditional policy models expect stable boundaries and predictable execution, while GenAI changes behavior with context and downstream actions.

At a glance

What this is: This is a runtime governance analysis of AI policy enforcement, showing that GenAI needs continuous controls across prompts, context, outputs, and tool execution.

Why it matters: It matters because IAM, data protection, and compliance teams must govern AI behaviour as an active execution path, not as a static application layer, across NHI, autonomous, and human workflows.

👉 Read Lasso Security's analysis of AI policy enforcement for GenAI workflows

Context

AI policy enforcement is the discipline of deciding, in real time, what an AI system may see, combine, say, and do. The governance problem is that GenAI does not behave like a normal application: context changes the output, retrieved data changes the response, and tool calls can extend the impact beyond the original prompt.

For identity and access teams, that means policy can no longer stop at authentication or static authorisation. AI workflows now create a shadow execution layer where data access, delegated actions, and output shaping all need continuous control, especially when copilots, internal agents, and third-party AI services are embedded in daily work.

Key questions

Q: How should security teams enforce AI policy at runtime?

A: Security teams should enforce AI policy in a separate control layer that inspects prompts, retrieved context, outputs, and tool calls before a response or action completes. That layer should make deterministic allow, block, redact, or constrain decisions based on identity, data sensitivity, and intent, rather than relying on the model to self-regulate.

Q: Why do traditional IAM controls fail for GenAI workflows?

A: Traditional IAM controls assume a stable subject, a bounded action, and a predictable execution path. GenAI breaks those assumptions because the same user can trigger different outputs, data retrieval, and downstream actions depending on context, meaning authorisation has to operate continuously across the interaction.

Q: What do organisations get wrong about AI policy enforcement?

A: The most common mistake is treating AI policy as a document, a prompt rule, or a design-time approval step. Effective enforcement requires runtime inspection, context-aware decisions, and logging of the rationale behind each action, because risk emerges during execution rather than only at deployment.

Q: How can compliance teams prove AI governance is actually working?

A: Compliance teams should look for evidence that policy decisions were enforced during operation, not just written into standards. Useful proof includes runtime logs, policy rationale, blocked or redacted responses, and records showing that restricted models or tools were denied in sensitive workflows.

Technical breakdown

Real-time AI activity controls and runtime interception

Real-time AI activity controls sit in the execution path and inspect prompts, retrieved context, model outputs, and tool calls as they happen. The control plane must be independent of the model because the model cannot reliably self-police under prompt injection, context poisoning, or policy evasion. This is closer to inline security enforcement than to traditional post hoc monitoring. The practical mechanism is streaming inspection plus deterministic allow, block, redact, or constrain decisions before the response or tool action completes.

Practical implication: Place enforcement outside the model so prompt, output, and tool-call decisions are made by a separate policy layer.

Context-based policy enforcement and AI usage rules

Context-based enforcement uses identity, role, data sensitivity, intent, conversation history, and retrieved sources to decide whether a response is acceptable. This is a CBAC pattern for AI, not plain RBAC, because the same user can receive different output depending on the query and data involved. Role-based AI usage rules extend that idea by constraining what kinds of AI actions a role may perform, such as summarising, transforming, or generating regulated content. The architectural point is that policy must evaluate meaning and circumstance, not just access entitlement.

Practical implication: Bind AI policy decisions to user role, intent, and data context rather than treating every prompt as equivalent.

Model and tool-level restrictions in agentic workflows

When AI systems call plugins, APIs, retrieval sources, or downstream agents, the real control surface becomes the orchestration layer. Model and tool-level restrictions prevent sensitive workflows from using unapproved models or external tools, and they limit which delegated actions can be invoked under a given policy. This matters most where autonomy increases the number of possible execution paths. Once an AI system can chain tool calls, policy has to govern the path, not just the input or output.

Practical implication: Restrict which models, tools, and retrieval sources each workflow may invoke, especially in regulated or high-risk use cases.

ASP.NET machine keys RCE attack — 3,000+ exposed ASP.NET machine keys enabled remote code execution.
DeepSeek breach — DeepSeek breach exposed 1M+ log lines and sensitive secret keys.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

AI policy enforcement is becoming an identity problem, not just a data problem. The article correctly frames runtime policy as the place where AI behaviour is governed, but the deeper point is that identity, context, and execution are now inseparable. Once an AI system can retrieve data and trigger downstream actions, access control must follow the behaviour chain, not the login event. Practitioners should treat policy enforcement as runtime identity governance for non-human execution.

Context-based AI controls expose the limits of static RBAC. The same prompt can be safe in one context and unacceptable in another because identity alone does not describe the risk. That is why AI enforcement has to consider purpose, data classification, conversation state, and downstream usage before a response is released. Practitioners should stop assuming that role assignment can answer all authorisation questions in GenAI.

Shadow AI creates policy drift faster than audit cycles can close it. Multiple surveys already show that more than half of employees are using GenAI tools that have not been formally approved or reviewed, which means the control problem starts outside sanctioned systems. The named concept here is shadow execution layer: AI work happening in everyday workflows without the control plane that would normally make it visible, reviewable, and governable. Practitioners should focus on discovery and runtime control, not policy documents alone.

Runtime governance is now the compliance baseline for regulated AI use. The article aligns with the direction of the EU AI Act and NIST AI RMF, both of which expect controls to be demonstrable while systems operate. That shifts evidence from design-time approval to operational enforcement, including logging of decisions and rationale. Practitioners should assume auditors will ask what the system did at runtime, not what the policy said on paper.

AI policy enforcement converges NHI, human IAM, and agent governance into one control problem. Human users, service accounts, and autonomous agents can all participate in the same AI workflow, but the enforcement logic has to follow the most dynamic actor in the chain. This is where identity governance becomes cross-domain: entitlement, delegation, and output control all land in the same runtime path. Practitioners should design for the full workflow, not one identity class at a time.

From our research:
The average estimated time to remediate a leaked secret is 27 days, despite 75% of organisations expressing strong confidence in their secrets management capabilities, according to The State of Secrets in AppSec.
Only 44% of developers are reported to follow security best practices for secrets management, exposing a significant developer behaviour gap.
Use Top 10 NHI Issues to map where runtime enforcement, secrets handling, and delegated access controls break down together.

What this signals

Runtime AI policy will converge with NHI governance faster than most programmes expect. As AI systems begin to trigger tool calls and handle sensitive data, the enforcement point moves from the application layer to the identity and decision layer. Teams that already govern service accounts, tokens, and delegated access are better placed to extend those controls to AI workflows without creating a parallel governance stack.

With 33% of organisations reporting AI agents accessing sensitive data beyond their intended scope, per AI Agents: The New Attack Surface, the operational question is not whether to govern AI but how to embed enforcement where decisions are made. That means discovery, policy evaluation, and audit logging need to be part of the execution path, not separate after-action processes.

Policy engines will need to understand context, not just entitlement. The next practical step for most enterprises is to align AI policy with existing access governance, data classification, and runtime monitoring so controls can travel with the prompt, the retrieved data, and the delegated action chain.

For practitioners

Move policy enforcement into the runtime path Inspect prompts, retrieved context, outputs, and tool calls in-line so allow, block, redact, or constrain decisions happen before completion. Keep the policy engine separate from the model itself.
Replace static rules with context-based decisions Use identity, role, query intent, data classification, and conversation state to decide whether an AI response is acceptable. Treat identical prompts differently when the underlying data sensitivity changes.
Constrain model and tool combinations by workflow Limit which models, plugins, APIs, and retrieval sources can be used for each business process. Apply the strictest controls where external retrieval or regulated data is involved.
Log policy decisions and rationale Record why a response was modified or blocked, not just that enforcement occurred. Preserve the policy trigger, the context signals, and the resulting action for audit and incident review.
Inventory approved and shadow AI usage Map which teams are using public, private, and embedded AI tools without security review. Prioritise discovery of hidden AI workflows before expanding policy coverage.

Key takeaways

AI policy enforcement fails when organisations rely on static rules for systems that change behaviour with context, data, and downstream tool use.
The evidence points to a broad control gap, with runtime governance and auditability becoming the deciding factors for safe GenAI adoption.
Enterprises need policy decisions made at execution time, not after the model has already exposed data or triggered actions.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	LLM-04	Runtime policy must govern tool use and output shaping in AI workflows.
NIST AI RMF		AI RMF requires continuous, contextual, lifecycle-aware risk management.
NIST Zero Trust (SP 800-207)	PR.AC-4	AI runtime controls extend least privilege to delegated actions and context.

Treat AI workflows as continuously verified execution paths rather than one-time authenticated sessions.

Key terms

Runtime AI Policy Enforcement: Runtime AI policy enforcement is the practice of deciding what an AI system may access, generate, or trigger while the interaction is happening. It moves policy out of documents and into the control path, where prompts, context, outputs, and tool calls can be evaluated before impact occurs.
Context-Based Access Control For AI: Context-based access control for AI uses role, intent, data sensitivity, conversation history, and retrieved sources to decide whether a response is acceptable. It is more precise than static RBAC because the same identity can represent different risk depending on the task and surrounding data.
Shadow Execution Layer: A shadow execution layer is AI activity that happens inside everyday workflows without being governed by a visible control plane. It includes approved and unapproved tools, agents, or copilots that transform data, make decisions, or invoke actions outside standard monitoring and review.
Delegated AI Action Chain: A delegated AI action chain is the sequence of permissions and tool invocations that an AI system uses to complete a task. For governance, the important unit is not the initial login but the full path from identity through retrieval, model output, and downstream execution.

Deepen your knowledge

AI policy enforcement and runtime governance are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are extending identity controls into GenAI workflows, it is worth exploring.

This post draws on content published by Lasso Security: AI Policy Enforcement to Protect Data, Models & Enterprise Systems. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-04-09.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org