By NHI Mgmt Group Editorial TeamPublished 2026-04-08Domain: Agentic AI & NHIsSource: Lasso Security

TL;DR: As AI agents move from generating content to taking actions, traditional controls such as DLP, RBAC, and prompt filters lose visibility into runtime behavior and cumulative context, according to Lasso Security. Intent security shifts the decision point from what the model said to whether the action belongs in context, which is now essential for governing production agents.


At a glance

What this is: This post argues that agentic AI security has to move from content filtering to runtime behavior control because agents now take actions across enterprise systems.

Why it matters: It matters because IAM, PAM, and governance programmes must assess what an AI agent is allowed to do at runtime, not just what it produces in a prompt or response.

👉 Read Lasso Security's analysis of why agentic AI needs intent security


Context

Agentic AI security is the problem of governing systems that can decide and act, not just generate text. The article argues that once agents can call APIs, update records, and trigger workflows, the security question changes from output inspection to runtime behaviour control.

Traditional enterprise controls assume risk can be evaluated at a single point in time. That assumption breaks when an agent carries forward context, adapts across steps, and reaches different outcomes from the same request depending on what has already happened in the chain.


Key questions

Q: How should security teams govern AI agents that can take actions in production systems?

A: Security teams should govern AI agents as runtime actors, not just content generators. That means defining which actions are allowed, which require approval, and which are blocked even when the agent has valid credentials. The control point is the behaviour of the agent across the full workflow, including context, tool use, and downstream effects.

Q: Why do traditional IAM and DLP controls fall short for agentic AI?

A: Traditional IAM and DLP controls assume risk can be judged at a point in time from one request or one response. Agentic AI accumulates context, chains actions, and can reach harmful outcomes through a sequence of individually valid steps. That makes single-event inspection too narrow for production governance.

Q: What is the difference between content filtering and intent security for AI agents?

A: Content filtering checks whether text looks risky, while intent security checks whether the resulting action belongs in context. Intent security compares user goal, application purpose, outside data, and the action about to happen. It is the right control when the risk is operational behaviour, not just unsafe language.

Q: What should organisations measure to detect drift in agent behaviour?

A: Organisations should measure whether the agent’s actions still match the user’s goal, the intended workflow, and the normal pattern for that agent or role. Changes in action sequence, tool use, or side effects are stronger governance signals than prompt content alone. That is how behavioural drift becomes visible.


Technical breakdown

Why point-in-time controls fail for agentic AI

Point-in-time controls such as DLP, prompt filters, and coarse RBAC work best when the security question is visible in one request or one response. Agentic systems accumulate conversation history, retrieved content, tool output, and prior reasoning, so the relevant risk emerges over a sequence rather than a single event. That makes isolated inspection unreliable because the dangerous step is often only obvious after the agent has chained several valid actions together. The architecture problem is not just more volume, but more state carried across time.

Practical implication: move from single-event review to runtime monitoring that evaluates multi-step agent behaviour across the full decision chain.

Intent security and context-aware authorisation

Intent security evaluates whether an action makes sense in context, not whether the prompt contains risky text. It compares the user’s goal, the application’s allowed purpose, outside data influencing the model, and the action the agent is about to take. When those signals diverge, the issue is behavioural drift rather than content abuse. This is why content inspection alone is incomplete for production agents: the risky decision may be operational, such as approving a refund, modifying records, or triggering a workflow outside policy.

Practical implication: define approval and escalation rules around action context, not just prompt content, especially for workflows that can change records or move money.

Alignment and drift in dynamic agent behaviour

Alignment asks whether the agent’s action still matches the user’s goal and the system’s intended purpose. Drift asks whether the behaviour still looks normal for this agent, this user, or this workflow. Those two tests matter because an agent can remain internally consistent while still moving outside its intended boundary. In practice, that means legitimate permissions and valid tools can still produce unacceptable outcomes if the behaviour pattern changes enough. The governance challenge is therefore behavioural change detection, not only policy enforcement.

Practical implication: monitor for behavioural drift across repeated agent sessions and treat pattern changes as a governance signal, not just an anomaly alert.


Threat narrative

Attacker objective: The objective is to use legitimate agent access to cause harmful business actions that appear valid step by step but are wrong in aggregate.

  1. Entry occurs when an autonomous AI agent is granted legitimate access to enterprise systems such as APIs, internal platforms, or workflow engines.
  2. Escalation happens when the agent accumulates context, chains valid steps, and moves beyond the original user intent without a human approval gate between actions.
  3. Impact appears when the agent approves refunds, updates records, or triggers workflows that create operational, compliance, or reputational harm.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.


NHI Mgmt Group analysis

Intent security is now an identity governance problem, not just an AI safety problem. The article shows that the decisive risk is not what an agent says, but what it is authorised to do at runtime across enterprise systems. That moves the governance burden from content review into action control, with implications for IAM, PAM, and access decisioning. Practitioners should treat agent behaviour as an access problem first and a model problem second.

Point-in-time security was designed for stable requests, and that assumption fails when an agent carries context across a decision chain. The premise behind DLP, prompt filtering, and single-step RBAC is that the risky event can be observed at one moment. That assumption breaks when the agent adapts across steps and the harmful outcome only emerges after a sequence of individually valid actions. The implication is that governance must be built around behaviour over time, not isolated checks.

Behavioral drift is the named failure mode this framework exposes. The article is describing a system where alignment can look intact while the action pattern has already moved outside policy, purpose, or expected workflow. That failure mode is more precise than generic “AI risk” because it names the control gap practitioners miss when they only inspect prompts. Teams should recognise drift as a governance signal that current review models were not designed to catch.

Agentic AI expands the NHI problem from credential governance to execution governance. Traditional NHI controls are built to manage identities that authenticate and act within bounded permissions. Once the actor can sequence actions autonomously, the security question becomes whether the runtime behaviour still belongs inside the approved delegation boundary. Practitioners should re-evaluate whether their NHI model can express action scope, not just access scope.

The combination of valid permissions and invalid behaviour is what makes agentic systems hard to govern. The article correctly points out that serious incidents may look operational rather than like classic compromise. That means access reviews, policy checks, and workflow approvals have to account for legitimate tools being used in illegitimate combinations. The practical conclusion is that security teams must measure behavioural appropriateness, not just entitlement presence.

From our research:

  • 98% of companies plan to deploy even more AI agents within the next 12 months, despite documented rogue behaviour in 80% of current deployments, according to AI Agents: The New Attack Surface report.
  • Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.
  • The governance problem is moving faster than most programmes can adapt, and the article on OWASP Agentic AI Top 10 provides the next control lens practitioners should use.

What this signals

Behavioral drift is the right name for the gap many programmes are about to face. When 98% of companies plan to deploy more AI agents within 12 months and 80% already report rogue behaviour, the issue is no longer adoption speed, but whether governance can keep up with runtime decisions.

The immediate programme shift is from static entitlement review to action-level authorisation. Teams should expect more demand for logging, approval gating, and policy checks that can explain why a specific agent action was allowed, not only whether the agent had access.

For practitioners building an agentic control stack, the most relevant external baseline is the OWASP Top 10 for Agentic Applications 2026, because it frames the same runtime-risk problem from a control-design perspective.


For practitioners

  • Define runtime approval boundaries for agent actions Map which agent actions can execute autonomously, which require step-up approval, and which must be blocked even when the underlying credentials are valid. Focus on refunds, record updates, workflow triggers, and other actions with direct business impact.
  • Instrument agent behaviour monitoring Log the full decision chain, including prompt context, retrieved data, tool calls, and downstream side effects so that drift can be detected across a session rather than inside a single prompt.
  • Separate content risk from action risk Keep content inspection, but add an independent control for whether the resulting action fits the user intent, application purpose, and policy boundary. A clean prompt should never be treated as proof of safe behaviour.
  • Review delegated access for business workflows Check whether existing workflow permissions allow an agent to make irreversible changes, approve transactions, or trigger cascades without human review. Where they do, require explicit delegation scope and tighter audit logging.

Key takeaways

  • Agentic AI changes the security question from what the model said to what the system did.
  • Legacy controls such as DLP, RBAC, and prompt filters are too narrow when risk emerges across a chain of actions.
  • Practitioners need runtime behaviour control, behavioural drift detection, and action-level authorisation to govern production agents effectively.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Agentic AI Top 10Addresses agentic AI action misuse and runtime tool abuse.
NIST AI RMFAI RMF GOVERN and MAP fit runtime accountability for autonomous behaviour.
NIST CSF 2.0PR.AC-4Access permissions must reflect action scope, not only authentication.

Assign ownership for agent actions and document how intent, context, and side effects are assessed.


Key terms

  • Intent Security: Intent security is a control approach that checks whether an AI agent's action belongs in context, not just whether its output looks safe. It combines user goal, application purpose, surrounding data, and the next action to decide if behaviour is appropriate.
  • Behavioral Drift: Behavioral drift is the gradual movement of an AI agent's actions away from the normal or approved pattern for that workflow. It can happen even when each step looks valid on its own, which is why sequence-level monitoring matters more than single-step review.
  • Runtime Behaviour Control: Runtime behaviour control is the practice of governing what an AI agent actually does while it is executing, including tool calls, record changes, and workflow triggers. It complements content inspection by focusing on action, timing, and side effects rather than text alone.

Deepen your knowledge

Agentic AI runtime behaviour control is a core topic in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building governance for production agents, it is worth exploring.

This post draws on content published by Lasso Security: Why Agentic AI Needs Intent Security. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-04-08.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org