TL;DR: Agentic AI introduces memory, tool use, and execution paths that make stateless prompt inspection insufficient, while Lasso Security says its Intent Security framework evaluates behaviour across sequences of decisions, with risky actions blocked in under 50 milliseconds and detection accuracy improved by up to 80 percent. The core issue is that existing controls assume isolated interactions, but agentic systems create cumulative behavioural risk that must be governed over time.
At a glance
What this is: This is Lasso Security's analysis of why stateless AI controls break down for agentic systems and why behavioral baselining becomes the new security primitive.
Why it matters: IAM, NHI, and AI governance teams need a model that evaluates intent, scope, and trajectory across sessions, not just single prompts or outputs.
By the numbers:
- Risky actions can be blocked in under 50 milliseconds, with broader session-level behavioral analysis completing in under five seconds.
- Detection accuracy across AI threat categories improves by up to 80 percent compared to stateless inspection models.
- The multi-model evaluation engine operates up to 570 times faster than conventional sequential approaches, enabling continuous behavioral monitoring across users, agents, and tool chains.
👉 Read Lasso Security's analysis of intent security for agentic AI environments
Context
Agentic AI is a different security problem from chatbots and copilots because it can remember, plan, call tools, and change course across multiple steps. That means the relevant unit of control is no longer a single prompt or response, but the full decision sequence that links intent to execution.
The governance gap is straightforward: controls built for stateless inspection can miss cumulative drift, indirect steering, and delayed misuse. For IAM and NHI programmes, this pushes the conversation from output filtering toward runtime identity behaviour, scope control, and session-level accountability.
Key questions
Q: How should security teams govern agentic AI that can change behaviour mid-session?
A: Security teams should govern agentic AI with controls that evaluate the full decision chain, not only isolated prompts or outputs. The practical model is intent alignment plus behavioral baselining, so the system can detect when an agent stays coherent with the request but drifts from historical scope, authority, or normal action patterns over time.
Q: Why do stateless AI controls fail for agentic systems?
A: Stateless controls fail because agentic systems carry memory forward, chain tool calls, and adjust plans across multiple steps. A single interaction can look safe while the cumulative trajectory becomes risky. Security teams need to evaluate sequence, context, and deviation, not just the latest message or action.
Q: How can organisations tell whether an AI agent is drifting out of scope?
A: Look for divergence between the agent's observed actions and its historical baseline for mission, authority, and typical tool use. The strongest signals are incremental deviation, external steering through retrieved content or instructions, and action paths that remain individually valid but collectively move beyond approved purpose.
Q: What should teams do when agentic AI needs real-time enforcement?
A: Teams should build blocking, masking, and escalation into the execution path so risky behaviour is interrupted before operational impact occurs. That requires stateful monitoring across users, agents, tools, and applications, plus thresholds that are tuned to act at machine speed rather than after the session has ended.
Technical breakdown
Why stateless inspection fails for agentic AI
Stateless inspection assumes each interaction can be judged on its own. That works when a system only receives a prompt and returns an output, but agentic systems preserve memory, chain actions, and revise plans as new signals arrive. A harmless-looking step can become risky only when combined with earlier context, tool calls, or retrieved content. The failure mode is cumulative rather than immediate, which is why single-event classification misses the real trajectory. Security analysis has to follow the decision chain, not just the latest message.
Practical implication: teams need controls that evaluate sequences, not isolated events.
Intent alignment and behavioral baselines
Behavioral baselining combines two controls. Intent alignment checks whether the current request and action match the stated goal. Behavioral baselining then compares the resulting activity with historical patterns for the user, agent, or application. That distinction matters because a system can be aligned with a request yet still drift away from normal scope over time. In agentic environments, the security signal is not only what the model produced, but whether the action path remains coherent, expected, and authority-bound as execution unfolds.
Practical implication: define expected mission, scope, and authority before evaluating drift.
Stateful enforcement across the agentic lifecycle
Stateful enforcement tracks interactions across users, agents, tools, and applications so the system can intervene before operational impact. That includes identifying when a request deviates from historical behavior, when outside inputs reshape the trajectory, and when a threshold should trigger blocking, masking, or escalation. The architectural shift is from post-event inspection to continuous evaluation with enforcement attached to the execution path. For security teams, this is the difference between observing a problem and preventing it at machine speed.
Practical implication: integrate enforcement into the execution path, not a downstream review queue.
Breaches seen in the wild
- Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
- AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
Stateless security is the wrong baseline for agentic AI. The control model that inspects prompts and outputs independently was designed for linear exchanges, not systems that remember, plan, and execute across multiple steps. Once the agent can carry context forward, risk accumulates across the full sequence rather than appearing at a single checkpoint. Practitioners should treat behavioural trajectory as the real security object, not the individual interaction.
Intent alignment is necessary, but it is not sufficient. A request can be coherent in the moment and still move into unsafe territory as the session evolves. That makes scope drift a first-class governance problem, especially where tools, APIs, or downstream agents are involved. The implication is that security teams need to separate immediate goal matching from historical behavioural variance when deciding whether an action is safe.
Behavioral baselining gives agentic AI a measurable identity boundary. Intent drift: the security model breaks when an agent's observed action path no longer matches the mission, authority, or historical pattern that defined it at the start of the session. That is not just a control gap, it is a broken assumption about how identity behaves over time. Practitioners need to rethink whether their programmes are governing outputs, or governing the continuity of purpose across execution.
Machine-speed enforcement is now part of identity governance for autonomous work. If an agent can act, adapt, and call tools in seconds, review-based governance cannot remain the primary control layer. This does not replace access governance, it changes where the decisive control point sits. Security teams should expect behavioural controls to sit alongside IAM and NHI policy rather than after the fact.
The market is moving from policy declarations to runtime evidence. The enterprise question is no longer whether agentic AI should be governed, but whether controls can prove alignment, scope, and escalation decisions as work happens. That puts behavioural telemetry, authority modelling, and continuous validation at the centre of the next governance wave. Practitioners should prepare for security programmes that measure trajectories, not just permissions.
From our research:
- Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation, according to AI Agents: The New Attack Surface report.
- 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems, inappropriately sharing sensitive data, and revealing access credentials.
- For a deeper governance model, see OWASP Agentic Applications Top 10 for the runtime risks that behavioural baselining is trying to contain.
What this signals
Intent drift: the category is shifting from prompt safety to behavioural governance, and that will change how teams think about evidence, control design, and incident review. When only 52% of companies can audit what their AI agents access, per AI Agents: The New Attack Surface report, the blind spot is no longer theoretical.
Security architects should expect agentic monitoring to converge with identity telemetry rather than sit beside it. The practical question is whether a programme can explain not just what an agent did, but why its action path still matched its mission and authority at the moment it executed.
The relevant reference set is now broader than traditional AI guardrails. Behavioural enforcement should be read alongside the OWASP Agentic AI Top 10 and NIST AI Risk Management Framework, because runtime identity behaviour is becoming the real control surface.
For practitioners
- Map agentic workflows to decision chains Document where each agent retains memory, selects tools, and changes plan across a session. Use that map to identify where isolated-event inspection will miss cumulative drift.
- Separate intent checks from baseline checks Validate the current request against the stated goal first, then compare the resulting behaviour against historical patterns for the same user, agent, or application.
- Attach enforcement to the execution path Design blocking, masking, and escalation so they can intervene before a harmful action completes, rather than waiting for downstream review or alert triage.
- Review IAM and NHI assumptions for session stability Find policies that assume access, purpose, or authority stay stable long enough to be reviewed after the fact, then redesign them for behaviours that evolve mid-session.
Key takeaways
- Agentic AI breaks the assumption that security can safely inspect prompts and outputs in isolation.
- Behavioural baselining matters because cumulative drift, not a single bad action, is the dominant failure mode.
- Teams need enforcement that acts during execution, or governance will arrive after the risk has already materialised.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | NHI-01 | Agentic systems that chain decisions and tools fit runtime identity abuse risk. |
| NIST AI RMF | Behavioral baselining needs AI governance around intent, monitoring, and accountability. | |
| NIST CSF 2.0 | PR.AC-4 | Access and authority boundaries need continuous verification as agents act over time. |
Map agentic workflows to OWASP agentic risks and add runtime checks for scope drift and tool misuse.
Key terms
- Behavioral Baseline: A behavioural baseline is the expected pattern of actions, scope, and decision paths for a user, agent, or application over time. It gives security teams a reference for spotting drift, misuse, or abnormal execution. In agentic AI, the baseline must reflect changing context and tool use, not just static permissions.
- Intent Alignment: Intent alignment is the control that checks whether a current action matches the stated goal before deeper behavioural analysis begins. It is a coherence test between request, execution, and purpose. For agentic systems, alignment is necessary but not sufficient because an agent can still drift beyond scope later in the same session.
- Stateless Security: Stateless security evaluates each event independently, without carrying meaningful history from one step to the next. That model works for narrow interactions, but it fails when systems remember, plan, or adapt. Agentic AI exposes the weakness because security must judge the full trajectory, not just the latest observable event.
- Scope Drift: Scope drift is the gradual expansion or shift of what an identity is doing compared with what it was authorised or expected to do. It often appears as small, individually valid steps that accumulate into an unsafe path. In agentic AI, drift can emerge within a single session and must be measured continuously.
Deepen your knowledge
Agentic AI behavioral baselining is covered in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building governance for systems that plan and act across sessions, this is a relevant starting point.
This post draws on content published by Lasso Security: Introducing Intent Security, a behavioral baseline framework for agentic AI. Read the original.
Published by the NHIMG editorial team on 2026-05-05.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org