Agentic runtime security exposes the gap between prompts and actions

By NHI Mgmt Group Editorial TeamPublished 2026-03-23Domain: AnnouncementsSource: HiddenLayer

TL;DR: One in eight AI breaches are linked to agentic systems, while most controls still stop at prompts, policies, or static permissions, leaving execution-time behaviour largely unobserved, according to HiddenLayer. The real problem is that autonomous agents can create impact inside the same session that traditional review and monitoring models assume is still ongoing.

At a glance

What this is: HiddenLayer’s update focuses on runtime security for autonomous AI execution, with a central finding that prompt-level and static-permission controls leave execution-time agent behaviour insufficiently governed.

Why it matters: IAM teams need to treat autonomous agents as runtime identities because behaviour, tool use, and action timing can change the blast radius faster than traditional access review, PAM, or policy controls can respond.

By the numbers:

One in eight AI breaches are linked to agentic systems, according to HiddenLayer’s 2026 AI Threat Landscape Report.
92% agree governing AI agents is critical to enterprise security, yet only 44% have implemented any policies to do so.

👉 Read HiddenLayer's update on agentic runtime security for autonomous AI execution

Context

Agentic runtime security is the control layer that watches what an AI agent does while it is making decisions, calling tools, and moving through a workflow. HiddenLayer’s announcement lands in a gap that many IAM programmes still leave open: prompt filtering and static permissions do not tell you whether an agent is about to exfiltrate data, chain tools, or execute an unsafe action at runtime.

For identity teams, the issue is not whether AI agents are useful. The issue is whether access governance can keep up when the identity subject is a software actor that acts at machine speed, changes context mid-session, and can expand impact before a human review cycle even starts.

Key questions

Q: How should security teams govern autonomous AI agents at runtime?

A: Treat the agent as a runtime identity whose behaviour must be observed while it acts, not just before it starts. Focus on session traces, tool calls, data movement, and action-level enforcement so unsafe behaviour can be detected and stopped inside the workflow rather than after the damage is done.

Q: Why do static permissions fail for autonomous AI execution?

A: Static permissions define what an agent can access in theory, but they do not control how the agent combines access, tools, and timing during execution. Autonomous behaviour creates risk through sequences, so governance must account for runtime context rather than only the entitlement set assigned at provisioning time.

Q: What breaks when AI agent behaviour is only monitored at the prompt layer?

A: Prompt-layer monitoring misses the point where risk becomes real, which is tool use, data movement, and chained actions inside the session. If the agent can move from request to execution without fresh control checks, the organisation sees intent but not impact until it is too late.

Q: How can organisations tell whether AI agent controls are actually working?

A: Look for evidence that the organisation can reconstruct sessions, explain why a tool was called, and show where unsafe actions were blocked or redacted. If investigations depend on guesswork or final output alone, the control set is not governing runtime behaviour effectively.

How it works in practice

Why prompt-level controls do not govern agent execution

Prompt filters and policy rules sit upstream of execution, so they can shape requests but not reliably control what an agent does once it starts using tools. Runtime risk appears when the agent combines data, calls external systems, and progresses through multi-step workflows without a fresh authorization check at each meaningful decision point. That is why static permissions are not enough for autonomous execution. They define what should be possible in principle, not what is safe in context. Practical implication: teams need visibility into session-level behaviour, not just prompt content or pre-approved tool lists.

Practical implication: move from prompt inspection to session-level monitoring of tool use, data access, and action sequencing.

Agentic runtime visibility and investigation at machine speed

Runtime visibility means reconstructing an agent session after the fact and, where possible, in near real time. That includes the tools called, the data touched, the branches taken, and the outputs produced. Investigation is different from simple logging because it lets analysts pivot across sessions and identify unusual patterns, such as repeated tool chaining or unexpected data movement. In agentic environments, this matters because the harmful action may be a sequence rather than a single event. Practical implication: establish evidence capture that preserves execution paths, not only authentication events or final outputs.

Practical implication: retain session traces that support forensic review of every tool call and branch taken.

Adaptive enforcement must act on context, not just policy

Agentic enforcement is the decision to allow, redact, or block an action based on the runtime context around it. That context can include the data classification involved, the current workflow step, the tool being called, and the confidence that the action matches expected behaviour. This is the difference between governance on paper and governance in execution. For autonomous systems, context-sensitive enforcement is what stops a benign-looking workflow from turning into credential exposure or unauthorised data movement. Practical implication: enforce at the point of action, where the risk is actually created.

Practical implication: build context-aware controls that can stop unsafe actions before the agent completes them.

NHI Mgmt Group analysis

Execution-time governance is the control boundary that now matters most for AI agents. HiddenLayer’s update shows that prompt-level controls and static permissions only shape the front door of agentic systems. They do not govern what happens once an autonomous actor starts chaining tools, switching contexts, and acting at machine speed. The practitioner implication is clear: the security boundary moves from authorisation intent to runtime behaviour.

Agentic runtime visibility is becoming the minimum evidence standard for autonomous identity. If an organisation cannot reconstruct what an agent accessed, which tools it used, and how it moved from one action to the next, it cannot credibly investigate or contain agentic risk. That is not a tooling preference, it is a governance failure in auditability. Practitioners should treat runtime traces as identity evidence, not optional telemetry.

Autonomous AI collapses the assumption that access persists long enough to be reviewed. Access review was designed for conditions where privilege remains stable between grant, use, and certification. That assumption fails when the actor can acquire, combine, and discard permissions inside a single runtime session. The implication is not simply more review, but a rethink of what review can meaningfully observe when the identity executes faster than the governance cycle.

Adaptive enforcement is now part of identity containment, not just AI safety. HiddenLayer’s focus on blocking unsafe actions, redacting sensitive data, and constraining execution reflects a broader shift in how autonomous actors must be governed. This is where NHI controls, zero trust assumptions, and AI risk governance converge around one question: can the organisation stop a software identity before the action completes? Practitioners need controls that operate at the point of execution, not after the fact.

Agentic AI is turning NHI governance into a runtime discipline. Once an AI system can choose actions and timing independently, it stops behaving like a static workload and starts behaving like an identity with operational discretion. That changes how blast radius, accountability, and containment should be modelled. Security teams should re-evaluate whether their current NHI programme is built to observe behaviour, or merely to grant access.

From our research:
92% agree governing AI agents is critical to enterprise security, yet only 44% have implemented any policies to do so, according to AI Agents: The New Attack Surface report.
Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.
That gap is why OWASP Agentic Applications Top 10 belongs in the control conversation whenever runtime behaviour, tool misuse, or scoped execution is in play.

What this signals

Agentic runtime governance is moving from optional enhancement to baseline control. The early pattern is clear: organisations are adopting AI agents faster than they are building evidence, containment, and accountability for what those agents do after authentication. The programme implication is that IAM, PAM, and AI security teams need a shared operating model for runtime identities, not separate oversight layers.

Runtime traces will become the audit artefact that matters most for autonomous systems. If the organisation cannot explain an agent’s execution path, it cannot defend its decisions to compliance, legal, or incident response teams. That is why session reconstruction and action-level enforcement should be designed into the governance model now, before agent deployments scale further.

Agentic AI widens the gap between access grant and observable control. With 98% of companies planning more AI agents in the next 12 months, according to AI Agents: The New Attack Surface report, security teams need to prepare for a larger population of identities that act faster than review cycles can see.

For practitioners

Instrument agent sessions end to end Capture tool calls, data access, branching decisions, and outputs for each autonomous session so investigators can reconstruct behaviour after an incident or policy violation.
Move enforcement to the point of action Block or redact unsafe actions when the agent attempts them, rather than relying on prompt filtering or static allow lists that cannot react to runtime context.
Define agent-specific containment triggers Set escalation conditions for anomalous chaining, unexpected system access, and repeated data movement so containment can begin before the workflow completes.
Review identity evidence by workflow, not only by account Treat each agent workflow as an auditable execution path with its own evidence trail, because a single account can produce multiple distinct risk states in one session.

Key takeaways

Autonomous AI agents create runtime identity risk because they can chain tools and actions faster than static controls can react.
HiddenLayer’s own analysis says one in eight AI breaches are already linked to agentic systems, which makes runtime visibility a current control need, not a future one.
Practitioners should move from prompt-only oversight to session reconstruction, context-aware enforcement, and audit-ready execution traces.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	NHI-01	Agentic execution and tool misuse are central risks in this update.
NIST AI RMF		The article focuses on governance and runtime accountability for AI behaviour.
NIST Zero Trust (SP 800-207)	PR.AC-4	Runtime enforcement and least privilege are core zero trust concerns for agents.

Map agent sessions, tool calls, and enforcement points to agentic risk controls before production rollout.

Key terms

Agentic runtime security: Agentic runtime security is the control layer that observes and constrains an AI agent while it is making decisions and taking actions. It focuses on execution-time behaviour, including tool calls, data access, and action sequencing, rather than only on prompts or static permissions.
Runtime visibility: Runtime visibility is the ability to reconstruct what an agent did during a live session, including which tools it used, what data it touched, and how it moved through the workflow. It turns agent behaviour into auditable evidence for security and compliance teams.
Action-level enforcement: Action-level enforcement is the practice of allowing, redacting, or blocking a specific agent action at the moment it is attempted. It differs from policy at the prompt layer because it evaluates context, timing, and risk before the action is completed.
Autonomous identity: An autonomous identity is a software actor that can choose its action sequence, select tools, and decide when to act without human approval gates. In governance terms, it behaves like an identity with runtime discretion, so review and containment must operate on execution rather than on entitlement alone.

Deepen your knowledge

Agentic runtime visibility and enforcement are covered in the NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for autonomous AI execution, it is a useful place to ground the governance model.

This post draws on content published by HiddenLayer: Agentic runtime security capabilities for autonomous AI execution. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-03-23.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org