Five-signal ai agent security exposes the limits of single controls

By NHI Mgmt Group Editorial TeamPublished 2026-06-03Domain: Agentic AI & NHIsSource: Zenity

TL;DR: Single-signal security for agentic AI leaves predictable blind spots across identity, data, model behavior, posture, and environment, according to Zenity. A five-signal runtime view is now the baseline for deciding whether an agent’s activity was appropriate, not merely permitted.

At a glance

What this is: This analysis argues that agentic AI security fails when teams rely on any single signal, because runtime appropriateness depends on correlating identity, data, model behavior, posture, and environment.

Why it matters: IAM, NHI, and AI governance teams need correlated runtime controls because partial visibility can miss authorized but malicious agent activity that still produces material breach outcomes.

👉 Read Zenity's analysis of five-signal AI agent security and runtime response

Context

Single-signal controls are insufficient for agentic AI because an agent can look normal through one lens while behaving anomalously through another. In practice, that means identity telemetry, data access telemetry, model-layer monitoring, posture data, and environment context all matter at runtime if teams want to know whether an action was appropriate.

The identity governance problem is broader than prompt filtering or secret control alone. For AI agents, the relevant question is not just whether access was granted, but whether the access, timing, and downstream tool use still fit the task as it unfolded. That is the point where conventional IAM assumptions start to fail.

Key questions

Q: How should security teams govern AI agents that can act within authorised scope?

A: They should correlate identity, data, model behaviour, posture, and environment signals at runtime. An agent can remain fully authorised while still being manipulated into harmful actions, so permission alone is not enough. Governance needs to answer whether the activity was appropriate for the task, not just whether the credential was valid.

Q: Why do single-signal controls fail for agentic AI security?

A: They fail because each signal covers only one part of the decision chain. Identity shows who acted, data shows what was touched, model behaviour shows manipulation, posture shows configuration state, and environment shows context. Any one of them can look normal while the others reveal compromise, so isolated monitoring leaves structural blind spots.

Q: What breaks when AI agent monitoring stops at deployment posture?

A: Runtime attacks break through that model because a clean deployment does not guarantee a clean session. An agent can be correctly configured at scan time and later drift through prompt injection, tool misuse, or altered goals. Teams need continuous execution-time visibility to catch that change before material damage occurs.

Q: What is the difference between blocking an agent and mutating a step?

A: Blocking stops the workflow entirely, while step mutation rewrites one inappropriate action and lets the rest continue. That difference matters when the overall task is legitimate but one step is unsafe. Mature programs need both options because not every suspicious action requires a full kill switch.

Technical breakdown

Why identity-only monitoring misses agentic abuse

Identity monitoring tells you which credential a system used, whether the token was valid, and whether activity stayed inside the granted scope. For agentic AI, that is necessary but not sufficient. An attacker can manipulate the agent through prompt injection or compromised tool responses while the identity signal remains clean. The session still looks authorised, but the workflow can drift into harmful data access or tool invocation. Because the identity layer does not explain intent, it cannot distinguish legitimate task completion from an authorised compromise path.

Practical implication: Treat clean identity telemetry as one input, not a decision point, and correlate it with task-level behaviour before allowing the workflow to continue.

How data, posture, and model behaviour combine at runtime

Data signals show what the agent touched, model behaviour signals show whether reasoning or outputs were manipulated, and posture signals show whether the agent’s configuration changed from a known-good state. Each domain covers a different failure mode. Data alone cannot tell you who acted or why. Model monitoring alone can miss downstream tool misuse. Posture alone only proves the deployment state, not the runtime state. The architectural problem is that agentic AI attacks often span all three domains in sequence, so any one of them can appear normal while the others reveal compromise.

Practical implication: Build detections that join data access, behavioural drift, and posture change into one runtime verdict instead of triaging them separately.

Step mutation and the response spectrum beyond block or allow

Step mutation is the ability to intercept and rewrite a single inappropriate action inside a multi-step workflow while allowing the rest of the task to continue. It sits between simple logging and full workflow termination. That matters because many agentic incidents are not binary. A specific step may be unsafe, but the overall business process may still be legitimate if the risky action is replaced with a safe equivalent. This requires trajectory-aware monitoring, meaning the platform must understand the workflow state across steps rather than react to isolated events.

Practical implication: Adopt response options that include step-level intervention, but reserve hard stops for irreversible actions or confirmed exfiltration.

Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

Single-signal security is an architectural blind spot, not a tuning problem. The article shows that identity-only, data-only, model-only, posture-only, and environment-only monitoring each leave a different blind spot open. That means the failure is structural, not operational. A control that only sees one slice of the agent’s runtime state cannot answer whether an action was appropriate. Practitioners should treat partial coverage as incomplete by design, not as a lower-fidelity version of the same answer.

Agentic AI exposes a runtime governance gap across the identity stack. The most useful distinction here is not “AI versus non-AI” but whether the system can be evaluated only after it acts. Zenity’s five-signal framing shows that the decision surface is distributed across identity, data, model behaviour, posture, and environment. That aligns with OWASP NHI and NIST Zero Trust thinking, but it also pushes beyond static access control. The implication is that agent governance must be evaluated as a live control plane, not a pre-deployment checklist.

Five-signal coverage names the named concept of identity context collapse. Identity context collapse occurs when the credential remains valid but the surrounding signals that explain whether the action is legitimate no longer align. The article’s PleaseFix example illustrates this clearly: the token is clean, yet the workflow is no longer trustworthy. This is a failure of contextual assurance, not a failure of authentication alone. Practitioners should recognise that a valid session can still be the wrong session for the task.

Step mutation matters because binary controls force avoidable trade-offs. A block-or-allow model treats every suspicious step as if the only safe outcome is termination. That is too blunt for multi-step agent workflows where one action can be rewritten safely and the rest of the task can continue. The broader market signal is that agent security is moving toward trajectory-aware response, not just detection. Security teams should expect their governance model to distinguish between recoverable steps and irreversible ones.

Runtime agent governance is becoming the real boundary between acceptable and unacceptable automation. The article’s core claim is that risk cannot be judged from prompt content, secret inventory, or model output alone. It must be judged from assembled runtime signals that explain what the agent was doing, what it touched, and how its behaviour changed. That pushes AI governance closer to continuous identity verification and continuous authorisation than to traditional static review. Practitioners should plan for controls that decide in motion, not after the fact.

From our research:
79% of organisations have experienced secrets leaks, with 77% of these incidents resulting in tangible damage, according to Ultimate Guide to NHIs.
Only 5.7% of organisations have full visibility into their service accounts, which shows how often identity governance starts from partial inventory.
The 52 NHI Breaches Analysis is the right next step for practitioners who need real incident patterns, root causes, and control failures.

What this signals

Identity context collapse: agentic AI creates a condition where a valid identity session is no longer enough to establish legitimate use. Security teams should expect the governance question to shift from “was access allowed?” to “was the full runtime context still appropriate?” That is a deeper operational standard than static entitlement review.

With only 5.7% of organisations having full visibility into their service accounts, identity programmes already struggle to see non-human actors clearly. Agentic AI raises the bar again because visibility must now extend across tool use, behavioural drift, data access, and environment context in the same decision cycle. Teams that cannot assemble those signals will struggle to prove containment or explain intent.

Practitioners should treat this as a control design warning, not just a detection story. The next phase of AI governance will favour continuous verification, step-aware intervention, and tighter correlation with NHI controls such as the Ultimate Guide to NHIs and the NIST AI Risk Management Framework.

For practitioners

Correlate runtime signals before you trust an agent Join identity, data, model behaviour, posture, and environment telemetry into a single workflow view so analysts can judge appropriateness, not just permission.
Separate deployment safety from runtime safety Use posture checks to confirm baseline configuration, but require continuous runtime monitoring because a clean scan cannot rule out mid-session manipulation.
Add step-level response options Define when a suspicious action should be rewritten, when it should be blocked, and when the entire workflow must stop, rather than relying on one binary control.
Review agent workflows for trajectory-aware control gaps Test whether your current controls can see session drift, tool misuse, and gradual exfiltration that remain inside per-step thresholds but fail at the workflow level.

Key takeaways

Single-signal monitoring for agentic AI leaves exploitable blind spots because runtime appropriateness depends on correlated identity, data, behaviour, posture, and environment signals.
The operational evidence is not that one control failed, but that partial coverage can miss authorised compromise paths that still produce material breach outcomes.
Security teams should move toward trajectory-aware response models that can rewrite one bad step, block irreversible actions, and preserve legitimate workflows where possible.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10		Agentic AI risk here is runtime manipulation and tool misuse across signals.
OWASP Non-Human Identity Top 10	NHI-03	The article centers on identity scope, runtime access, and NHI governance.
NIST AI RMF		Five-signal governance aligns with AI risk governance and runtime oversight.

Review NHI scope continuously and detect when valid credentials are used outside intended task boundaries.

Key terms

Five-signal correlation: A runtime security method that combines identity, data, model behaviour, agent posture, and environment signals into one assessment. It is designed to answer whether agent activity was appropriate in context, not merely whether a request was authorised.
Step mutation: A response technique that rewrites one unsafe action inside a multi-step workflow while letting the remaining steps continue. It is useful when the overall task is legitimate but a specific action has become inappropriate or risky during execution.
Identity context collapse: The condition where an agent's identity remains valid but the surrounding context that proves legitimate use no longer aligns. The session still authenticates correctly, yet the combined runtime signals show the workflow has drifted from the intended task.

Deepen your knowledge

Agentic AI runtime governance, five-signal correlation, and step-level response are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for AI agents that act inside authorised scope, it is worth exploring.

This post draws on content published by Zenity: Five Signals, One Answer: Why Single-Signal AI Security Always Fails. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-06-03.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org