Agent identity is replacing bot detection in agentic AI security

By NHI Mgmt Group Editorial TeamPublished 2026-05-13Domain: Agentic AI & NHIsSource: Arkose Labs

TL;DR: Agentic AI is collapsing the old bot-versus-human model because legitimate agents can book, negotiate, and file tickets while adapting in real time, according to Arkose Labs’ interview with Paul Rockwell. Access review processes assume access persists long enough to be reviewed; autonomous traffic can create, combine, and discard permissions within a single session.

At a glance

What this is: This is an Arkose Labs interview arguing that agentic AI makes bot detection alone insufficient because security teams now need an identity layer for agents and explicit authorization boundaries.

Why it matters: It matters because IAM, fraud, and trust and safety teams must govern machine actions, not just human logins, across NHI, autonomous, and human identity programmes.

👉 Read Arkose Labs' interview on agent identity and reasoning attacks

Context

Agentic AI changes the control problem from spotting automated traffic to governing who or what is authorised to act. In practice, that means security teams need to distinguish legitimate agent behaviour from abuse, define scope for machine actions, and preserve auditability when an agent acts on behalf of a person.

The article frames a shift that many IAM and trust and safety programmes are not yet built for: identity becomes the boundary for machine action, not just a record of user authentication. That creates direct pressure on NHI governance, agent authorisation, and accountability models that were designed for human-paced interactions.

Key questions

Q: How should security teams govern AI agents that act on behalf of users?

A: Security teams should govern AI agents as identities with explicit owners, scopes, and audit trails rather than as generic automation. The practical test is whether the agent is authorised to take a specific action, on a specific dataset or workflow, for a specific purpose. If that cannot be answered, the organisation does not yet have a usable control model for agentic activity.

Q: Why do agentic AI systems weaken traditional bot detection?

A: Agentic AI weakens traditional bot detection because the harmful behaviour may be unique, adaptive, and spread across multiple steps instead of repeating a fixed script. A rules engine tuned to repeated patterns will miss attacks that change tactics in response to each error message or control. That is why authorisation and behaviour correlation matter more than simple automation flags.

Q: What breaks when fraud controls assume attacks are repetitive?

A: When fraud controls assume attacks are repetitive, they miss reasoning attacks that alter tactics mid-session and compose actions across platforms. The control failure is not lack of data, but lack of a model for adaptive sequencing. Organisations should treat cross-platform identity correlation as a core fraud requirement, not an optional enhancement.

Q: What is the difference between consumer bot detection and agent identity governance?

A: Consumer bot detection asks whether traffic is automated. Agent identity governance asks who authorised the agent, what it is allowed to do, and how its actions are recorded. The first is a traffic classification problem. The second is an identity and access problem that determines accountability, scope, and liability.

Technical breakdown

Agent identity layers for agentic AI traffic

Traditional bot detection assumes the key question is whether traffic is human or automated. For agentic AI, that binary is too coarse. The control surface becomes the agent identity itself: who authorised it, what permissions it has, and which actions are within scope. This is closer to NHI governance than consumer fraud because the problem is durable identity, delegated authority, and auditable action history. Without an identity layer, security tooling cannot reliably separate legitimate machine behaviour from abuse that looks operationally normal.

Practical implication: build an inventory of authorised agents and bind each one to explicit scope, owner, and audit records.

Reasoning attacks and adaptive multi-step abuse

A reasoning attack is different from a replay attack because the adversary does not repeat the same script. Instead, the agent adapts mid-session, tests different paths, and uses feedback from the target environment to refine the next move. That breaks rules engines tuned to repeated patterns and single-step fraud events. When the same session can span account creation, credibility building, data extraction, and redeployment, the abuse chain crosses products and time horizons in ways legacy detection models were not built to correlate.

Practical implication: correlate identity, session, and cross-platform telemetry so adaptive sequences do not disappear between separate controls.

Agent-to-agent interaction as an ungoverned delegation model

When one AI agent negotiates with another, neither side necessarily has a human approval gate in the moment of action. That creates a delegation problem, not just a detection problem. Existing trust and safety controls largely assume one side of the interaction is human and can be held accountable through conventional policy and review. Agent-to-agent exchanges weaken that assumption because authorization, liability, and intended scope all become distributed across systems that can act faster than review cycles.

Practical implication: define policy for machine-to-machine delegation before agents are allowed to transact with each other.

Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

Agentic AI does not just expand the NHI estate, it changes what identity means. When a machine can act on behalf of a person, the key governance question stops being whether traffic is automated and becomes whether the action is authorised, bounded, and attributable. That is a different control model from consumer bot detection, and it belongs in the same governance conversation as service accounts and workload identity. Practitioners should treat agent identity as a first-class identity domain, not a logging afterthought.

Reasoning attack: the article names a useful failure mode for the field. The old fraud assumption was that abuse would repeat predictably enough to pattern-match. That assumption fails when the actor adapts in-session, changes tactics midstream, and composes a multi-step path across platforms. The implication is that controls built around static signatures and isolated events are structurally mismatched to agentic abuse.

Agent-to-agent negotiation creates a delegation chain with no stable human checkpoint. Trust and safety models often rely on a human operator somewhere behind the action, even if indirectly. That assumption weakens when both sides are machine actors that can negotiate scope and timing without human intervention. Practitioners need to rethink attribution, liability, and policy enforcement across machine-to-machine interactions rather than treating them as a variant of user automation.

Intent becomes the missing control variable in agent governance. The article’s strongest governance point is not about detection precision, but about authorization semantics. If security tools cannot express what an agent is allowed to do, they cannot distinguish useful autonomy from abuse. The practical conclusion is that policy must move from traffic classification to intent and scope enforcement.

From our research:
96% of technology professionals identify AI agents as a growing security threat, and 66% believe this risk is immediate, according to AI Agents: The New Attack Surface report.
Only 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.
That governance gap is why the OWASP Agentic Applications Top 10 matters for practitioners building policy, audit, and containment around autonomous actions.

What this signals

The operational signal for IAM and trust and safety teams is clear: agent identity is becoming a control plane, not a feature. With 80% of organisations reporting agents acting beyond intended scope in the referenced research, the next governance failure is likely to be attribution, not mere detection. Teams that can bind authorisation, auditability, and scope to each agent will be better positioned to absorb the shift without treating every machine action as hostile.

Reasoning attack governance: once an adversary can adapt within a single session, the programme problem is no longer signature quality but policy expressiveness. Security teams should connect agent authorisation to purpose, tool scope, and transaction context, then validate that downstream review processes can actually consume those signals. For broader framework context, the OWASP Top 10 for Agentic Applications 2026 is a useful reference point for control design.

For practitioners

Define an agent identity registry Record every authorised AI agent, its business owner, approved tools, scope boundaries, and revocation path so machine action is attributable end to end.
Replace human-versus-bot checks with authorisation checks Evaluate whether each automated interaction is permitted, not merely whether it looks automated, and attach policy to the action scope rather than the traffic source.
Correlate cross-session behaviour Join identity, session, and platform telemetry so adaptive multi-step abuse can be detected across account creation, data access, and downstream use.
Set policy for agent-to-agent delegation Require explicit rules for machine-to-machine negotiation, including who can authorise it, what can be exchanged, and when the chain must be terminated.

Key takeaways

Agentic AI breaks the old bot-versus-human assumption by making authorised machine actions look operationally normal.
Adaptive, multi-step attacks are harder to catch than repetitive bot traffic because the tactic changes within a single session.
Security teams need an identity layer for agents, with scope, ownership, and auditability defined before deployment scales.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10		Covers agent identity, tool misuse, and adaptive abuse in agentic systems.
NIST AI RMF		Supports governance for AI systems that make or influence decisions.
NIST CSF 2.0	PR.AA-01	Identity and access controls must cover machine actors, not only humans.

Map agent authorization, scope, and telemetry to agentic AI controls before broad deployment.

Key terms

Agent Identity: Agent identity is the set of attributes, permissions, and accountability signals attached to an AI system that can act on behalf of a user or organisation. In practice, it must answer who authorised the agent, what it may do, and how its actions are traced across sessions and tools.
Reasoning Attack: A reasoning attack is an adaptive abuse pattern in which the attacker changes tactics based on the target’s responses instead of replaying a fixed script. This makes the attack harder to detect with signature-based controls and more dependent on identity, policy, and cross-session correlation.
Agent-To-Agent Delegation: Agent-to-agent delegation is a machine interaction pattern where one AI agent negotiates or transacts with another without a human in the immediate loop. That creates new governance requirements for scope, authorisation, and accountability because both sides can act dynamically.
Authorisation Scope: Authorisation scope is the boundary that defines what an identity may do, where it may do it, and under what conditions. For agentic systems, scope has to be explicit enough to govern tool use, data access, and downstream actions, not just login permissions.

Deepen your knowledge

Agent identity governance and authorised machine action are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for agentic AI in a similar starting point, it is worth exploring.

This post draws on content published by Arkose Labs: AI “It’s Not a Replay Attack. It’s a Reasoning Attack.” by Cassie Stevenson. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-13.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org