How should security teams classify agentic traffic at login without blocking legitimate users?

Why This Matters for Security Teams

agentic traffic at login is not just “automation” to be blocked or “users with tools” to be trusted. It is a trust decision about whether the session represents a legitimate workflow, a high-risk autonomous workload, or abuse trying to blend into normal sign-in patterns. That distinction matters because agents can chain tools, retry rapidly, and change behavior based on environment cues in ways human users do not. Guidance from the OWASP Agentic AI Top 10 and NHIMG’s OWASP NHI Top 10 both point to the same operational reality: identity signals alone are too coarse when the workload is autonomous.

The login layer is often where security teams either over-block legitimate AI-assisted journeys or under-classify malicious automation that is using a human-like front end. A better model treats population, intent, and workflow sensitivity as separate signals, then routes sessions into different control paths rather than a single allow or deny decision. In practice, many security teams encounter agent abuse only after a user journey has already been normalised by the application, rather than through intentional detection design.

How It Works in Practice

Classification should begin with what the client can prove and what the workflow can tolerate. Self-disclosing agents can present an explicit agent marker, workload identity, or application token, while non-disclosing agents may only reveal themselves through behavioural patterns such as fast retries, parallelised actions, or unusual login-to-action ratios. Malicious automation often tries to mimic the human login sequence, so the goal is not perfect detection at the front door, but risk-aware routing.

At implementation time, security teams typically combine several layers:

Workload identity for the agent itself, using cryptographic proof of what the agent is, not just a password or session cookie.

Intent-based authorisation, where access is evaluated at request time against the action being attempted, not only against a static role.

Just-in-time credential issuance with short TTLs, so an agent receives only the secrets needed for one task and they expire quickly.

Policy-as-code for real-time decisions, using context such as device posture, population risk, and workflow sensitivity.

This approach aligns with the intent behind the NIST AI Risk Management Framework and the CSA MAESTRO agentic AI threat modeling framework, which both emphasise contextual risk decisions rather than one-time trust at authentication. NHIMG research on AI LLM hijack breach and LLMjacking shows why static credentials and broad login trust are dangerous when attackers can move from exposed secrets to active abuse within minutes. These controls tend to break down in consumer-facing systems with high-friction sign-up flows because legitimate humans, browser automation, and adversarial bots often look identical at the login boundary.

Common Variations and Edge Cases

Tighter classification often increases false positives and support overhead, requiring organisations to balance fraud resistance against user experience and operational latency. That tradeoff is especially visible in environments with embedded copilots, browser extensions, RPA tooling, or delegated authentication where a human and an agent share the same journey.

There is no universal standard for this yet, so current guidance suggests using graduated responses instead of binary blocks. For example, a self-disclosing agent might receive a lower-friction path with narrow scope, while an unknown automation pattern could be challenged, rate-limited, or forced into a higher-assurance step before it reaches sensitive actions. This is also where logging matters: classification must preserve enough evidence to distinguish legitimate experimentation from abuse, without exposing secrets in the process. The State of Non-Human Identity Security highlights how visibility gaps and weak monitoring still drive NHI incidents, which means classification without telemetry is only partial control. In practice, edge cases cluster around shared IP ranges, proxy-heavy enterprise networks, and workflows where the agent is acting on behalf of many users, because the signal becomes ambiguous exactly where business value is highest.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A1	Login classification must account for agentic misuse and deceptive automation.
CSA MAESTRO	TRT-01	MAESTRO focuses on threat modeling autonomous workflows and agent trust boundaries.
NIST AI RMF		AI RMF governance supports context-aware decisions for autonomous traffic.

Model login flows as workflow trust zones and assign controls by agent intent and sensitivity.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams classify agentic traffic at login without blocking legitimate users?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group