Subscribe to the Non-Human & AI Identity Journal

How should security teams detect malicious AI agents in commerce flows?

Security teams should combine persistent device intelligence, pre-login risk signals, and action-level behaviour monitoring. A malicious AI agent often looks legitimate in isolation, so the key is to score the full journey, including repeated environment reuse, velocity, and unusual sequencing, rather than relying on one control signal alone.

Why This Matters for Security Teams

Malicious AI agents are difficult to spot because they can borrow valid credentials, imitate normal commerce traffic, and complete low-friction actions that look benign until they are chained together. Security teams cannot rely on a single login event or a one-time device check. The real risk is that an agent can reuse the same environment across many sessions, test boundaries quickly, and pivot faster than a human operator.

That is why current guidance favors layered detection: persistent device intelligence, pre-login risk scoring, and action-level monitoring that looks for velocity, sequencing, and reuse patterns. This is consistent with the direction of OWASP Agentic AI Top 10 and the broader risk framing in the NIST AI Risk Management Framework, both of which emphasize that autonomous systems must be governed at runtime, not just at enrollment.

NHI Management Group research also shows how hard this visibility gap can be in practice: The State of Non-Human Identity Security reports that only 1.5 out of 10 organisations are highly confident in securing NHIs. In practice, many security teams encounter malicious agent behavior only after abnormal commerce activity has already succeeded, rather than through intentional pre-transaction detection.

How It Works in Practice

Commerce-flow detection works best when the security stack treats the agent as a continuing actor, not a single authenticated session. The strongest signals come from combining what the agent is, what it is trying to do, and how that activity unfolds across time. That means binding telemetry from device posture, identity assurance, session context, and transaction behavior into one scoring model.

A practical sequence looks like this:

  • Establish persistent device intelligence so the same runtime, emulator, or browser environment can be recognized across multiple sessions.
  • Apply pre-login risk checks to detect impossible travel, unusual client fingerprints, rotating infrastructure, or recycled environments before the agent reaches commerce endpoints.
  • Monitor action-level behavior for fast retries, unusual product-view-to-checkout ratios, coupon abuse, cart stuffing, account takeover patterns, and abnormal API sequencing.
  • Correlate those events with workload identity and token provenance, so detection can distinguish a legitimate automation workflow from a suspicious one.

This approach aligns with NIST Cybersecurity Framework 2.0 and the agentic control logic discussed in OWASP NHI Top 10, because both imply continuous evaluation rather than static approval.

For high-confidence response, teams should separate detection from enforcement. A transaction may be challenged, rate-limited, step-up authenticated, or blocked depending on the confidence score and business criticality. That is especially important where agents can chain tools, scrape catalog data, test checkout paths, and adapt their behavior after each friction point. These controls tend to break down in headless, API-only commerce environments because the same automation patterns used by legitimate integrators can also be used by malicious agents.

Common Variations and Edge Cases

Tighter detection usually increases friction for legitimate automation, so organisations have to balance fraud resistance against conversion, partner experience, and support overhead. There is no universal standard for this yet, especially in commerce environments where bots, RPA, customer service workflows, and AI agents can overlap.

One common edge case is the trusted agent that behaves maliciously only after compromise. In that scenario, a clean login is not enough, because the risk emerges later when the agent starts reusing sessions, escalating actions, or calling tools outside its usual pattern. Another edge case is delegated commerce automation, where a partner system has broad but legitimate access. Here, best practice is evolving toward intent-aware authorisation and short-lived credentials rather than broad standing privileges.

Teams should also be cautious about overfitting to one fraud signature. A malicious AI agent may not look like a botnet, and a well-tuned commercial integration may look suspicious if only one signal is used. Research from Top 10 NHI Issues and the CSA MAESTRO agentic AI threat modeling framework supports the same operational lesson: detection should be contextual, not binary, because the same technical pattern can be legitimate automation in one flow and an attack in another.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A2 Agent abuse patterns in commerce align with runtime misuse controls.
CSA MAESTRO TM-02 MAESTRO focuses on threat modeling agent behavior and tool abuse.
NIST AI RMF AI RMF supports ongoing risk monitoring for autonomous system behavior.

Use AI RMF governance to define monitoring, escalation, and accountability for agentic fraud.