OpenClaw exposes why autonomous agent governance is breaking IAM

By NHI Mgmt Group Editorial TeamPublished 2026-02-10Domain: Breaches & IncidentsSource: EnforceAuth

TL;DR: OpenClaw rapidly reached 150,000 GitHub stars and 300,000 to 400,000 users in under two weeks, while researchers found 341 malicious skills, CVE-2026-25253 with CVSS 8.8, and 42,665 exposed instances, according to EnforceAuth. The real failure is not access control alone but the assumption that authenticated agents can be safely governed after they begin acting independently.

At a glance

What this is: OpenClaw is a fully autonomous AI agent whose rapid adoption exposed a structural authorization gap, with malicious skills, exposed instances, and runtime abuse showing why traditional IAM assumptions fail.

Why it matters: It matters because enterprises are now governing actors that can browse, call APIs, run shell commands, and persist across sessions, which changes how NHI, autonomous, and human identity programmes define trust, review, and accountability.

By the numbers:

Researchers discovered 341 malicious skills designed to steal credentials.
14% single-day jump in Cloudflare's stock., p in Cloudflare's stock.

👉 Read EnforceAuth's full OpenClaw security analysis

Context

OpenClaw is an autonomous AI agent, not a chatbot. It can select tools, execute shell commands, browse the web, send messages, and retain memory across sessions, which means its identity behaviour is defined by runtime action rather than login alone. That breaks the assumption that access can be reviewed after the fact, because the actor is already deciding and acting on its own.

For IAM and NHI teams, the immediate issue is not only secret exposure or prompt injection. The deeper problem is authorization at the moment of action, especially when an agent can inherit credentials, process untrusted inputs, and chain external communications without a human approval gate. This is typical of what happens when agentic systems move from experimentation into production without a governance model built for autonomy.

The article also shows how ecosystem scale amplifies the problem. A local agent, a skill marketplace, persistent memory, and cross-platform messaging create a wider trust boundary than conventional enterprise automation ever had, which is why autonomous AI security now belongs in identity governance rather than a separate AI-only track.

Key questions

Q: What breaks when autonomous AI agents are governed like normal user accounts?

A: Identity controls built for human sessions assume access is granted, used, and then reviewed later. Autonomous agents break that model because they can chain tools, change state, and act before a reviewer intervenes. The result is an authorization gap where login is visible but runtime behaviour is not. Governance must move to per-action control, not just access assignment.

Q: Why do autonomous agents complicate least privilege in enterprise IAM?

A: Least privilege is easy to define when the actor's intent is stable at provisioning time. Autonomous agents decide at runtime which tools to use and which path to take, so privilege needs can shift inside a single session. That makes static entitlements too blunt for safe governance. Teams need contextual authorization that evaluates the action, not only the identity.

Q: How do memory and persistent state change AI agent security risk?

A: Persistent memory lets an agent carry instructions, preferences, and learned context across sessions, which means harmful input can survive logout. This turns a one-time compromise into a durable governance problem because later decisions may inherit poisoned state. Security teams should treat memory stores as part of the agent's trust boundary, not as passive logs.

Q: Who is accountable when an autonomous agent sends data or changes systems on its own?

A: Accountability sits with the organisation that granted the agent's access and allowed its operating conditions. If a human sponsor, platform owner, or security team did not define approval boundaries, they still own the outcome. The practical test is whether the delegation chain was explicit, revocable, and monitored at the point of action.

Technical breakdown

Autonomous agents turn login into only the first decision

Traditional IAM assumes identity is validated before the meaningful work begins. With an autonomous agent, authentication is only the starting point because the agent then decides which tool to use, which data to read, and when to act. That means the real security boundary is the runtime decision, not the session start. OpenClaw’s ability to operate through messaging platforms, shells, APIs, and memory makes that boundary continuous, not discrete. In practice, every downstream action becomes an authorization event that static role assignment cannot describe well.

Practical implication: move from session-level access checks to per-action authorization for autonomous workloads.

Persistent memory creates a new identity state surface

OpenClaw’s SOUL.md, MEMORY.md, and SQLite-based history show that agent identity is not just credentials and tokens. Persistent memory becomes part of the control plane because it can store instructions, boundary definitions, and learned behaviour across sessions. That creates a stateful attack surface where poisoning one session can alter later actions without re-authentication. Conventional secrets management can reduce exposure, but it does not govern how memory is written, mutated, or re-used as part of decision-making.

Practical implication: treat agent memory as governed state and enforce integrity controls on every write path.

Malicious skills and untrusted content are a tool-chain problem

OpenClaw’s skill marketplace and embedded content ingestion show how autonomous agents can absorb hostile instructions through third-party skills, posts, emails, and web pages. This is not only prompt injection. It is a tool-chain issue in which untrusted content becomes actionable because the agent can connect it to tools and external communication channels. The result is a lethal combination of private data access, untrusted input, outbound communication, and memory persistence. That combination creates a runtime governance problem that perimeter controls were never intended to solve.

Practical implication: validate third-party skills and content sources before they can influence tool execution or external communication.

Threat narrative

Attacker objective: The objective is to turn an autonomous agent's trusted runtime access into persistent control, credential theft, and silent exfiltration across connected enterprise systems.

Entry began when attackers and malicious authors exploited public exposure, compromised skills, or crafted links that reached the agent gateway and its connected services.
Credential access and abuse followed through stolen gateway tokens, exposed API keys, OAuth tokens, and malicious skills that could exfiltrate data or alter configuration.
Escalation occurred as the agent executed shell commands, API calls, and persistent re-injection logic, turning a local assistant into a durable command-and-control path.
Impact was broad credential theft, remote code execution, data exfiltration, and agent-to-agent abuse across messaging, email, and cloud-connected workflows.

Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

Authorization based on a stable login session was designed for actors whose intent does not change mid-execution. That assumption fails when the actor is autonomous because the agent selects tools, combines inputs, and executes actions after authentication without a human approval gate. The implication is that identity governance must stop treating login as the primary security event for agentic systems. OpenClaw makes that failure visible because the threat emerges inside the runtime, not at the door. This is the core assumption collapse behind autonomous identity governance.

Persistent memory is not an add-on feature, it is a governance boundary. When an autonomous agent carries state across sessions, identity, policy, and behaviour no longer reset cleanly at logout. That means the control question changes from who has access to what, to what state can the agent carry forward into the next decision cycle. Practitioners should recognise this as memory-driven identity persistence, not simple data retention.

Tool chaining is the real blast-radius multiplier. OpenClaw can read data, act on it, and communicate externally in the same reasoning loop, which means compromise of one input channel can cascade into outbound harm. This is exactly where OWASP-NHI and OWASP Agentic AI concerns intersect: the same actor can become both the consumer and the executor of trust. The field now needs policy that governs action combinations, not isolated permissions.

The authorization gap is a category-level problem, not a deployment mistake. IAM programs built around authenticate, assign, and review were sufficient when action followed identity in a predictable sequence. Autonomous agents break that sequence by making multiple decisions between human checkpoints. The practitioner conclusion is that governance models must account for independent action timing, not only role design.

OpenClaw shows why autonomous agent security belongs in the same governance conversation as NHI lifecycle management. The same offboarding, review, and least-privilege disciplines still matter, but they are no longer enough when the actor can alter its own operating context between reviews. That is where agentic AI governance becomes an extension of identity governance, not a separate discipline.

From our research:
33% of organisations report their AI agents have accessed inappropriate or sensitive data beyond their intended scope, according to AI Agents: The New Attack Surface.
Only 44% of organisations have implemented any policies to govern AI agents, even though 92% say governance is critical to enterprise security.
For a broader framework lens, see OWASP NHI Top 10 for the runtime risks that policy must cover.

What this signals

Authorization gaps are now the default failure mode for agentic systems. The practical lesson for security teams is that access review cadences and token inventory alone will not keep up with actors that make multiple decisions between human checkpoints. The programme signal is to re-centre governance on runtime decisions, not just on identity proofing or secret storage.

Persistent state turns AI agent governance into a lifecycle problem. Once an agent can store memory, learn across sessions, and re-enter workflows later, offboarding is no longer a simple token revocation exercise. Teams should align agent lifecycle controls with the NHI Lifecycle Management Guide and think in terms of state reset, not just account closure.

With 98% of companies planning to deploy even more AI agents within the next 12 months, the gap between adoption and governance is widening faster than most identity programmes can absorb. The signal for practitioners is to formalise ownership, approval boundaries, and auditability before more agents inherit production access.

For practitioners

Inventory autonomous agent runtimes and their connected surfaces Map every agent that can browse, call APIs, send messages, run shell commands, or retain memory across sessions. Include messaging platforms, local gateways, skill marketplaces, and any account the agent inherits from a human sponsor.
Separate approval boundaries from execution boundaries Require a distinct policy check for each high-risk action instead of relying on a single login or session grant. Focus especially on outbound communication, file writes, configuration changes, and tool calls that cross system boundaries.
Treat persistent memory as governed state Apply integrity controls to memory files and history stores, and review who can write boundary instructions, task context, and persistent preferences. If the memory layer is mutable, it needs lifecycle controls just like credentials.
Constrain third-party skills before runtime use Review marketplace skills, custom tools, and external content feeds before they influence agent behaviour. Block skills that can silently exfiltrate data, fetch remote instructions, or modify agent policies without explicit authorisation.
Add agent-specific offboarding and reset checks When a use case ends, revoke the inherited access, clear durable state, and confirm no scheduled re-injection path remains. Offboarding must include the memory layer, not just the account or token.

Key takeaways

OpenClaw demonstrates that autonomous agents fail differently from ordinary software because they make and chain decisions after authentication.
The exposure numbers are already material, with 341 malicious skills, 42,665 exposed instances, and a CVSS 8.8 gateway exploit showing the scale of the problem.
The control that matters most is runtime authorization of each action, because static identity review cannot govern an actor whose state and tools change mid-session.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10		OpenClaw maps directly to agentic AI risks like tool misuse and memory poisoning.
OWASP Non-Human Identity Top 10	NHI-03	Persistent secrets and exposed gateway tokens make NHI rotation and exposure controls central.
NIST AI RMF		Autonomous agent governance needs organisational accountability and continuous risk treatment.

Inventory and rotate agent credentials, then reduce standing exposure paths for each runtime.

Key terms

Autonomous AI Agent: A software entity that can decide what action to take, which tool to use, and when to act without a human approval gate for each step. In security terms, it behaves like a runtime identity with decision authority, not just an automated workflow.
Authorization Gap: The gap between proving that an identity is authenticated and governing what that identity is allowed to do at the moment of action. For autonomous systems, this gap widens because decisions happen after login and can chain across tools, systems, and sessions.
Persistent Memory: Stored state that an AI agent carries across sessions, such as instructions, preferences, history, or learned context. It matters because state can be poisoned, reused, or modified, turning memory into a control surface rather than a passive archive.
Decision-Centric Authorization: An authorization model that evaluates each action as a separate governed event using context, policy, and resource sensitivity. It is designed for systems where identity alone is not enough to explain or constrain behaviour at runtime.

Deepen your knowledge

OpenClaw security, autonomous agent governance, and runtime authorization are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for agentic systems that can act across sessions, it is worth exploring.

This post draws on content published by EnforceAuth: OpenClaw security crisis and decision-centric authorization analysis. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-02-10.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org