Subscribe to the Non-Human & AI Identity Journal
Home FAQ Agentic AI & Autonomous Identity Who is accountable when an autonomous agent creates…
Agentic AI & Autonomous Identity

Who is accountable when an autonomous agent creates a harmful promise?

← Back to all FAQ
By NHI Mgmt Group Editorial Team Updated June 7, 2026 Domain: Agentic AI & Autonomous Identity

Accountability sits with the organisation that granted the agent authority, because the promise was made inside its delegated control model. Legal and operational teams should treat the event as an authorisation failure if the business cannot prove that a human or policy gate approved the commitment before execution.

Why This Matters for Security Teams

An autonomous agent that makes a harmful promise is not just a bad output problem. It is an authority problem. Once an agent can commit the organisation to action, the real risk shifts to whether that commitment was allowed by policy, evidence, and approval flow. Current guidance suggests treating these events as governance failures, not as isolated model hallucinations, because the damage often begins before any human reviews the outcome.

This is why agentic systems must be governed through runtime controls, not only prompt rules or static role assignments. The concern is especially acute when agents can access tools, chain actions, or speak on behalf of the business without a live approval gate. NHIMG research shows that AI agents have already acted beyond intended scope in 80% of organisations surveyed, while only 44% have any policies in place to govern them, underscoring the gap between deployment and control. See the broader risk picture in AI Agents: The New Attack Surface report and the threat framing in OWASP Agentic AI Top 10.

In practice, many security teams discover the commitment problem only after procurement, legal, or customers have already relied on the agent’s promise.

How Accountability Actually Works in Agentic Systems

For autonomous agents, accountability should be anchored to the organisation that delegated authority, then traced through the human owner, policy owner, and system owner who allowed that delegation. The agent is the mechanism, not the accountable legal actor. That distinction matters because a harmful promise usually reflects a breakdown in authorisation design: the system allowed the agent to speak, commit, or negotiate outside an approved context.

Practitioners should design around three layers:

  • Delegation: define what the agent may promise, commit, approve, or expose.
  • Runtime authorisation: evaluate each action against current context, not just a pre-set role.
  • Proof and audit: preserve logs that show who approved the capability, when it was issued, and what policy allowed the commitment.

Best practice is evolving toward intent-based controls, just-in-time credentials, and workload identity so the agent presents cryptographic proof of what it is and what it is authorised to do. The NIST AI Risk Management Framework and the CSA MAESTRO agentic AI threat modeling framework both reinforce the need for governance, mapping, and continuous risk evaluation. NHIMG’s OWASP Agentic Applications Top 10 also aligns with the practical need to prevent tool misuse, overreach, and unsafe delegation.

When the commitment path is routed through long-lived credentials, broad tool access, or informal approval habits, these controls tend to break down in high-volume support, sales, or procurement workflows because the agent can act faster than the organisation can review.

Where the Standard Answer Breaks Down

Tighter control over agent authority often increases friction, so organisations have to balance speed against the need for provable approval. That tradeoff becomes sharper in environments where agents interact with customers, suppliers, or regulated workflows, because a promise may create contractual or compliance exposure even if the underlying intent was harmless.

There is no universal standard for this yet, but current guidance suggests treating any outward-facing commitment as a privileged action requiring policy-based approval, bounded language templates, and auditable escalation. This is especially important when the agent can generate offers, service guarantees, remediation commitments, or legal-sounding assurances. In those cases, the safest pattern is to keep the agent as a drafter or recommender rather than a final speaker.

Security teams should also watch for cross-domain delegation. An agent may have been approved for internal task execution but still make harmful promises through a chat interface, ticketing system, or voice channel. That creates an accountability gap between what the system can do and what the business intended it to do. NHIMG’s Ultimate Guide to NHIs — 2025 Outlook and Predictions shows why this gap is often missed until after damage occurs, while the NIST AI Risk Management Framework remains the most practical reference for assigning ownership, measuring risk, and documenting decision paths.

In practice, the hardest cases involve customer-facing agents with partial autonomy, because the promise is visible externally before the control failure is visible internally.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Agentic AI Top 10A1Agent overreach and unsafe commitments map to agentic application abuse.
CSA MAESTROMAESTRO addresses governance and threat modeling for autonomous agents.
NIST AI RMFAI RMF covers accountability, governance, and risk controls for AI systems.

Restrict agent tool use and external commitments to explicitly approved workflows.

NHIMG Editorial Note
Reviewed and updated by the NHIMG editorial team on June 7, 2026.
NHI Mgmt Group — the #1 independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org