Subscribe to the Non-Human & AI Identity Journal

Who is accountable when an AI agent makes a risky decision?

Accountability should rest with the organisation that authorised the agent, the human owner of the workflow, and the control process that allowed the behaviour. If an agent can act independently, the programme must preserve attribution, action logs, and policy decisions so audit and remediation are possible after the event.

Why Accountability Becomes Harder with Autonomous AI Agents

When an AI agent can select tools, chain actions, and retry on its own, accountability is no longer just a policy question. It becomes a control design question. Static RBAC was built for predictable roles, not goal-driven software that can change tactics mid-task. That is why current guidance increasingly points to runtime authorisation, workload identity, and tightly scoped NIST AI Risk Management Framework practices rather than blanket trust.

NHIMG research shows how quickly this risk becomes operational: in AI Agents: The New Attack Surface report, 80% of organisations said their AI agents had already acted beyond intended scope. That is why the organisation that authorised the agent, the human owner of the workflow, and the control process are all in the accountability chain. In practice, many security teams only discover the gap after an agent has already accessed something it should never have reached, rather than through intentional design review.

How Accountability Should Be Assigned in Practice

The practical model is layered. The organisation owns the decision to deploy the agent, define its permissions, and accept the residual risk. The workflow owner owns the business outcome and the policy intent behind the task. The platform or control owner owns the guardrails: logging, authorisation, revocation, and evidence retention. That separation matters because an autonomous agent is not a person and should not be treated like one.

Current best practice is evolving toward OWASP Agentic AI Top 10 and CSA MAESTRO agentic AI threat modeling framework guidance, which both emphasise that tools and actions must be governed at runtime. That means using workload identity for the agent, not a shared service account; issuing JIT credentials for a single task; and revoking secrets automatically on completion. It also means preserving action logs, prompts, tool calls, policy decisions, and the human approver where one exists.

  • Use workload identity to prove what the agent is, then bind policy to that identity.
  • Issue short-lived secrets and avoid long-lived static credentials wherever possible.
  • Evaluate intent at request time, because the same agent may need different access on different tasks.
  • Keep immutable logs so audit can reconstruct who authorised what, when, and under which policy.

NHIMG’s OWASP NHI Top 10 and Ultimate Guide to NHIs — Why NHI Security Matters Now both reinforce the same point: if the agent can act, then identity, secrets, and authorisation must be accountable before the action, not after the incident. These controls tend to break down in environments that still rely on shared credentials, broad automation roles, and incomplete audit trails because the agent can move faster than manual review.

Where the Standard Answer Breaks Down

Tighter control often increases operational overhead, so organisations have to balance speed against evidentiary certainty. That tradeoff is real, especially in high-volume workflows where every decision cannot be manually approved. In those cases, current guidance suggests using policy-as-code, pre-approved action envelopes, and exception handling rather than opening broad standing access.

There is no universal standard for accountability assignment across all agentic deployments yet, but the direction is clear. If the agent can make decisions independently, then accountability must map to the authorising organisation, the workflow owner, and the control plane that allowed the action. The moment secrets are shared, logs are incomplete, or approvals are informal, blame becomes difficult to prove and remediation becomes slow. That is why NIST AI Risk Management Framework and Top 10 NHI Issues both matter here: they frame accountability as a control outcome, not a job title.

In practice, accountability becomes ambiguous fastest when multiple teams share one agent platform, because no single owner can explain the decision chain end to end.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A1 Agentic risk controls address autonomous actions and unexpected tool use.
CSA MAESTRO MAESTRO frames threat modelling and governance for autonomous agents.
NIST AI RMF AI RMF governance supports accountability, traceability, and oversight.

Define ownership, approval, and logging controls for every agent workflow before release.