Who is accountable when an AI agent triggers a banking error or compliance breach?

Why This Matters for Security Teams

An AI agent that can act, call tools, and move money or data is not just another application component. It is a delegated actor with operational authority, so the institution that issued that authority remains accountable when something goes wrong. That is why regulators will ask whether the bank controlled prompts, constrained scope, reviewed actions, and could reconstruct the decision path. NHI risk is not theoretical: NHI failures often become audit failures, as shown in Ultimate Guide to NHIs — Regulatory and Audit Perspectives and The 52 NHI breaches Report.

Current guidance suggests treating agent accountability as a governance problem, not a blame-shifting exercise. Frameworks such as the OWASP Agentic AI Top 10 and the NIST AI Risk Management Framework both push organisations toward documented ownership, runtime controls, and traceability for autonomous behaviour. In practice, many security teams encounter this only after a reconciliation error, payment exception, or compliance breach has already exposed the gap between “approved use case” and actual agent behaviour.

How It Works in Practice

For autonomous workloads, accountability starts with identity and ends with evidence. The bank should assign the agent a workload identity, not a shared service account, and bind that identity to a narrowly defined task. That identity should be paired with just-in-time, ephemeral secrets rather than long-lived credentials, so access can be issued per action, revoked automatically, and scoped to the specific workflow. Static RBAC is usually too coarse for agentic systems because an agent does not follow a stable human job pattern; it may chain tools, switch contexts, and act on fresh inputs in ways that no prewritten role matrix can predict.

Practitioners increasingly pair CSA MAESTRO agentic AI threat modeling framework with NIST Cybersecurity Framework 2.0 to define who owns the agent, what it may do, and how every action is logged. That should include intent-based or context-aware authorisation at request time, not just pre-approved entitlements at onboarding. A bank also needs evidence-grade logging for human prompts, tool calls, outputs, and overrides, because regulators will expect traceability across both human instruction and autonomous execution.

Bind each agent to a unique workload identity and a named business owner.

Issue JIT credentials per task, with short TTLs and automatic revocation.

Evaluate policy at runtime against intent, context, and risk.

Log prompts, tool use, and side effects so audit teams can reconstruct events.

Anthropic — first AI-orchestrated cyber espionage campaign report reinforces how quickly tool-using systems can be abused once authority is available, while OWASP NHI Top 10 highlights why overbroad identity design becomes an incident multiplier. These controls tend to break down when an agent is allowed to chain multiple tools across separate business systems because the blast radius outgrows the original approval.

Common Variations and Edge Cases

Tighter runtime control often increases operational overhead, requiring organisations to balance fast execution against stronger containment. That tradeoff is real in banking, where some teams want agents to move quickly for customer support or fraud handling, while others need slower, highly reviewed flows for payments, lending, or regulatory reporting. Best practice is evolving, and there is no universal standard for this yet, but the direction is clear: high-risk agent actions should use narrower scopes, stronger approval gates, and more frequent human confirmation.

One common edge case is a “supervised” agent that still causes a breach because the human reviewer approved the wrong intent or did not understand the downstream tool chain. Another is a multi-agent workflow where one agent inherits trust from another, creating privilege escalation that was never explicitly granted. The bank, not the model provider, is still accountable if it exposed those pathways without adequate controls. For this reason, the most defensible posture is to combine Ultimate Guide to NHIs — Why NHI Security Matters Now with standards-driven governance such as NIST AI Risk Management Framework and policy design that treats agent actions as time-bound, revocable, and fully attributable.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A01	Agent autonomy and tool use create direct prompt-to-action risk.
CSA MAESTRO	T1	Covers agent threat modeling, ownership, and control boundaries.
NIST AI RMF		AI RMF GOVERN is the accountability anchor for autonomous systems.

Map each agent to an owner, scope, and reviewed risk boundary before production use.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Who is accountable when an AI agent triggers a banking error or compliance breach?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group