Subscribe to the Non-Human & AI Identity Journal

Who is accountable when a fraudster convinces an agent to bypass verification?

Accountability usually sits with the organisation, not the individual agent alone. If the workflow allows exceptions under pressure, the issue is a governance failure in policy, training, and control design. Regulators and auditors will expect the contact center to demonstrate that high-risk actions required stronger verification and that exceptions were monitored, recorded, and reviewed.

Why This Matters for Security Teams

When a fraudster persuades an agent or agent-assisted workflow to bypass verification, the failure is rarely just “social engineering.” It is a control-design problem: the organisation gave an autonomous or semi-autonomous system enough reach to take a risky action without sufficiently strong, context-aware approval. That is why current guidance treats this as a governance and assurance issue, not an incident that can be solved by blaming the software.

For agentic systems, static role definitions and pre-approved pathways are often too blunt. A fraudster can manipulate the conversation, redirect the goal, or trigger an exception path that was meant to be rare. Security teams should map those failure modes against the OWASP Agentic AI Top 10 and the NIST AI Risk Management Framework, then decide where human review is mandatory and where automated denial is safer. NHI Management Group’s guidance on the Ultimate Guide to NHIs also shows why poor credential discipline amplifies downstream abuse.

In practice, many security teams discover weak verification controls only after an exception has already been used to move money, reset an account, or expose data.

How It Works in Practice

Accountability should be assigned across the control chain: the organisation owns the policy, the product or platform owner owns the workflow design, and the operations or fraud team owns the exception handling and monitoring. If an AI agent is involved, the question becomes whether it was authorised to act on the request, whether the request was in scope, and whether the system required enough proof before proceeding.

That is where static IAM often fails. A role that is “allowed to help customers” is not precise enough when the real decision is “allow account recovery only after step-up verification, unless a validated fraud signal is present.” For autonomous and semi-autonomous systems, best practice is evolving toward runtime authorisation: decisions made at the moment of action based on intent, context, confidence, and risk. Pair that with CSA MAESTRO agentic AI threat modeling framework and the MITRE ATLAS adversarial AI threat matrix to model how manipulation, prompt injection, or workflow redirection could lead to unsafe privilege use.

  • Use step-up verification for high-risk actions such as refunds, password resets, or identity changes.
  • Issue just-in-time access for agents and staff where elevated privilege is needed only for a single task.
  • Keep secrets short-lived and revoke them automatically after the transaction completes.
  • Log the original request, the verification step, the exception reason, and the approving identity.
  • Review patterns of repeated overrides as a control weakness, not as isolated human error.

This matters even more when the system has broad tool access or can chain actions across channels. NHI Management Group reports that 80% of identity breaches involved compromised non-human identities such as service accounts and API keys, which is a strong warning that weak machine identities can turn one bypass into a wider compromise. These controls tend to break down when contact-center workflows mix human judgment, agent automation, and legacy exception routing because the approval boundary becomes ambiguous.

Common Variations and Edge Cases

Tighter verification often increases friction, so organisations have to balance fraud resistance against customer abandonment and operational cost. There is no universal standard for this yet, but current guidance suggests that high-value or account-altering actions should use stronger controls than low-risk informational requests.

One common edge case is the “helpful override,” where a well-intentioned employee bypasses the process for an urgent customer. Another is delegated automation, where an agent has authority to prepare a change but not to finalise it. In both cases, the organisation remains accountable if the policy allowed the exception without sufficient guardrails. The right response is usually to separate recommendation from execution, require independent approval for irreversible actions, and make exception paths visibly auditable.

For teams building agentic workflows, the emerging pattern is to anchor authority to workload identity and runtime policy rather than to a broad role alone. That approach aligns with how AI LLM hijack breach scenarios unfold in the real world and with the risk-management expectations reflected in the NIST AI Risk Management Framework. Where the workflow depends on a long-lived secret, an undocumented exception rule, or an unreviewed manual override, the accountability problem becomes a governance gap rather than a one-off fraud event.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A3 Covers tool misuse and unsafe agent actions under manipulation.
CSA MAESTRO Threat modeling helps trace accountability across agent workflows.
NIST AI RMF GOVERN Govern function defines accountability, oversight, and policy ownership.

Gate agent actions by risk and require step-up checks before high-impact tool use.