TL;DR: Autonomous agents can create legally binding commitments at machine speed, as illustrated by an airline refund case that cost eight figures, according to Strata Identity. Access review processes assume decisions remain visible long enough for humans to intercept them; autonomous behaviour collapses that window inside the session.
NHIMG editorial — based on content published by Strata Identity: The $10 million lesson in why machines need adult supervision
Questions worth separating out
Q: How should teams govern autonomous agents that can make binding commitments?
A: Teams should separate proposal authority from binding authority.
Q: What breaks when human review thresholds are too slow for agent actions?
A: The review model breaks because the system can complete the action before the reviewer sees it.
Q: How do security teams know if HITL is actually working for agents?
A: HITL is working when high-impact actions consistently pause before completion, route to a qualified human, and produce a record that shows what was requested and who approved it.
Practitioner guidance
- Classify commitment-bearing agent actions separately Map which agent actions can create financial, legal, or reputational obligations and treat them as a distinct control class.
- Set consequence-based approval thresholds Define dollar, data, and regulatory thresholds that trigger human review before the agent can complete the action.
- Log reviewer rationale with the agent decision Capture the requested action, the policy decision, the human approver, and the rationale in the same record.
What's in the full article
Strata Identity's full article covers the operational detail this post intentionally leaves for the source:
- The exact thresholding approach for routing agent actions to human review before commitment.
- The agentic sandbox workflow used to rehearse failures without exposing production systems.
- The logging and evidence trail fields needed to defend oversight decisions during audit or legal review.
👉 Read Strata Identity's analysis of human-in-the-loop controls for autonomous agents →
Autonomous agents and human oversight: are your controls keeping up?
Explore further
Human approval gates were designed for decisions that remain reviewable long enough to intercept. That assumption fails when the actor is autonomous because it can select actions, sequence them, and execute them before a review queue even forms. The implication is not simply that teams need more review points. It is that governance built around delayed human judgment no longer matches the actor's runtime speed.
A few things that frame the scale:
- 80% of identity breaches involved compromised non-human identities such as service accounts and API keys, according to the Ultimate Guide to NHIs.
- 91.6% of secrets remain valid five days after the targeted organisation is notified, showing how slowly remediation can follow exposure.
A question worth separating out:
Q: Who is accountable when an autonomous agent creates a harmful promise?
A: Accountability sits with the organisation that granted the agent authority, because the promise was made inside its delegated control model. Legal and operational teams should treat the event as an authorisation failure if the business cannot prove that a human or policy gate approved the commitment before execution.
👉 Read our full editorial: Human-in-the-loop controls for autonomous agents: where oversight fails