What Is Fail closed? Definition & Examples

Expanded Definition

Fail closed is a security posture that treats uncertainty as a denial condition. When a policy engine, authorization service, retrieval layer, or dependency cannot confirm that a request is safe, the system blocks the action rather than guessing. In NHI and Agentic AI environments, that distinction matters because tool calls, document retrieval, and secret lookups can all become blast-radius multipliers if partial results are treated as acceptable.

Definitions vary across vendors on how strictly fail closed should behave in distributed systems. Some teams apply it only to authz decisions, while others extend it to retrieval filters, policy evaluation, and runtime enrichment steps. The operational standard is clearer than the terminology: if the system cannot establish confidence, it should default to non-disclosure and non-execution. That aligns closely with the NIST Cybersecurity Framework 2.0 emphasis on risk-based protective outcomes.

In practice, fail closed is the opposite of convenience-first behavior that silently continues after timeouts, stale caches, or incomplete policy responses. The most common misapplication is leaving a retrieval or authorization path to fail open when a dependency times out, which occurs when engineers prioritize availability over denial-safe behavior during error handling.

Examples and Use Cases

Implementing fail closed rigorously often introduces friction during outages and latency spikes, requiring organisations to weigh user experience against the risk of unauthorised access or prompt contamination.

An AI assistant requests internal policy documents, but the entitlement service returns an incomplete result set, so the retriever blocks output instead of surfacing partially authorised content.

A service account token validation endpoint times out, and the calling application denies the session rather than assuming the token is valid.

A secrets broker cannot verify the requesting workload identity, so no API key is released and the workflow stops before tool execution.

During a document-grounded chat flow, a metadata filter fails to load, and the system suppresses retrieval rather than passing unfiltered documents into the model.

When a policy decision point is unreachable, the orchestration layer returns a controlled denial and logs the event for review instead of retrying indefinitely with permissive defaults.

These patterns become especially important in compromised-identity scenarios described in NHIMG research such as the LLMjacking: How Attackers Hijack AI Using Compromised NHIs article, where attacker speed makes weak fallback logic dangerous. For identity and session assurance, the NIST Cybersecurity Framework 2.0 provides a useful reference point for protective control behavior.

Why It Matters in NHI Security

Fail closed reduces the chance that an AI agent, service account, or retrieval pipeline continues operating on incomplete trust signals. In NHI security, that matters because the object being protected is often not a human login but an automated identity with standing permissions, token access, and tool authority. If those checks degrade silently, the resulting failure mode is not just an error, but a possible disclosure event or unauthorized action path.

NHIMG research shows how quickly attackers move when credentials are exposed: in the LLMjacking coverage, exposed AWS credentials were targeted in an average of 17 minutes. That speed makes permissive fallback logic especially risky, because a transient authz failure can become an abuse window. Broader secrets exposure patterns in The State of Secrets in AppSec also reinforce that trust assumptions are often wrong in practice. Organisations typically encounter the consequences only after an outage, a token leak, or a policy service failure, at which point fail closed becomes operationally unavoidable to address.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-02	Covers secret exposure and denial-safe handling for non-human identities.
NIST CSF 2.0	PR.AC-4	Least-privilege access depends on denying access when assurance is uncertain.
NIST Zero Trust (SP 800-207)		Zero Trust requires explicit verification before access, which supports fail closed behavior.

Require explicit validation for each request and block access when trust cannot be established.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Fail closed

Expanded Definition

Examples and Use Cases

Why It Matters in NHI Security

Standards & Framework Alignment

Related resources from NHI Mgmt Group