Subscribe to the Non-Human & AI Identity Journal

What breaks when certificate pinning is not paired with a recovery path?

The biggest failure is that clients can keep rejecting valid replacement certificates after revocation, issuer changes, or key compromise. That turns a routine certificate event into an availability problem. If the pin is widely deployed, the recovery burden shifts from the certificate team to every client, app, or device that recorded the old trust value.

Why This Matters for Security Teams

Certificate pinning is often introduced to stop man-in-the-middle interception, but the operational risk appears when the pinned trust anchor changes. If a certificate is revoked, reissued, or replaced during an incident, every client that hard-coded the old value may fail closed and take production traffic with it. That turns a security control into an availability dependency.

This matters because machine identities are already difficult to manage at scale. In The Critical Gaps in Machine Identity Management report from SailPoint, 53% of organisations said they had experienced a security incident directly related to machine identity management failures, and only 38% had automated certificate lifecycle management in place. That combination makes brittle trust logic especially dangerous.

Security teams often assume pinning is a “set and forget” protection, but the real question is how clients recover when trust must legitimately change. In practice, many security teams encounter certificate pinning failure only after revocation or issuer migration has already interrupted service, rather than through intentional recovery testing.

How It Works in Practice

When pinning is used correctly, a client validates that the server presents a specific public key, certificate hash, or intermediate trust path. That can reduce exposure to rogue issuers and some interception scenarios. The problem is not pinning itself. The problem is treating the pinned value as immutable without a way to rotate trust safely.

A recovery path usually means one or more of the following: a secondary pin, a staged rollout of new trust material, a short overlap period where old and new certificates are both accepted, or a fallback to standard PKI validation under tightly controlled conditions. Current guidance suggests the recovery path should be designed before deployment, not improvised during an incident. This is consistent with broader identity hygiene principles in the NIST Cybersecurity Framework 2.0, which emphasizes resilience, recovery, and continuous governance rather than one-time configuration.

For machine identities and service-to-service traffic, teams should pair pinning with certificate lifecycle automation, inventory, and rollback procedures. The Ultimate Guide to NHIs is useful here because the operational issue is not just certificate issuance. It is also ownership, visibility, and the ability to update trust across every consumer of that identity.

  • Use short-lived certificates where possible so rotation is routine, not exceptional.
  • Publish new trust material before deprecating the old pin.
  • Test revocation, issuer replacement, and emergency rollover in non-production.
  • Keep telemetry on pin failures so recovery can begin before outage spread.

These controls tend to break down in embedded devices, offline clients, and legacy mobile apps because trust updates cannot be distributed quickly or consistently.

Common Variations and Edge Cases

Tighter pinning often increases assurance but also raises operational overhead, so organisations have to balance interception resistance against recovery complexity. That tradeoff is manageable in modern cloud services, but it becomes fragile in long-lived clients and distributed fleets.

One common edge case is issuer migration, where a team moves from one CA to another during normal renewal or as part of an incident response action. If clients pin the old intermediate or leaf certificate, valid new certificates are rejected until every client updates. Another edge case is revocation after key compromise: the safer the revocation decision, the larger the risk of self-inflicted outage if there is no fallback path.

Best practice is evolving on whether to pin leaf certificates, intermediates, or public keys. There is no universal standard for this yet, and the right answer depends on how quickly clients can be updated and how much trust agility the environment requires. For teams managing high volumes of secrets and machine identities, the broader operational lesson is to avoid brittle trust anchors and to assume rotation will happen under pressure, not during a calm maintenance window. Related incidents such as the Sisense breach and the Schneider Electric credentials breach show how identity and credential failures can cascade when recovery is slow or incomplete.

In environments with air-gapped systems, third-party integrations, or app stores with slow release cycles, pinning without a recovery path is most likely to turn a security control into a prolonged outage.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Covers certificate lifecycle and rotation failure risk for machine identities.
NIST CSF 2.0 PR.AC-1 Access control must remain resilient when trust material changes or is revoked.
NIST Zero Trust (SP 800-207) ID Zero trust depends on continuously validated identity, not immutable trust anchors.

Treat pinned certificates as controlled access dependencies and test recovery as part of access governance.