Subscribe to the Non-Human & AI Identity Journal

How should security teams verify certificate revocation in production?

Security teams should verify revocation by checking the certificate’s CRL Distribution Point, downloading the current CRL, and matching the certificate serial number against the revoked list. They should do this from the same network locations used by applications and automation so the test reflects real trust conditions, not a local admin path.

Why This Matters for Security Teams

Certificate revocation is one of those controls that looks simple on paper and fails quietly in production. If a revoked certificate is still accepted by applications, the problem is not the revocation list itself but the path, cache, or validation logic used by the workload. NHI Management Group research on machine identity management shows how common operational blind spots are, especially when identity inventories and lifecycle controls are incomplete, as discussed in the Ultimate Guide to NHIs — What are Non-Human Identities. That matters because certificate validation is only trustworthy when tested under the same conditions as production traffic, not from an admin workstation with different routes, proxies, or trust stores.

Zero Trust guidance from NIST SP 800-207 Zero Trust Architecture reinforces the principle that trust decisions must be made from current context, not assumptions. Revocation checks are a practical example: if a workload cannot reach the CRL Distribution Point, or if an intermediate device caches stale status, the security team may believe revocation works when the service path says otherwise. In practice, many security teams discover broken revocation only after a rotated or compromised certificate is still trusted by an application.

How It Works in Practice

Production verification should start with the full validation path the application actually uses. That means identifying the certificate chain, checking the CRL Distribution Point extension, confirming the CRL is reachable from the workload subnet, and comparing the target certificate serial number against the current revoked set. If the environment uses OCSP, teams should validate whether the application performs live OCSP queries, soft-fails on timeout, or relies on stapled responses. The question is not just whether revocation data exists, but whether the relying party consumes it correctly.

A useful operational pattern is to test from the same compute tier, namespace, VPC, or service mesh where production traffic originates. This reduces false confidence caused by local admin access, different DNS, or special outbound exceptions. NHI Management Group guidance on machine identity risk in the Ultimate Guide to NHIs — The NHI Market aligns with this reality: certificate lifecycle problems are rarely isolated events, they are usually part of broader operational sprawl. The Sisense breach also illustrates why validation must assume exposed secrets and compromised trust material can exist in real environments.

  • Fetch the current CRL from the exact distribution point listed in the certificate.
  • Confirm the CRL is fresh and within its validity window.
  • Verify the serial number appears on the revoked list, if applicable.
  • Repeat the test from each production network path that matters.
  • Record whether the application hard-fails, soft-fails, or bypasses revocation entirely.

These controls tend to break down when load balancers, proxies, or offline validation caches intercept revocation lookups because the application never sees the authoritative status source.

Common Variations and Edge Cases

Tighter revocation checking often increases operational overhead, requiring organisations to balance security assurance against availability and network complexity. That tradeoff is especially visible in segmented environments, where outbound access to CRL endpoints may be blocked by default. In those cases, current guidance suggests explicitly allowing the minimum required paths and documenting any compensating control, such as short certificate lifetimes or stapled OCSP where supported. There is no universal standard for this yet across all stacks.

Edge cases matter. Some applications cache CRLs aggressively, which can delay recognition of revocation until cache expiry. Others support OCSP but only as a soft-fail control, meaning network failure may still permit trust. Air-gapped or highly restricted environments may need internal CRL mirrors, but those mirrors must be monitored for freshness and integrity. Security teams should also remember that revocation only helps if certificates are actually replaced or removed after compromise; otherwise, a revoked object may remain present in secrets stores and automation workflows.

For teams building a broader NHI program, revocation verification should sit alongside inventory, rotation, and monitoring practices rather than replace them. The main lesson from NHI Management Group research is that visibility gaps drive delayed detection, and revocation checks are most reliable when they are exercised continuously, not treated as a one-time audit task.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Revoked certs and rotation failures are core NHI credential lifecycle risks.
NIST CSF 2.0 PR.AC-1 Revocation validates that access credentials are no longer accepted.
NIST Zero Trust (SP 800-207) Zero trust requires runtime validation of trust decisions, including revocation.

Test certificate lifecycle controls in production paths and revoke trust when serials or status checks fail.