Subscribe to the Non-Human & AI Identity Journal

How can security teams know whether SSH certificate controls are working?

Look for short certificate lifetimes, documented revocation events, and complete logs showing who accessed which system and when. If certificates persist far beyond their intended use or revocation is manual and slow, the control is not governing access, only authenticating it.

Why This Matters for Security Teams

ssh certificate are only useful when they prove more than login success. Security teams need evidence that the control is limiting exposure through short-lived trust, rapid revocation, and complete auditability of each session. That matters because machine and non-human identities now outnumber human identities in many environments, and certificate management failures are a common way access quietly outlives intent. NHIMG research shows only 38% of organisations have automated certificate lifecycle management in place, which helps explain why expiry and renewal issues keep surfacing in production. See the broader machine identity context in Ultimate Guide to NHIs — What are Non-Human Identities and the control expectations in the Ultimate Guide to NHIs — Standards.

For practitioners, the real question is whether the certificate is acting as a governing mechanism or just a stronger password. A control can authenticate an SSH client and still fail if lifetimes are long, revocation is manual, or logs do not link a certificate to a specific workload and purpose. The NIST Cybersecurity Framework 2.0 reinforces the need for verifiable control operation, not just deployment. In practice, many security teams discover certificate drift only after a maintenance window, incident review, or expired key outage has already exposed the gap.

How It Works in Practice

Teams should evaluate SSH certificate controls as a lifecycle system: issuance, use, logging, renewal, and revocation. A working control usually produces short certificate lifetimes, ideally measured in hours or days rather than weeks, plus automated issuance tied to workload identity or an approved admin flow. The purpose is to make access temporary by design, not by exception. This is consistent with current guidance in NIST SP 800-207 Zero Trust Architecture, which emphasizes continuous verification, and with the operational focus in SSH certificate-based authentication implementations that rely on a trusted CA and enforced expiry.

To tell whether the control is functioning, security teams should look for evidence in four places:

  • Certificate validity periods are short and consistent with the task being performed.
  • Revocation is automated or near real-time, not dependent on manual cleanup after a ticket.
  • Session logs record who issued the certificate, which host accepted it, and what command or login path was used.
  • Monitoring can distinguish normal renewal from suspicious reissuance or reuse across systems.

This is where SSH certificates differ from static keys. Static keys can remain valid for years, while SSH certificates can expire quickly and be issued only when needed. That gives teams a measurable signal: if access still works long after the expected TTL, the control is failing. If a revoked certificate continues to authenticate, trust is being enforced too loosely. The same visibility problem appears in broader machine identity management, where outage and incident data are often driven by weak lifecycle oversight, as described in The Critical Gaps in Machine Identity Management report and the access-risk findings in the State of Non-Human Identity Security.

These controls tend to break down when SSH access is distributed across legacy hosts, hand-managed bastions, or emergency break-glass workflows because those environments usually bypass automated issuance and revocation.

Common Variations and Edge Cases

Tighter certificate controls often increase operational overhead, so organisations have to balance stronger assurance against support burden and outage risk. Best practice is evolving here, and there is no universal standard for the ideal SSH certificate TTL across all environments. A short lifetime is only effective if certificate issuance is reliable and the logging pipeline can keep up. If the CA, bastion, or directory service becomes a bottleneck, teams may lengthen lifetimes to keep operations moving, which weakens the control.

Edge cases usually show up in high-availability systems, disaster recovery sites, and third-party support access. In those settings, teams may need separate policies for human administrators, automation accounts, and service-to-service SSH tunnels. The control should still produce proof of use, but the evidence may come from different sources depending on the environment. For example, ephemeral automation may justify very short-lived certificates, while a recovery workflow may require a tightly logged exception path with additional approval.

Security teams should also watch for false confidence. A certificate program can look mature if issuance is automated, yet still fail if logs are incomplete or if revocation events are never tested. Current guidance suggests treating certificate expiry, revocation time, and log completeness as operational metrics, not background configuration details. When those metrics are missing, the organisation can prove authentication but not governance.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Short-lived credentials and lifecycle control are central to proving SSH cert governance.
NIST CSF 2.0 PR.AA-1 Identity proofing and access verification align to validating SSH certificate use.
NIST Zero Trust (SP 800-207) PR.AC Zero trust requires continuous, context-based access decisions instead of static trust.

Measure certificate TTL, automate rotation, and verify revocation works before access is considered controlled.