Subscribe to the Non-Human & AI Identity Journal

Who should own the decision between replacement and compensating controls for legacy HSMs?

The decision should be shared by PKI, IAM, and hardware operations owners, because it crosses certificate assurance, access governance, and device lifecycle management. Where replacement is not immediate, compensating controls such as network restriction and personnel vetting must be documented as temporary risk treatment, not as proof of quantum safety.

Why This Matters for Security Teams

Legacy HSMs sit at the point where certificate assurance, key custody, and operational resilience converge, so the replacement versus compensating-controls decision cannot be owned by a single function. PKI teams need to understand cryptographic assurance, IAM owners need to judge who can administer and approve use, and hardware operations need to manage device health, firmware, and lifecycle constraints. Current guidance suggests treating this as a shared risk decision aligned to NIST Cybersecurity Framework 2.0 rather than a purely technical refresh.

The practical issue is that compensating controls often look reassuring on paper while leaving the underlying trust anchor in place. NHI Mgmt Group’s Ultimate Guide to NHIs — Standards notes that 97% of NHIs carry excessive privileges, which is a reminder that device and identity governance fail together when ownership is unclear. In practice, many security teams encounter HSM risk only after a renewal failure, access dispute, or audit finding has already exposed how fragmented the decision path really is.

How It Works in Practice

A workable decision process starts by separating three questions: is the HSM still supportable, can it meet current assurance needs, and what temporary controls reduce exposure while a replacement is planned? PKI owners typically assess whether the HSM still supports required algorithms, key lengths, signing performance, certificate policy constraints, and recovery procedures. IAM owners assess who can approve administrative actions, whether privileged access is governed, and whether separation of duties is actually enforced. Hardware operations assess vendor support, firmware currency, physical security, and failure tolerance.

Shared ownership works best when it is documented as a risk acceptance workflow with explicit decision rights. That usually means:

  • Defining a replacement threshold, such as end-of-support, unpatchable firmware, or inability to meet new cryptographic requirements.
  • Recording compensating controls as time-bound measures, not permanent exceptions.
  • Requiring approvals from PKI, IAM, and hardware operations before any extended exception is granted.
  • Reviewing whether access restrictions, network segmentation, and personnel vetting actually reduce exposure enough to justify delay.

For organisations mapping this into broader governance, NIST guidance on identity and access management is most effective when tied to lifecycle ownership, and NHI Mgmt Group’s Ultimate Guide to NHIs — Standards is a useful reference for why lifecycle discipline matters as much as control design. These controls tend to break down when the HSM is embedded in a legacy application estate where no single team can prove who owns the key path end to end.

Common Variations and Edge Cases

Tighter HSM governance often increases operational overhead, so organisations have to balance cryptographic assurance against downtime, migration cost, and regulatory deadlines. That tradeoff is especially visible when the HSM supports revenue-critical systems, a constrained production window, or long-lived certificates that cannot be reissued quickly.

There is no universal standard for this yet, but current guidance suggests the decision becomes more conservative when compensating controls depend on manual processes. If the “temporary” plan relies on broad admin access, undocumented network exceptions, or stale personnel vetting, it is not a strong substitute for replacement. In those cases, the decision should move up to a formal risk committee with clear expiry dates and an explicit owner for the migration plan.

One useful rule is that compensating controls should reduce blast radius, not redefine the device as safe. Controls such as segmented management access, strict change windows, and limited operator sets can buy time, but they do not eliminate cryptographic obsolescence. Where certification programs, third-party attestation, or contractual obligations are involved, the replacement decision may also need input from risk, legal, or compliance owners, but PKI, IAM, and hardware operations should still remain the core decision-makers.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
NIST CSF 2.0 GV.RM-01 Shared HSM risk ownership fits governance-led risk treatment and decision accountability.
NIST CSF 2.0 PR.AA-01 HSM admin access must be governed alongside PKI and operations responsibilities.
OWASP Non-Human Identity Top 10 NHI-03 Legacy HSMs often protect non-human credentials and keys that need lifecycle controls.

Restrict HSM administration to approved roles and verify separation of duties before accepting exceptions.