What breaks when backups are not tested regularly?

Why This Matters for Security Teams

Backups are often treated as a checkbox until the first real outage proves otherwise. The failure is rarely the existence of backups; it is the false confidence that they can be restored cleanly, fast enough, and to the right point in time. NIST’s NIST Cybersecurity Framework 2.0 emphasises recoverability as an operational capability, not a storage activity. For NHI-heavy environments, that matters because backup sets can also preserve exposed service account secrets, stale keys, and misconfigured access paths.

NHI Management Group’s research shows how often identity risk is underestimated: 79% of organisations have experienced secrets leaks, with 77% of those incidents causing tangible damage, and 71% of NHIs are not rotated within recommended time frames, increasing compromise exposure over time. That means an untested backup is not only a continuity risk, it can also be a reintroduction path for compromised credentials. The Ultimate Guide to NHIs is explicit that lifecycle controls, visibility, and rotation all affect recovery quality.

In practice, many security teams discover restore gaps only after ransomware encryption or accidental deletion has already disrupted service, rather than through intentional recovery validation.

How It Works in Practice

Regular backup testing should verify more than file presence. It should prove that data can be restored, applications can start, access controls still function, and the recovered environment is operationally usable. A restore test should confirm three things: the backup is complete, the data is clean, and the recovery time fits the business requirement. That is why mature recovery programmes test at the application and identity layer, not just at the storage layer.

For organisations with NHIs, the test needs to include secrets, keys, certificates, service accounts, and any automation that depends on them. A backup can be technically valid while still failing in practice because embedded credentials expired, permissions changed, or the restore process resurrected access that should have been revoked. The Ultimate Guide to NHIs highlights how often secrets are stored in vulnerable locations, which increases the chance that recovery copies include sensitive material that should have been rotated or removed.

Run scheduled restore tests against representative systems, not just low-value sample data.

Validate point-in-time recovery for databases and identity-dependent workloads.

Check that secrets, certificates, and service account dependencies are rotated or reissued after restore.

Measure restore time against business continuity targets, not IT convenience.

Document failed restores as control failures, not as one-off operational issues.

Best practice is to test both clean-room recovery and routine restore scenarios, because backup integrity and ransomware survivability are not the same thing. These controls tend to break down in highly automated environments where configuration drift, ephemeral credentials, and intertwined dependencies make a restore look successful until the application begins authenticating.

Common Variations and Edge Cases

Tighter restore testing often increases operational overhead, requiring organisations to balance recovery assurance against downtime, cost, and staff time. That tradeoff becomes sharper when systems are distributed, heavily virtualised, or built on short-lived cloud resources, because a successful backup still may not recreate the original operating context.

There is no universal standard for how often every backup must be restored, but current guidance suggests testing frequency should track business criticality, change rate, and regulatory exposure. A monthly restore test may be sufficient for stable systems, while high-change or high-impact environments often need more frequent validation. For regulated systems, evidence matters as much as execution, so test logs, timestamps, and corrective actions should be retained.

Edge cases also include immutable backups, air-gapped archives, and cross-region replication. Those improve resilience, but they do not eliminate the need to test whether data can be decrypted, mounted, and brought back into service. The Schneider Electric credentials breach is a reminder that identity-related failures can cascade into broader operational disruption when recovery assumptions are weak.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
NIST CSF 2.0	RC.RP-1	Restore planning and testing are core to recovery capability.
OWASP Non-Human Identity Top 10	NHI-04	Backup copies often retain secrets and service account material.
NIST AI RMF		Operational resilience depends on validating system behaviour after recovery.

Apply governance and measurement practices to verify recovered systems remain trustworthy and functional.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

What breaks when backups are not tested regularly?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group