DevOps delivery authority is about approving and running changes into environments, while SRE operational authority is about stabilising and restoring services under production conditions. They should not be identical by default. Separating them reduces the chance that one identity can both ship risky changes and repair the consequences without independent oversight.
Why This Matters for Security Teams
DevOps delivery authority and SRE operational authority are often confused because both roles touch production systems, but they optimise for different risks. Delivery authority exists to move approved changes safely through build, test, and deployment paths. SRE operational authority exists to keep services stable, restore them quickly, and limit blast radius during incidents. If the same identity can both push change and unilaterally recover from failure, separation of duties weakens and post-incident accountability becomes blurry.
This distinction matters most in environments where service accounts, CI/CD tokens, and automation pipelines already carry broad access. NHI Mgmt Group research shows that 97% of NHIs carry excessive privileges and only 5.7% of organisations have full visibility into their service accounts, which makes role overlap especially dangerous when authority is not explicitly bounded in code or policy. The operational lesson aligns with the NIST Cybersecurity Framework 2.0: identity, access, and resilience controls need to be designed around function, not job title.
In practice, many security teams discover the overlap only after a failed release and an incident response action happen under the same credential, rather than through intentional access design.
How It Works in Practice
The cleanest way to separate these authorities is to treat them as distinct operational boundaries with different approval paths, credentials, and audit expectations. DevOps delivery authority should be scoped to change introduction: pipeline promotion, infrastructure updates, release toggles, and controlled deployment actions. SRE operational authority should be scoped to live-service intervention: scaling, failover, feature rollback, traffic shaping, emergency config changes, and recovery actions under incident conditions.
A practical implementation usually combines:
- Separate NHI identities for delivery and operations, each with narrow, task-specific permissions.
- Short-lived credentials issued through just-in-time workflows, not standing access.
- Policy checks at request time, so the system evaluates context such as environment, ticket, incident severity, and change window.
- Immutable logging that preserves who approved the change, who executed the deployment, and who performed the recovery action.
- Break-glass access for SREs that is time-bound and independently reviewed after the incident.
This is where NHI governance becomes operational rather than theoretical. The Ultimate Guide to NHIs — What are Non-Human Identities is useful background because the same lifecycle issues apply here: credential sprawl, over-privilege, and weak offboarding. For incident-driven environments, current guidance suggests pairing this with runtime policy enforcement, which is consistent with the broader direction of NIST Cybersecurity Framework 2.0. The point is not to slow teams down, but to make authority explicit and revocable.
These controls tend to break down in highly automated CI/CD environments where one pipeline identity is reused across build, deploy, rollback, and remediation because the system no longer knows which action is delivery versus recovery.
Common Variations and Edge Cases
Tighter separation of authority often increases operational overhead, requiring organisations to balance speed against control. That tradeoff is real, especially in smaller teams where the same engineers build, deploy, and support the service. Best practice is evolving here: there is no universal standard that says DevOps and SRE identities must never overlap, but the overlap should be deliberate, temporary, and compensating-controls heavy.
A few common edge cases matter:
- In small teams, shared human responsibility may be unavoidable, but the NHI credentials can still remain distinct.
- In platform engineering models, the deployment pipeline may own delivery authority while SRE retains emergency override permissions.
- During incident response, SRE break-glass access may temporarily exceed normal limits, but it should expire automatically and trigger review.
- In regulated environments, the most defensible model is often dual control: one identity can ship, another can stabilise, and neither can fully authorise both paths alone.
The most important failure mode is hidden dual-use access. The CI/CD pipeline exploitation case study illustrates why pipeline credentials should not quietly double as operational recovery keys. Where that distinction is absent, a compromised delivery path can become a production control plane, which turns a routine rollback into an unauthorised privilege escalation path.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Non-Human Identity Top 10 | NHI-01 | Separating delivery and ops identities reduces over-privileged NHI reuse. |
| NIST CSF 2.0 | PR.AC-4 | Access permissions must match function, incident role, and environment. |
| NIST Zero Trust (SP 800-207) | SC-3 | Zero Trust supports runtime authorization instead of implicit production trust. |
Assign distinct, least-privilege NHIs for deploy and recovery actions, then review for shared credentials.
Related resources from NHI Mgmt Group
- What is the difference between attack surface management and NHI governance?
- What is the difference between reviewing human access and reviewing NHIs?
- What is the difference between role-based access and API key governance for NHI security?
- What is the difference between human IAM controls and NHI governance?