How should security teams implement on-demand permissions in cloud environments?

Why This Matters for Security Teams

On-demand permissions are not just a tighter form of access control. They are a response to the way cloud incidents now unfold: fast, identity-driven, and often through over-scoped service accounts, automation, or agentic workflows. Static roles age badly in cloud environments because the same identity may be reused across deployments, environments, and tools. Current guidance from the OWASP Non-Human Identity Top 10 and NHI Management Group research shows that over-privilege and weak credential hygiene remain recurring failure points, not edge cases. In the Ultimate Guide to NHIs — Key Challenges and Risks, the pattern is clear: once access is standing, it is easy to reuse, hard to justify, and frequently wider than the task requires.

The practical risk is not only misuse by an attacker. It is also accidental excess, where pipelines, operators, and workloads keep privileges long after the request that justified them has ended. That is why modern cloud governance is moving toward request-time authorization, context-aware approvals, and automatic expiry. In practice, many security teams discover excessive standing access only after an audit, outage, or privilege escalation has already exposed the problem.

How It Works in Practice

Effective on-demand permissions start with a policy decision at the moment of need, not a broad role assigned months earlier. Security teams define the minimum signals that can justify elevation, then evaluate those signals at request time. Common signals include ticket approval state, environment sensitivity, workload ownership, on-call assignment, change window, and whether the requester is a human operator, service, or agent. The best practice is evolving, but the direction is consistent: use short-lived access, scope it to one target action or resource set, and revoke it automatically when the task ends.

In cloud environments, that usually means combining identity, policy, and session controls. A privileged request might be granted only after a workflow engine checks context, then issues an ephemeral token or temporary role session with a strict TTL. For non-human identities, that token should be bound to a workload identity such as SPIFFE or OIDC, so the system can verify what the workload is, not merely what secret it holds. This aligns well with CISA Zero Trust guidance and with the 230M AWS environment compromise, where excessive trust and weak control boundaries are exactly the sort of conditions that turn small misconfigurations into major exposure.

Use policy-as-code so the approval logic is versioned, reviewed, and testable.

Set TTLs by task risk, not by convenience, and make revocation automatic.

Separate request approval from secret issuance so access is not granted until policy passes.

Log the full context of each elevation, including who, what, why, and for how long.

For cloud-native teams, this is often implemented with PAM, identity brokers, CI/CD guardrails, or cloud-native session controls, but the control objective stays the same: minimize the time window in which elevated access exists. These controls tend to break down when legacy applications require persistent credentials because the application design assumes standing trust and cannot tolerate session turnover.

Common Variations and Edge Cases

Tighter on-demand permissions often increase operational overhead, so organisations have to balance faster response against approval friction and automation complexity. That tradeoff is especially visible in multi-account clouds, break-glass workflows, and production incident response. In those cases, security teams should predefine exception paths with stronger monitoring rather than silently falling back to standing privilege. There is no universal standard for this yet, but current guidance suggests treating exceptions as rare, time-boxed, and heavily logged.

One common edge case is machine-to-machine access in deployment pipelines. A human-style approval workflow does not always fit, so the control may need to be a combination of workload identity, environment attestation, and policy evaluation at runtime. Another is delegated admin access for cloud operations teams, where broad RBAC roles are still common. In those environments, on-demand permissions work best when the role only enables request initiation, while the actual privilege is minted as an ephemeral session. For teams dealing with recurring NHI risk, the Snowflake breach is a reminder that long-lived access paths are easy to abuse once they exist.

Where this guidance breaks down most often is in services that cannot be restarted or reauthenticated cleanly, because the business has coupled uptime to persistent secrets rather than short-lived identity.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Non-Human Identity Top 10	NHI-03	Addresses overlong or unrotated credentials used for elevated cloud access.
CSA MAESTRO		Relevant to runtime authorization and ephemeral access for autonomous cloud workloads.
NIST AI RMF		Supports governance of context-aware decisioning for dynamic access requests.

Replace standing secrets with short-lived credentials and rotate or revoke them automatically after use.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams implement on-demand permissions in cloud environments?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group