By NHI Mgmt Group Editorial TeamPublished 2025-02-19Domain: Agentic AI & NHIsSource: Defakto Security

TL;DR: AI security depends on giving every workload an identity, then using policy-based authorization to control what it can access, when, and under what context, according to Defakto Security. The governance gap is no longer about whether to credential AI, but whether existing IAM can preserve least privilege as agents discover and chain actions across systems.


At a glance

What this is: This analysis argues that securing AI starts with workload identity, then moves to fine-grained authorization and identity chaining for agentic behaviour.

Why it matters: It matters because IAM teams need to govern AI as a class of non-human identities, not as a special case outside existing access-control patterns.

By the numbers:

👉 Read Defakto Security's analysis of AI workload identity and agentic authorization


Context

AI security is an identity problem before it is a model problem. Once an AI system can act, call tools, and reach data sources, it behaves like a workload with execution authority, which means IAM, authorization, and credential lifecycle controls become the real control plane. For NHI practitioners, the key question is whether AI workloads are being managed as governed identities or as an exception that bypasses normal access discipline.

This argument is strongest where agentic AI can discover resources, request access dynamically, and chain context from one identity to another. That creates a familiar governance pattern with a new scale problem, because the blast radius of a mistake can expand quickly when access is granted at runtime rather than prebuilt into static roles. That starting position is increasingly typical for modern AI adoption, not an edge case.


Key questions

Q: How should security teams govern AI workloads as non-human identities?

A: Security teams should inventory AI workloads as distinct NHIs, assign each one a unique identity, and bind access to purpose, scope, and duration. That means separating authentication from authorization, logging every decision, and reviewing ownership and lifecycle controls with the same rigor used for service accounts and other machine identities.

Q: When does ephemeral access reduce risk for AI agents, and when does it not?

A: Ephemeral access reduces risk when it replaces standing privilege with short-lived, task-specific access and when policies are tightly scoped. It does not reduce risk if the policy layer is broad, the context is lost across chained requests, or the agent can repeatedly request new privileges without meaningful oversight.

Q: What is the difference between workload identity and authorization for AI systems?

A: Workload identity proves what the AI system is, while authorization decides what it can do. A strong identity without tight authorization still allows overreach, and tight authorization without reliable identity cannot safely distinguish one workload from another. Effective AI governance needs both controls working together.

Q: Why do AI agents complicate zero trust and least privilege models?

A: AI agents complicate zero trust because they can discover resources, chain actions, and request access dynamically while still appearing legitimate. Least privilege becomes harder when access decisions must be made at runtime and across multiple identities in one workflow, which raises the bar for policy quality, logging, and review.


Technical breakdown

Workload identity for AI systems

Workload identity gives software a cryptographically verifiable identity so it can authenticate as itself rather than borrowing a human account or shared secret. In this model, the workload is attested, issued a short-lived credential, and then allowed to present that credential to downstream systems. Standards such as SPIFFE are relevant because they separate identity from environment and make issuance and rotation more systematic. For AI, this matters because training jobs, inference services, and agents all need distinct identities if access is to be traced and constrained.

Practical implication: Treat each AI workload as a distinct identity with its own credential lifecycle, not as an extension of a developer or service account.

Policy-based authorization and contextual access

Authorization is the decision layer that determines what an authenticated AI workload can do. The article’s key point is that identity alone is not enough, because the real control comes from policies that evaluate context, resource sensitivity, and task scope at the moment of access. That is what makes least privilege workable for AI, especially when access must be temporary or conditional. When workloads request access dynamically, the policy engine becomes the enforcement point for whether the action is allowed, logged, and time-bounded.

Practical implication: Use policy-based controls to scope AI access to the minimum dataset, action, and time window needed for the task.

Identity chaining and non-deterministic behaviour

Agentic AI complicates governance because it can act in sequences that are not fully predictable up front. Identity chaining means the authorization system must preserve the context of multiple identities involved in a request, including the initiating user, the agent, and any downstream service identity the agent uses. This is how the system avoids treating every action as isolated. The challenge is not merely authentication or approval, but maintaining enough context to know who should be able to see or do what when an agent assembles a response from multiple sources.

Practical implication: Design access decisions to retain request context across chained actions, so downstream data exposure is evaluated in full.


Threat narrative

Attacker objective: The attacker wants to turn an AI workload into a trusted execution path that can be abused for broader access and data exfiltration.

  1. Entry begins when an AI workload is allowed to act with broad or poorly scoped access instead of a dedicated workload identity.
  2. Escalation occurs when the agent discovers additional resources and requests further credentials or permissions at runtime without adequate policy constraints.
  3. Impact follows when the agent can chain access across systems and disclose or manipulate data beyond the original task scope.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.


NHI Mgmt Group analysis

AI workload identity is now a baseline governance requirement, not a specialised control. Once an AI can execute actions or call tools, it becomes part of the identity estate and must be governed accordingly. Traditional IAM was built to manage humans and service accounts, but agentic AI forces teams to treat software execution paths as first-class identities. The practical conclusion is straightforward: if it can act, it needs an identity boundary.

Ephemeral access does not solve trust debt if the authorization model is weak. Temporary credentials reduce standing exposure, but they do not fix overly broad policy, poor context preservation, or excessive data scope. The real issue is whether runtime decisions can enforce task-scoped access with enough fidelity to stop an agent from wandering outside its assignment. Teams should not confuse short-lived tokens with effective control.

Identity chaining is the named concept practitioners should watch. AI systems increasingly combine user context, agent context, and downstream service context inside one request path. That creates a governance challenge that is broader than simple authentication, because access decisions must account for the whole chain of actors. The field needs more explicit models for how chained identities are recorded, evaluated, and audited.

Least privilege for AI will fail unless authorization becomes dynamic and observable. Static role design alone cannot keep pace with agentic systems that discover resources at runtime. Policy decisions must be logged, reviewable, and tied to a clear business purpose so security teams can explain why access was granted. Practitioners should expect AI governance to converge with NHI governance, not sit outside it.

Zero Trust becomes more relevant when AI is treated as a workload, but it also becomes harder to operationalise. The model fits the problem because every AI action should be re-evaluated, not assumed safe after initial authentication. The complication is scale, since autonomous systems can generate far more authorization decisions than human workflows. That means the implementation burden shifts toward automation, policy quality, and auditability.

From our research:

  • 53% of organisations have experienced a security incident directly related to machine identity management failures, according to The Critical Gaps in Machine Identity Management report.
  • Average time to detect a compromised machine identity is 214 days, which shows how long NHI issues can remain invisible before they become operational risk.
  • For a broader view of identity lifecycle controls, see Ultimate Guide to NHIs, which connects discovery, rotation, and offboarding to practical governance.

What this signals

Identity context will become the deciding factor in AI governance. As agents start to chain requests and act across multiple systems, teams will need policies that preserve who initiated the action, which workload executed it, and what downstream service consumed it. That is the only way to make runtime decisions explainable and auditable at scale.

With 69% of organisations now having more machine identities than human ones, the governance lesson is that AI cannot be handled as an isolated exception. The same inventory, ownership, and lifecycle discipline that machine identities require will increasingly define whether agentic AI can be deployed safely.

Identity chaining will turn into an audit and containment problem. Security programmes should prepare for requests that involve the user, the agent, and several services in one transaction. That means policy engines, logging pipelines, and access reviews must be designed to explain multi-hop authorization rather than single-step login events.


For practitioners

  • Assign each AI workload a distinct identity Map every training job, inference endpoint, and agent to its own workload identity and credential lifecycle. Avoid shared secrets and shared service accounts, because they erase accountability and widen the blast radius. Use a dedicated identity boundary for each execution path, including external tool access.
  • Enforce task-scoped authorization policies Define policies that bind access to task, resource, and time window, then log every runtime authorization decision. Where possible, require step-up approval for high-risk datasets and actions. This keeps dynamic access from becoming permanent privilege by accident.
  • Preserve identity context across chained requests Carry the originating user, agent identity, and downstream service context through each hop in the request path. That lets reviewers reconstruct why access was granted and whether the final response exceeded the original scope. Without that context, audit trails become incomplete.
  • Review AI access as part of NHI governance Bring AI workloads into the same inventory, review, rotation, and offboarding process used for other NHIs. Treat them as governed identities with owners, purpose statements, and deprovisioning rules. This reduces the risk of shadow AI and unowned access paths.

Key takeaways

  • AI security begins with workload identity, because anything that can act must be governed as an identity-bearing system.
  • Ephemeral credentials help, but they do not compensate for weak authorization, poor context preservation, or broad task scope.
  • Practitioners should fold AI into NHI governance now, or risk building an unbounded access layer above existing IAM controls.

Key terms

  • Workload Identity: A workload identity is a cryptographically verifiable identity assigned to software rather than a person. For AI systems, it lets training jobs, inference services, and agents authenticate as themselves, making access controls, logging, and rotation possible without shared accounts or human impersonation.
  • Identity Chaining: Identity chaining is the preservation of multiple identities across a single request path, such as the initiating user, the AI agent, and downstream service identities. It matters because authorization must evaluate the full chain, not just the last hop, to avoid overexposure and incomplete audit trails.
  • Task-Scoped Access: Task-scoped access is permission granted only for the specific work an identity needs to complete, and only for the time required to complete it. In AI environments, it is the practical form of least privilege because agents can discover resources dynamically and should not hold broad standing rights.
  • Authorization Context: Authorization context is the set of signals used to decide whether access should be granted, including identity, purpose, resource, sensitivity, and request conditions. For agentic AI, preserving context is essential because a correct decision depends on what the system is trying to do, not just who or what is asking.

Deepen your knowledge

AI workload identity and policy-based authorization are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are extending identity governance to agents and other machine identities, it is worth exploring.

This post draws on content published by Defakto Security: AI Security Begins with Workload Identity. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2025-02-19.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org