Subscribe to the Non-Human & AI Identity Journal

How should IAM teams govern AI agents as identity programmes mature?

Treat AI agents as identities that need discovery, entitlement boundaries, and continuous oversight. They do not wait for ticket queues or static review cadences, so governance has to adapt to runtime behaviour. The practical test is whether the programme can control access at machine speed without relying on manual approval loops.

Why This Matters for Security Teams

AI agents change the identity problem because they are not fixed users with stable routines. They act on goals, chain tools, and can request new access at runtime, which makes static RBAC reviews and annual entitlement recertifications too slow to be reliable. Current guidance suggests treating agent identity as a live governance problem, not a one-time provisioning event. That aligns with the risk patterns described in the Ultimate Guide to NHIs and the runtime control expectations in the OWASP Agentic AI Top 10.

The practical shift is from identity records to identity behaviour. IAM teams need to know what an agent is allowed to do, what it is actually doing, and whether those actions still fit the current task context. That requires machine-speed decisions, short-lived credentials, and continuous oversight rather than approval loops built for human workflows. In practice, many teams discover this only after an agent has already chained tools, reached a sensitive API, or reused a secret outside its intended scope.

How It Works in Practice

Mature governance starts by classifying agents as distinct workload identities, not as generic service accounts. The identity primitive is the cryptographic workload identity, such as SPIFFE, OIDC-based workload tokens, or comparable attestation-backed credentials. That lets the programme bind an agent to a workload, an environment, and a trust level, instead of relying on a static username and password model. The access decision then becomes runtime authorisation based on intent, context, and policy, which is closer to the direction described in the NIST AI Risk Management Framework and the CSA MAESTRO agentic AI threat modeling framework.

  • Issue just-in-time credentials for a single task or bounded session, then revoke them automatically when the task ends.
  • Prefer short-lived secrets with tight TTLs over durable API keys that can be reused after the original context has changed.
  • Enforce policy-as-code at request time, using tools such as OPA or Cedar-style controls, so approval depends on what the agent is trying to do right now.
  • Separate discovery from entitlement: first identify every agent, connector, and tool path; then map which actions are truly required.
  • Log tool use, policy denials, and token issuance together so incident responders can reconstruct agent behaviour.

This is where NHIs already create measurable risk. NHI Mgmt Group reports in the Ultimate Guide to NHIs that only 5.7% of organisations have full visibility into their service accounts, which shows how quickly unmanaged machine identities outrun governance. These controls tend to break down when agents operate across multiple SaaS tools and shadow connectors because the policy engine loses a single, reliable control plane.

Common Variations and Edge Cases

Tighter agent controls often increase operational overhead, so organisations have to balance velocity against containment. There is no universal standard for this yet, especially for multi-agent systems, delegated agents, and agents that can spawn sub-agents with inherited context. Best practice is evolving, but the direction is consistent: treat every delegation step as a new trust decision, not a continuation of the original session. That is why the agentic guidance in OWASP NHI Top 10 and MITRE ATLAS adversarial AI threat matrix matters even when the system looks like a normal automation workflow.

Edge cases show up when agents need broad discovery privileges, when they interact with legacy systems that cannot issue ephemeral tokens, or when business users expect “human-like” continuity across long tasks. In those environments, compensation controls matter: segmentation, scoped proxy services, workflow checkpointing, and aggressive secret rotation. NHIMG’s research also shows how quickly exposed credentials can be abused, which is relevant when agents use secrets outside a vault boundary. The security test is not whether an agent can be made powerful, but whether its power can be made temporary, observable, and revocable.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A3 Runtime agent authorisation and tool misuse are core agentic risks.
CSA MAESTRO T3 MAESTRO covers trust boundaries, delegation, and agentic control points.
NIST AI RMF GOVERN AI RMF governs accountability for autonomous systems and their decisions.

Model each agent delegation step as a new trust decision with scoped credentials.