Which frameworks are most relevant to AI runtime security?

Why This Matters for Security Teams

ai runtime security is not just a model problem. It is the control plane that decides what an AI system can do while it is executing, which means policy, identity, telemetry, and containment all have to work together at request time. NIST’s Cybersecurity Framework 2.0 remains useful because it ties protection and monitoring to operational outcomes, but AI runtime adds a dynamic layer that traditional application security often misses.

That gap is visible in NHI environments too. NHI Management Group notes that lack of credential rotation is cited as the top cause of NHI-related attacks by 45% of organisations in The State of Non-Human Identity Security. For AI runtime security, that matters because exposed or long-lived secrets can become the entry point for model abuse, tool chaining, or downstream data access.

Security teams often assume they can secure AI runtime with the same static patterns used for ordinary apps. In practice, many teams encounter privilege misuse only after an agent has already chained tools, touched sensitive systems, or been abused through a compromised token, rather than through intentional runtime governance.

How It Works in Practice

Current guidance suggests treating AI runtime security as a combination of governance, workload identity, and request-time authorization. The strongest frameworks do not replace each other. They layer together. NIST CSF helps structure the operational controls, while NHIMG’s standards guidance frames how non-human identities should be governed across lifecycle stages. For agentic and LLM-driven systems, Top 10 NHI Issues is especially relevant because the runtime often fails at the identity boundary before the model itself is the problem.

At implementation level, the goal is to issue short-lived credentials, verify workload identity, and evaluate policy on every tool call or data request. That usually means:

Using workload identity rather than shared secrets, so the runtime proves what it is before it receives access.

Issuing just-in-time credentials with short TTLs, then revoking them when the task ends.

Applying policy-as-code at runtime instead of relying only on pre-approved roles.

Logging tool use, prompt-to-action transitions, and downstream access for forensic review.

For environments with autonomous agents, this aligns with the emerging view in lifecycle management for NHIs: credentials should be created for a purpose, used briefly, and removed quickly. There is no universal standard for agent runtime policy semantics yet, so teams should expect to combine NIST, OWASP, and internal control patterns rather than rely on a single prescriptive framework. These controls tend to break down when agents inherit broad API permissions in flat environments because the runtime can pivot faster than manual review can react.

Common Variations and Edge Cases

Tighter runtime controls often increase operational overhead, requiring organisations to balance stronger containment against latency, policy complexity, and developer friction. That tradeoff is especially visible in multi-agent systems, where one agent may call another, inherit context, or trigger workflows across several services.

Best practice is evolving, but the common edge cases are clear. If a workload uses third-party connectors, OAuth grants, or long-lived service tokens, runtime security must extend beyond the model host and into the connected systems. If the environment relies on human-style RBAC alone, it will struggle to reflect the runtime context of autonomous actions. For that reason, guidance from The State of Non-Human Identity Security and the DeepSeek breach both point to the same lesson: exposed secrets and weak visibility turn AI runtime into an attack surface, not just an execution environment.

Where the answer gets nuanced is regulated or high-availability systems. Some teams will accept slightly longer-lived credentials to preserve uptime, while others will enforce aggressive revocation and accept occasional retries. Current guidance suggests the safest baseline is to minimise standing privilege, but there is no universal standard for how short an AI runtime credential must be in every environment.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST AI RMF and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
NIST AI RMF		AI RMF governs risk, transparency, and monitoring for runtime AI systems.
NIST CSF 2.0	PR.AC-4	Access control and least privilege are core to runtime AI security.
OWASP Agentic AI Top 10		Agentic AI guidance addresses tool abuse, prompt injection, and runtime misuse.

Use AI RMF to define runtime risk controls, accountability, and monitoring for each AI execution path.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Which frameworks are most relevant to AI runtime security?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group