How do security teams decide which controls to prioritise for AI applications?

Why This Matters for Security Teams

AI application risk is usually not a single control problem. Security teams have to decide whether the bigger exposure is model manipulation, data leakage, tool abuse, or secret theft, and those failure modes require different controls. NIST’s NIST Cybersecurity Framework 2.0 is useful here because it forces teams to connect protection choices to business outcomes, not just technical categories.

For NHI-focused programs, the key question is often whether the AI system can be reached through exposed credentials or whether untrusted prompts and content can alter execution paths. NHIMG research on the LLMjacking pattern shows why this matters: when attacker access is enabled by compromised NHIs, the problem quickly shifts from “model safety” to “identity abuse and operational compromise.” In parallel, the State of Secrets in AppSec research shows how often secrets management gaps and code exposure drive the actual blast radius.

Practitioners get this wrong when they prioritise visible AI controls first and identity, provenance, and runtime validation later. In practice, many security teams discover that the decisive failure was credential exposure or tool-chain trust collapse only after an AI workflow has already been abused.

How It Works in Practice

A practical prioritisation model starts with the attack path, then maps controls to the point where the attack becomes possible. If the main threat is refusal bypass, prompt injection, or unsafe model output, start with content filtering, policy tuning, jailbreak detection, and human review for high-impact responses. If the main threat is instruction hijacking through tools, connectors, or retrieval sources, prioritise trust separation, provenance checks, output validation, and strict tool authorization.

For applications that call external systems, the highest-value controls are usually identity and runtime guardrails rather than static allowlists. That means binding each agent or service to a workload identity, issuing short-lived credentials per task, and validating every privileged action at request time. In agentic environments, static RBAC alone is rarely enough because the same agent can chain tools, change context, and reach data paths that were not obvious at design time. Guidance from the DeepSeek breach illustrates how quickly broad exposure can follow from weak secret handling, while the JetBrains GitHub plugin token exposure case is a reminder that compromised tokens often create the shortest route to misuse.

Prioritise alignment, refusal hardening, and jailbreak testing when the model itself is the primary control point.

Prioritise provenance, input sanitisation, and output validation when untrusted content can influence downstream systems.

Prioritise ephemeral credentials, workload identity, and least privilege when the AI system can invoke tools or APIs.

Prioritise audit logging and anomaly detection when the environment has many connectors, agents, or shared secrets.

These controls tend to break down in highly interconnected environments with shared service accounts, long-lived API keys, and opaque third-party connectors because the runtime decision boundary disappears.

Common Variations and Edge Cases

Tighter control selection often increases integration overhead, so organisations have to balance stronger containment against developer friction and operational latency. That tradeoff is especially visible in production copilots, autonomous agents, and retrieval-augmented workflows where every extra approval step can slow delivery.

There is no universal standard for this yet, but current guidance suggests separating controls by failure mode rather than by technology label. For example, an AI chatbot used only for drafting text may need stronger jailbreak testing than tool isolation, while an AI agent that can create tickets, query databases, or trigger deployments needs the reverse. The Ultimate Guide to NHIs is useful for teams trying to anchor this decision in identity governance rather than treating every AI workload the same.

Another edge case is shared infrastructure. If multiple AI apps reuse the same model endpoint, vector store, or secrets manager, the control priority should shift toward segmentation and blast-radius reduction. In those cases, the most dangerous weakness is often not model quality but shared trust. Security teams should review whether a compromise in one app can pivot into another through reused tokens, overly broad service identities, or permissive connector scopes.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A2	Covers prompt injection and tool misuse in AI apps.
CSA MAESTRO	TRM-04	Maps directly to runtime trust and agent control decisions.
NIST AI RMF		Supports risk-based prioritisation across AI failure modes.

Test app controls against prompt injection and constrain tool use at runtime.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How do security teams decide which controls to prioritise for AI applications?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group