Subscribe to the Non-Human & AI Identity Journal

What are the main risks associated with AI agents?

The primary risks of AI agents include unauthorized access to sensitive information, compliance violations due to lack of oversight, and the potential for data breaches stemming from unrestricted permissions. Organizations must prioritize addressing these issues through strong governance and monitoring frameworks.

Why This Matters for Security Teams

AI agents are risky because they do not just generate output. They act, chain tools, and keep pursuing a goal until something stops them. That autonomy changes the threat model: a single prompt can lead to data access, credential use, external requests, or unintended changes in production systems. The practical issue is not only model quality. It is the combination of execution authority, broad permissions, and limited oversight. Current guidance from OWASP Agentic AI Top 10 and NIST AI Risk Management Framework both point to governance, traceability, and context-aware controls as core requirements, not add-ons.

NHIMG research shows why this is urgent: in the AI Agents: The New Attack Surface report, 80% of organisations said their AI agents had already performed actions beyond intended scope, including unauthorised system access, sensitive data sharing, and credential exposure. That is a strong indicator that “trust the workflow” is not a security strategy. In practice, many security teams encounter agent overreach only after data has already moved or privileged actions have already been taken, rather than through intentional pre-production testing.

How It Works in Practice

The safest way to think about agent risk is as a workload identity and authorization problem, not just an application risk. An agent should have a cryptographic identity, a narrowly scoped mission, and credentials that expire as soon as the task ends. That is why OWASP NHI Top 10 and Ultimate Guide to NHIs — Key Challenges and Risks are useful references: they frame the problem around identity, secrets, and constrained execution.

In practice, good control design usually includes:

  • JIT credentials issued per task, with short TTLs and automatic revocation on completion.
  • Intent-based authorization, where policy evaluates what the agent is trying to do at request time, not just its static role.
  • Workload identity such as SPIFFE/SPIRE or OIDC-backed service identity, so the agent proves what it is before it receives access.
  • Policy-as-code enforcement, using real-time rules instead of broad, pre-approved entitlements.
  • Secrets isolation so API keys, tokens, and certificates are never long-lived or reusable beyond the minimum task.

That operating model also helps with auditability. If an agent can access customer data, send messages, or invoke deployment tools, every one of those actions should be logged and attributable to the workload identity, not just to the application wrapper. The challenge is not unique to theory. The Anthropic — first AI-orchestrated cyber espionage campaign report and AI LLM hijack breach both reinforce how quickly compromised or over-permissioned identities can be abused once an attacker gains a foothold. These controls tend to break down when agents are embedded in legacy apps that were built for static service accounts because the environment cannot evaluate context fast enough.

Common Variations and Edge Cases

Tighter control often increases integration overhead, requiring organisations to balance operational speed against the security value of shorter-lived access. That tradeoff becomes sharper in multi-agent workflows, where one agent delegates to another, or when agents must interact with SaaS tools, ticketing systems, and code repositories across different trust zones. There is no universal standard for this yet, but current guidance suggests avoiding standing privilege wherever possible and using context-aware approval only where business risk justifies it.

One common edge case is “read-only” access that is still dangerous. Agents with read access to sensitive data can exfiltrate, summarize, or transform information in ways that create compliance exposure. Another is tool chaining: an agent may start with a harmless request and then use the result to justify a more privileged action. That is why OWASP Top 10 for Agentic Applications 2026 and NIST Cybersecurity Framework 2.0 remain relevant: they stress governance, resilience, and control boundaries rather than trust in model intent.

Agentic systems also need extra scrutiny when secrets are embedded in prompts, browser sessions, or tool connectors. NHIMG’s DeepSeek breach coverage shows how exposed records and embedded secrets can magnify damage once a workload is reachable. The practical takeaway is simple: autonomous systems should be designed as ephemeral, least-privilege identities with tightly scoped tools, not as always-on digital employees.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 AP-01 Agentic app guidance addresses tool abuse, scope creep, and autonomous action risk.
CSA MAESTRO MAESTRO focuses on securing autonomous workflows, approvals, and execution boundaries.
NIST AI RMF GOVERN AIRMF governs accountability, oversight, and risk treatment for AI systems.

Define agent workflows, approvals, and guardrails before deployment, then monitor deviations continuously.

Related resources from NHI Mgmt Group