By NHI Mgmt Group Editorial TeamPublished 2025-09-29Domain: Agentic AI & NHIsSource: Cyera

TL;DR: AI use is now mainstream in enterprises, but only 13% of organisations report strong visibility into how it touches data, while 66% have already caught AI over-accessing sensitive information, according to Cyera’s 2025 State of AI Data Security Report. The real problem is not adoption, it is that governance still treats AI like an ordinary app or user and therefore misses prompt-layer risk and over-permissioned access.


At a glance

What this is: Cyera’s survey shows enterprise AI adoption has outpaced visibility, monitoring, and access controls, leaving data governance exposed at the prompt and agent edge.

Why it matters: IAM and security teams need to treat AI as a governed identity and data-access problem because the same gaps affect NHI, autonomous agents, and human-controlled workflows.

By the numbers:

👉 Read Cyera's 2025 State of AI Data Security Report


Context

AI data security is the governance problem that appears when AI systems can reach enterprise data faster than teams can observe, classify, or constrain that access. The first failure is often not a breach but a visibility gap, because prompt-layer interactions and data retrieval are treated as application telemetry rather than access events.

For IAM and security teams, that gap matters because AI is already acting like a non-human identity inside daily workflows. Once AI can query, retrieve, or surface sensitive material, the programme needs identity, data, and monitoring controls that work together across NHI, human oversight, and agentic behaviour.


Key questions

Q: How should security teams govern AI systems that access sensitive data?

A: Treat the AI as a governed non-human identity, not as a normal application. Scope access by data classification and task boundary, log prompts and retrievals as access events, and enforce controls before sensitive output is exposed. If the system can retrieve or generate confidential material, it needs identity, data, and monitoring controls that work together.

Q: Why do AI deployments over-access data so easily?

A: They often inherit broad permissions designed for convenience, not for machine behaviour. When access is provisioned like a generic application or user account, the system can reach more data than its task truly requires. The result is predictable over-access, weak accountability, and a review process that discovers the problem only after data has already been touched.

Q: How do organisations know whether AI controls are actually working?

A: Look for evidence at the point of use, not just policy approval. Useful signals include prompt logging, retrieval auditability, blocked sensitive output, and the ability to trace which data the AI touched during a session. If those records do not exist, control effectiveness cannot be demonstrated, only assumed.

Q: Who should own accountability for AI data access risk?

A: Accountability should sit with the teams that own identity, data governance, and security operations together. If AI can access enterprise data, then ownership must cover entitlement design, monitoring, and incident response across the full workflow. The governance gap is not just technical, because without a named owner, no one can prove who approved or contained the access.


Technical breakdown

Prompt-layer visibility and the data access blind spot

The prompt layer is where an AI system receives instructions that can trigger retrieval, summarisation, or action against enterprise data. If logs stop at the application layer, teams see that a model was used but not what data was exposed, which query path was taken, or whether the response included sensitive material. That leaves auditability weak and incident reconstruction incomplete. Cyera’s findings point to a common failure mode: access is granted, but evidence of use is missing. In identity terms, the AI becomes an access path without a corresponding control trail.

Practical implication: instrument prompt, retrieval, and output events as access signals, not just application logs.

AI as a first-class identity and the problem of default access

When AI is treated like a normal user or application, it inherits broad access patterns that were never intended for probabilistic systems. That is why over-access shows up so quickly in AI deployments: the system is often provisioned for convenience, then left to roam across knowledge sources, SaaS content, or data stores. In governance terms, the issue is not only permission scope but identity classification. If AI is not managed as its own identity type, least privilege cannot be expressed clearly and lifecycle review becomes blurry.

Practical implication: assign AI a distinct identity model, then scope its access by data class and task boundary.

Real-time blocking, not after-the-fact review, at the agent edge

AI controls fail when they depend on retrospective review alone. If risky prompts, excessive retrieval, or sensitive output are only discovered after the event, the organisation has already accepted the blast radius. Real-time monitoring and policy enforcement at the agent edge are the difference between evidence and containment. This is especially important for autonomous agents, which can chain requests faster than a human operator can manually intervene. The technical issue is timing: the control must evaluate before sensitive material is exposed or forwarded.

Practical implication: place blocking, redaction, and approval gates before sensitive output leaves the AI workflow.


Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.


NHI Mgmt Group analysis

AI data security is now an identity problem, not just a data problem. Once AI can search, retrieve, and surface enterprise information, it behaves like a governed non-human actor with access rights, audit obligations, and containment requirements. The article shows that the hard part is not model capability but access governance, because 83% use AI while visibility remains at 13%. Practitioners should treat AI access as identity-controlled data movement, not as a side effect of application design.

Default access is the named failure mode here, and it is more dangerous than most policy language admits. The report describes AI systems being given broad reach, then monitored too weakly to catch over-access in time. That is a classic identity governance lapse, but in AI it also becomes a prompt-layer data exposure problem because the system can expose information without a conventional login event. The implication is that entitlement models built for deterministic users do not hold when the actor is probabilistic and context-driven.

Prompt-layer blind spots create an evidence gap that undermines both investigations and compliance. If only 13% of organisations have strong visibility, then most teams cannot prove what AI saw, retrieved, or emitted during an incident. That weakens forensic confidence and makes access review largely ceremonial. In practice, this is where NIST CSF visibility objectives and OWASP NHI controls converge: without event evidence at the AI boundary, governance claims remain unverified.

Ephemeral prompt privilege debt: AI systems accumulate short-lived but poorly bounded access paths that are easy to grant and hard to evidence later. The concept matters because AI often touches data through transient prompts, retrieval sessions, and tool calls that disappear before review cycles can act. Practitioners should recognise that the governance debt is not only over-permissioning, but the inability to reconstruct privilege use after the fact.

Autonomous agents intensify the governance gap because they compress decision time below human review cadence. The report’s 76% finding on autonomous agents being hardest to secure is a signal that the control model changes when runtime decisions happen faster than operators can certify them. Access review processes assume access persists long enough to be observed, but autonomous activity can create and consume privilege inside a single operational window. Practitioners need to rethink evidence timing before they rethink policy wording.

From our research:

  • 70% of organisations grant AI systems more access than they would give a human employee performing the exact same job, according to The 2026 Infrastructure Identity Survey.
  • Only 13% of organisations feel extremely prepared for the reality of agentic AI, which helps explain why over-permissioning persists even as deployment accelerates.
  • For a deeper look at how AI agents change identity governance assumptions, see OWASP NHI Top 10 and map the controls to runtime behaviour.

What this signals

Ephemeral prompt privilege debt: AI programmes now create access paths that are short-lived in execution but long-lived in governance fallout. With 70% of organisations granting AI systems more access than an equivalent human employee, the programme risk is not just over-permissioning but the absence of a reliable evidence trail when something goes wrong.

That means the next control maturity step is not another policy statement, it is boundary enforcement where prompts, retrievals, and outputs are observable and blockable. Teams that cannot show what an AI touched will struggle to defend compliance claims, incident timelines, or entitlement decisions.

For practitioners, the most useful signal is whether identity and data teams are operating from the same telemetry set. If AI access cannot be tied to data classification, session context, and containment decisions, the organisation has not solved governance, it has only documented intent.


For practitioners

  • Instrument AI access as identity telemetry Log prompts, retrievals, tool calls, and outputs as access events so you can reconstruct what data the AI touched and when.
  • Assign AI a distinct identity class Stop inheriting broad application permissions by default and define task-scoped entitlements by data classification, environment, and approved use case.
  • Enforce pre-output policy controls Apply redaction, approval gates, and kill switches before sensitive content leaves the AI workflow, especially where public prompts or external models are involved.
  • Build real-time containment for risky AI behaviour Create detection rules for over-access, exfiltration attempts, and unexpected data retrieval, then make automatic blocking available for high-confidence cases.

Key takeaways

  • AI data security fails first as a governance and visibility problem, not as a model-quality problem.
  • The scale of the gap is already material, with most organisations granting AI broader access than comparable human users.
  • Teams need identity-style controls at the prompt and retrieval boundary if they want evidence, containment, and accountability.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and OWASP Agentic AI Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Non-Human Identity Top 10NHI-01AI systems accessing data need identity-scoped governance and auditability.
NIST CSF 2.0PR.AA-01AI visibility gaps map directly to access and asset management weaknesses.
OWASP Agentic AI Top 10Autonomous agents and prompt-layer control issues match agentic AI threat patterns.

Inventory AI access paths and verify monitoring, logging, and containment before expanding use.


Key terms

  • Prompt Layer: The prompt layer is the interface where instructions enter an AI system and can trigger retrieval, generation, or tool use. In practice, it is also an identity boundary because it determines what data the system can reach, what actions it can request, and what evidence should be captured.
  • AI As A First-Class Identity: AI as a first-class identity means the system is governed as its own access-bearing actor, not hidden inside a generic application account. That approach ties entitlements, logging, review, and containment to the AI's actual behaviour, which is essential when the system can retrieve, summarise, or expose sensitive data.
  • Auto-Blocking: Auto-blocking is the ability to stop risky AI behaviour in real time before sensitive data is exposed or forwarded. It turns detection into containment, which matters because after-the-fact review cannot undo a prompt that already retrieved or emitted confidential information.

Deepen your knowledge

AI data security governance and identity control are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building AI guardrails from a similar starting point, it is worth exploring.

This post draws on content published by Cyera: 2025 State of AI Data Security Report. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2025-09-29.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org