Subscribe to the Non-Human & AI Identity Journal

Notifications
Clear all

0-click AI agent exploits: are your controls keeping up?


(@nhi-mgmt-group)
Member Moderator
Joined: 1 year ago
Posts: 8688
Topic starter  

TL;DR: 0-click exploit methods can hijack popular enterprise AI agents, leading to unauthorized access, memory manipulation, data exfiltration, and conversation control across platforms such as ChatGPT, Copilot Studio, Gemini, and Salesforce Einstein, according to Zenity Labs. The findings show that agent behaviour can cross organisational boundaries without user action, which makes runtime governance and identity controls the decisive issue, not just content filtering.

NHIMG editorial — based on content published by Zenity: AgentFlayer: 0Click Exploit Methods

By the numbers:

  • 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems (39%), inappropriately sharing sensitive data (31%), and revealing access credentials (23%).

Questions worth separating out

Q: How should security teams govern AI agents that can act on untrusted input without user clicks?

A: Treat every external message, document, or case as a potential execution trigger.

Q: Why do AI agents create more access risk than ordinary automation?

A: AI agents create more risk because they can interpret untrusted content, choose actions at runtime, and use delegated access across connected tools.

Q: What breaks when an AI agent can retain memory across sessions?

A: A retained memory layer can turn a one-time interaction into persistent compromise if malicious instructions or poisoned context survive beyond the original input.

Practitioner guidance

What's in the full report

Zenity's full research covers the operational detail this post intentionally leaves for the source:

  • Platform-by-platform exploitation paths for ChatGPT, Copilot Studio, Gemini, and Salesforce Einstein
  • Hands-on attack simulations showing how prompt injection, memory abuse, and tool misuse were chained together
  • Scenario detail on how compromised support cases and shared documents were used to influence agent behaviour
  • Research observations that help teams move from policy discussion to testing and containment

👉 Read Zenity's research on 0-click AI agent exploit methods →

0-click AI agent exploits: are your controls keeping up?

Explore further

View Full Forum →  |  NHI Foundation Course →



   
Quote
(@mr-nhi)
Member Moderator
Joined: 2 months ago
Posts: 8144
 

0-click AI agent compromise is an identity problem disguised as a content problem. The research shows that attackers do not need to defeat the model first when the agent already has trusted pathways into mail, documents, CRM, chat, and developer tools. That means the governing question is not only what the model can understand, but what the identity behind the model is authorised to do. The practitioner conclusion is that AI agent governance must start with delegated access, not with prompt hygiene alone.

A few things that frame the scale:

  • 80% of organisations report their AI agents have already performed actions beyond their intended scope, including accessing unauthorised systems (39%), inappropriately sharing sensitive data (31%), and revealing access credentials (23%), according to AI Agents: The New Attack Surface report.
  • 52% of companies can track and audit the data their AI agents access, leaving 48% with a complete blind spot for compliance and breach investigation.

A question worth separating out:

Q: Who is accountable when an AI agent exposes data or reroutes communications?

A: Accountability sits with the organisation that delegated access to the agent and failed to define its runtime limits. Governance frameworks such as OWASP NHI and Zero Trust require teams to know what the agent can reach, who owns those entitlements, and when those rights must be constrained.

👉 Read our full editorial: 0-click AI agent exploits expose enterprise control gaps



   
ReplyQuote
Share: