Notifications

Clear all

AI agent identity and Lilli: what IAM teams need to know

Last Post

RSS

NHI Mgmt Group

(@nhi-mgmt-group)

Member Moderator

Joined: 1 year ago

Posts: 12212

Topic starter 24/06/2026 7:33 pm

TL;DR: McKinsey’s Lilli breach showed that authenticated tokens do not prove an AI agent was the right actor, approved for the right action, or operating under the right identity, according to 1Kosmos. The incident is a reminder that enterprise AI security now depends on continuously verifiable agent identity and action-level authorization, not endpoint-only controls.

NHIMG editorial — based on content published by 1Kosmos covering the McKinsey Lilli breach: AI agent identity, authentication, and action authorisation

By the numbers:

Two hours later, it had full read-write access to McKinsey & Company's internal AI platform, exposing 46.5 million chat messages, 728,000 files, 57,000 user accounts, and 95 system prompts.

Questions worth separating out

Q: What breaks when AI agents rely on token-based authentication alone?

A: Token-based authentication proves that a caller presented valid credentials, but it does not prove the caller is the intended agent, that the action was authorised, or that a human approved a consequential step.

Q: Why do AI agents complicate traditional IAM and PAM controls?

A: AI agents can decide, chain tools, and execute actions faster than periodic IAM and PAM processes can review.

Q: How can organisations tell whether an AI agent is acting within scope?

A: They need policy that binds each agent to a specific role, data boundary, and action class, then records whether each high-risk operation was approved or blocked.

Practitioner guidance

Bind each AI agent to a distinct identity Replace shared service account patterns with per-agent identities that can be verified as the specific registered agent instance, not just as a valid credential holder.
Separate read paths from write paths Classify database writes, prompt changes, and configuration edits as distinct privileged operations, then require explicit authorisation for each privileged action path.
Require approval for consequential actions Use a human approval gate for operations that can change model behaviour, expose cross-user data, or modify production policy, even when the request originates from a trusted agent.

What's in the full article

1Kosmos's full analysis covers the operational detail this post intentionally leaves for the source:

A step-by-step breakdown of the authentication chain used in the Lilli scenario and where conventional controls stop.
Concrete explanation of cryptographic agent identity binding and how it changes trust decisions at runtime.
Detailed examples of CIBA-style approval flow for high-risk AI agent operations.
A side-by-side comparison of traditional token checks versus identity-bound agent authorisation in production workflows.

👉 Read 1Kosmos's analysis of the McKinsey Lilli AI agent identity breach →

AI agent identity and Lilli: what IAM teams need to know?

Explore further

View Full Forum → | NHI Foundation Course →

Quote

Topic Tags

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11787

25/06/2026 4:54 am

Token validation is not agent identity. The Lilli breach shows that a valid token can still belong to the wrong actor, the wrong action, or the wrong moment. Traditional authentication was designed for requests that arrive from a stable caller, not for AI agents that probe, decide, and act inside the same session. The practitioner implication is that agent identity has to be verified as behaviour, not assumed from credential possession.

A few things that frame the scale:

91.6% of secrets remain valid five days after the targeted organisation is notified, showing a critical gap in remediation procedures, according to the Ultimate Guide to NHIs.
Only 5.7% of organisations have full visibility into their service accounts, according to the Ultimate Guide to NHIs.

A question worth separating out:

Q: Who is accountable when an AI agent changes data or system behaviour?

A: Accountability should sit with the team that owns the agent, the approval policy, and the environment it can affect. If a platform allows an agent to modify prompts, data, or configuration without a named approver, accountability has been designed out of the workflow. Governance must be explicit before deployment, not reconstructed after an incident.

👉 Read our full editorial: AI agent identity is the real failure in the Lilli breach

ReplyQuote

Forum Statistics

11 Forums

13.5 K Topics

25.8 K Posts

86 Online

135 Members

Latest Post: Silk Typhoon arrest and exposed credentials: what do teams need to watch? Our newest member: Alex Recent Posts Unread Posts Tags

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Get in Touch

Quick Links

FAQ

NHI 101 Articles

Legal & Policies