What should organisations do first when AI agents and shadow integrations are spreading?

Start with discovery, then rank connections by data sensitivity and delegated authority. The immediate goal is to identify which non-human identities can access regulated or confidential data, because those paths define the highest containment priority when an incident occurs.

Why This Matters for Security Teams

When AI agents and shadow integrations spread, the first risk is not scale alone. It is delegated authority without visibility. An agent that can call APIs, read files, and chain actions across services can bypass the assumptions built into human-centric IAM. That makes discovery the essential first step, because security teams cannot contain what they have not mapped. Guidance from the NIST AI Risk Management Framework and OWASP Agentic AI Top 10 both point toward identifying high-risk behaviors before trying to fine-tune policy.

NHIMG research reinforces the urgency: in the AI Agents: The New Attack Surface report, only 52% of companies can track and audit the data their AI agents access, leaving the rest with a compliance blind spot. That gap matters because shadow integrations often appear as “temporary” automations, then become persistent, privileged pathways into regulated data. In practice, many security teams encounter the breach path only after an agent has already touched sensitive systems, rather than through intentional review.

How It Works in Practice

The practical first move is to build an inventory of every AI agent, bot, service account, token exchange, webhook, and low-code integration that can act without a human approving each request. This is discovery at the workload level, not just the app level. Security teams should identify what each identity can reach, what data it can read or write, and whether the authority is static or conditional. That is where NHI governance becomes operational, especially when agents are using short-lived tokens, delegated OAuth grants, or tool calls that are difficult to distinguish from normal application traffic.

A workable triage method usually starts with three questions:

Does the agent or integration touch regulated, confidential, or production data?
Can it create new access paths, such as forwarding data to other tools or systems?
Does it hold standing privilege, or can it be constrained through just-in-time issuance?

Discovery should then be paired with authority ranking. An agent that can only summarize public content is not the same as one that can retrieve customer records, trigger payments, or export credentials. Use that ranking to prioritize containment: revoke unused credentials, reduce scopes, separate production from non-production, and move critical workflows toward workload identity and runtime policy checks. The security model should be informed by sources such as the Ultimate Guide to NHIs and 2025 outlook and the CSA MAESTRO agentic AI threat modeling framework, both of which emphasize that identity, data access, and runtime behavior must be evaluated together.

These controls tend to break down when shadow integrations are embedded in business workflows with no clear owner, because no one can confidently approve removal or scope reduction.

Common Variations and Edge Cases

Tighter discovery and ranking often increases operational friction, requiring organisations to balance containment speed against business continuity. That tradeoff is especially sharp when AI agents are embedded in customer support, engineering, or analytics workflows where teams rely on fast, autonomous action. Current guidance suggests starting with the highest-sensitivity connections first, rather than trying to catalog every integration before making any changes.

There is no universal standard yet for how to classify every agentic workflow, so teams should treat classifications as living decisions. A marketing assistant that drafts messages may be low risk until it inherits access to shared drives or CRM exports. Likewise, a “shadow integration” can be deceptively low friction while still carrying broad delegated authority through API keys or service accounts. This is why static role reviews alone are insufficient for autonomous systems.

Where this approach gets harder is in environments with fragmented secrets stores, unmanaged OAuth consent, or multiple teams creating agents independently. NHIMG’s State of Secrets in AppSec research highlights how fragmented secrets management weakens central control. In those environments, the first priority is not perfection; it is restoring visibility over which non-human identities can reach sensitive data, then constraining the most powerful paths before more integrations spread.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A01	Agent sprawl and hidden integrations create agentic attack surface.
CSA MAESTRO	M1	MAESTRO centers discovery and threat modeling for agentic workflows.
NIST AI RMF	GOVERN	AI RMF governance requires accountability for risky autonomous behavior.

Inventory agent actions and restrict tool access by runtime context, not static roles.

What should organisations do first when AI agents and shadow integrations are spreading?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group