What should organisations do first if GenAI is connected to sensitive systems?

Start by limiting what the model can reach, then add monitoring and response controls around every action it can trigger. A model with broad permissions becomes a control-plane risk, so security teams should reduce authority before expanding deployment scope.

Why This Matters for Security Teams

When GenAI is connected to sensitive systems, the first risk is not just data leakage. It is delegated action. A model that can read tickets, query records, trigger workflows, or call internal APIs can become a control-plane risk if it has broad standing access. NIST’s NIST AI 600-1 GenAI Profile is useful here because it frames GenAI risk as something to govern across the full system lifecycle, not as a one-time deployment decision.

NHI Management Group’s analysis of the LLMjacking threat shows why this matters operationally: attackers actively target exposed credentials and AI-connected identities because the path from initial access to misuse can be very short. Once a model can act inside a trusted environment, the issue becomes how much authority it can exercise, how quickly that authority can be revoked, and whether every action is attributable. In practice, many security teams discover the exposure only after the model has already been granted access to systems it never should have reached.

How It Works in Practice

The safest first move is to shrink the model’s reachable surface before expanding use cases. That means mapping every system, dataset, and tool the GenAI application can touch, then classifying each one by sensitivity and business necessity. If a connection is not essential for the initial use case, remove it. If a tool is essential, wrap it in a narrow, explicit policy that limits which actions are allowed, on what objects, and under which context.

This is where runtime control matters more than static roles. Traditional IAM often assumes predictable human workflows, but GenAI systems are goal-driven and can chain actions in ways operators did not anticipate. Use short-lived credentials, workload identity, and per-task authorization so the model receives only the access needed for the current request. Current guidance suggests pairing this with policy evaluation at request time, not just at provisioning time. Standards and implementation guidance from NIST AI 600-1 GenAI Profile and NHI controls discussed in the DeepSeek breach research both point toward the same operational pattern: reduce standing authority, then observe every action that crosses a trust boundary.

Inventory every prompt, tool call, connector, and downstream system the model can reach.
Disable broad read and write permissions by default, then re-enable only what the task requires.
Issue ephemeral credentials tied to a single workflow or session, not long-lived secrets.
Log model actions separately from user actions so response teams can reconstruct intent and impact.
Place human approval on high-risk actions such as deletion, payment, privilege grants, or data export.

These controls tend to break down in highly integrated environments where the model is connected to legacy systems, shared service accounts, or automation layers that cannot enforce fine-grained authorization.

Common Variations and Edge Cases

Tighter model access often increases integration overhead, requiring organisations to balance speed of deployment against the cost of wrapping every sensitive workflow in policy and monitoring. That tradeoff is real, especially for pilot programs that were built quickly and later expanded into production. Best practice is evolving, but there is no universal standard for how much autonomy is safe without compensating controls.

For low-risk use cases, a read-only model with curated retrieval may be acceptable if sensitive repositories are segmented and redacted. For higher-risk workflows, the model should never hold broad standing credentials, even if the business asks for convenience. In particular, teams should treat internet-facing agents, multi-agent pipelines, and systems that can invoke finance, identity, or admin tools as separate risk tiers. OWASP’s LLM Top 10 remains a useful reference for prompt injection and tool misuse, while the emerging NHI lesson is that access design must assume the model will eventually be pushed beyond its intended path. The deepest failures usually appear when a “temporary” integration becomes permanent without revisiting scope.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A1	GenAI tool use and action chaining are core agentic abuse paths.
CSA MAESTRO	M1	MAESTRO governs agent autonomy, permissions, and guardrails for sensitive workflows.
NIST AI RMF	GOVERN	AI RMF governance applies when GenAI can act on sensitive systems.

Assign accountability, define acceptable use, and track model-driven actions as governed risk.

What should organisations do first if GenAI is connected to sensitive systems?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group