How should security teams start using generative AI safely?

Why This Matters for Security Teams

Generative AI is most useful when it accelerates work that already exists, but it becomes risky the moment it is allowed to shape security outcomes without review. The safest starting point is low-risk support tasks: summarising alerts, drafting investigations, compressing threat intelligence, and polishing reports. The harder problem is not the prompt itself, but controlling what data enters the model, what the model can infer, and what actions humans still own.

That distinction matters because early mistakes often look like convenience gains until sensitive context leaks into prompts, outputs are treated as facts, or a draft recommendation is mistaken for an approved change. NIST’s NIST AI 600-1 Generative AI Profile frames this as a risk management problem, not a novelty problem, which is the right starting point for security teams.

NHI Management Group’s research on the DeepSeek breach and the Microsoft Azure OpenAI service breach shows why model-facing systems need the same discipline as other production services: secrets control, logging, and strict access boundaries. In practice, many security teams discover the real exposure only after a prompt, plugin, or dataset has already carried sensitive data outside the intended boundary.

How It Works in Practice

Safe adoption starts by treating generative AI as an assistive control, not an autonomous control. The first use cases should be bounded, reversible, and easy to verify. That means the model can draft, classify, or summarise, but a human still approves anything that changes tickets, alerts, access, or production state. The workflow should also keep sensitive material out of prompts unless there is a documented business need and an approved retention model.

A practical rollout usually includes four controls:

Data minimisation: remove secrets, personal data, and incident-sensitive details before the prompt is formed.

Human review: require analyst sign-off for summaries, recommendations, and any remediation text.

Prompt governance: use approved templates, logging, and version control so teams know what was asked and why.

Output validation: compare model output against source evidence, not against confidence or fluency.

For teams building this into security operations, the model should be connected through least-privilege service accounts and monitored like any other sensitive workflow. That includes clear ownership, audit logs, and a rollback path if the output is wrong or misleading. The key is to make the AI useful without letting it become a hidden decision-maker. Guidance from the NIST AI 600-1 GenAI Profile supports this staged approach, while NHIMG’s reporting on live credential abuse patterns in the DeepSeek breach reinforces why prompt and secret hygiene matter from day one. These controls tend to break down when teams connect the model directly to ticketing, messaging, or cloud-admin tools because the output can trigger action faster than review can keep up.

Common Variations and Edge Cases

Tighter review and data controls often increase friction, so teams have to balance speed against the cost of bad output or leakage. That tradeoff becomes visible in incident response, where analysts want rapid summaries but cannot afford unverified recommendations. Current guidance suggests keeping AI in the draft layer for high-impact workflows until the team has evidence quality checks and a clear approval path.

There are also edge cases where “safe” use is less obvious. Model-assisted threat hunting may be acceptable if the model only ranks hypotheses and never queries live systems directly. Customer-facing content generation is often lower risk than internal incident data, but it can still expose confidential tactics if prompt history is retained or reused. Best practice is evolving on whether organisations should allow fine-tuning, retrieval, or external plugins in early deployments, so those decisions should be treated as exceptions rather than defaults.

Security teams should also be careful with shared assistants, copied prompts, and unmanaged browser tools, since these often blur the boundary between approved and unapproved use. A sensible first rollout keeps generative AI inside a named workflow with named owners, instead of letting it spread ad hoc across the SOC.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10		Covers prompt injection, unsafe tool use, and AI output trust issues.
CSA MAESTRO		Applies to secure governance of AI workflows and operational guardrails.
NIST AI RMF		Sets the risk management basis for introducing GenAI into security processes.

Constrain prompts, isolate tools, and require human approval before any AI-driven action.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

How should security teams start using generative AI safely?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group