Why do AI projects get stuck in pilot purgatory?

Why This Matters for Security Teams

AI projects rarely fail because the model cannot produce useful output. They stall when governance cannot answer a basic operational question: who can approve, with what authority, and under what conditions? In security terms, that is an access and accountability problem, not a model-quality problem. The gap matters because pilots often touch sensitive data, secrets, production APIs, or customer workflows long before the organisation has defined a durable control path.

This is why the issue shows up in secrets handling, policy review, and exception management at the same time. NHIMG’s research on The State of Secrets in AppSec shows how fragmented control can be even in mature environments, while NIST Cybersecurity Framework 2.0 emphasises governance as a core function rather than an afterthought. If approval authority stays informal, pilots become permanent exceptions instead of managed services. In practice, many security teams encounter production shadowing only after business users have already built dependency around the pilot, rather than through intentional release governance.

How It Works in Practice

Moving out of pilot purgatory requires treating the AI workload like an operational service with defined control boundaries. That means assigning a named owner, a decision path for production approval, and a policy set that can be enforced consistently. For AI systems that interact with data, tools, or other systems, the governance model has to cover both access and behaviour. A pilot may be allowed to experiment in a constrained environment, but production should require explicit criteria for data scope, logging, human oversight, rollback, and secrets handling.

In practice, the most effective teams separate three questions:

What is the model or agent allowed to see?

What actions can it take without human intervention?

Who can override, revoke, or promote it into production?

That structure aligns with current guidance from NIST Cybersecurity Framework 2.0, where governance and risk ownership are not optional add-ons. It also fits the real-world lessons in NHIMG’s LLMjacking: How Attackers Hijack AI Using Compromised NHIs research, which shows how quickly exposed credentials can be abused once an AI system touches live infrastructure. For AI pilots, approval should not depend on a meeting cadence. It should depend on whether the system meets predefined controls for identity, data access, and operational containment. These controls tend to break down when pilots share credentials with human operators because revocation, attribution, and auditability become ambiguous.

Common Variations and Edge Cases

Tighter governance often increases delivery overhead, requiring organisations to balance speed against control. That tradeoff is real, especially when business teams want to prove value quickly. The best practice is evolving, but current guidance suggests that speed should come from automation and clear thresholds, not from bypassing approval. If every pilot needs bespoke review, bottlenecks are inevitable.

Some environments make this harder. Regulated industries may need formal risk sign-off before any production exposure, while fast-moving product teams may prefer lightweight guardrails and staged rollout criteria. Multi-team AI platforms also create edge cases where the pilot owner, data owner, and security approver are different people, so the approval model must be explicit or it collapses into informal consensus. NHIMG’s DeepSeek breach coverage is a reminder that scale and speed magnify exposure when governance trails behind deployment. There is no universal standard for this yet, but mature programmes define promotion gates, revoke stale exceptions, and require a production control owner before a pilot can advance.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
NIST CSF 2.0	GV.OV-01	Pilot purgatory is a governance and ownership failure.
NIST AI RMF	GOVERN	The issue is weak accountability and unclear decision rights.
OWASP Agentic AI Top 10	A2	AI systems need explicit operational guardrails before broad release.

Define accountable AI governance roles, escalation paths, and promotion gates for production use.

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Why do AI projects get stuck in pilot purgatory?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group