Subscribe to the Non-Human & AI Identity Journal

Should organisations separate MLOps approvals from security approvals?

No. AI release decisions should combine platform, security, and risk approval because the same change can affect model behaviour, data integrity, and infrastructure exposure at once. Splitting those controls creates gaps where a model can be operationally deployed before its trust chain has been verified.

Why This Matters for Security Teams

Separating MLOps approvals from security approvals creates a false sense of control in exactly the kind of change process that can affect model behaviour, training data integrity, runtime permissions, and infrastructure exposure at the same time. For AI systems, release risk is rarely isolated to one layer. A model promotion can change what the system can infer, what data it can reach, and which secrets or tools it can use.

That is why current guidance increasingly treats AI release governance as a shared decision point rather than a handoff between silos. The NIST Cybersecurity Framework 2.0 supports coordinated governance across risk, protection, and recovery, while NHIMG research on the State of Non-Human Identity Security shows how often organisations still lack confidence in controlling non-human access. In AI operations, that gap becomes dangerous when deployment controls, identity controls, and model-risk checks are not evaluated together.

Practitioners usually underestimate how quickly an approved model change can become a security event once it is connected to live credentials, external APIs, or autonomous tool use. In practice, many security teams encounter the abuse path only after the release has already been promoted into production.

How It Works in Practice

Effective AI release governance uses a single approval flow with distinct checks, not separate approval authorities that can be bypassed or delayed independently. The MLOps function should validate model quality, lineage, and deployment readiness. Security should validate secrets handling, workload identity, access boundaries, and logging. Risk or governance should confirm the business impact and acceptable use profile. The key is that the release does not move forward until all required checks are complete.

For autonomous or semi-autonomous AI systems, this usually means treating the deployment as a workload identity event, not just a software release. The model, agent, or serving pipeline should present cryptographic identity, obtain just-in-time credentials where needed, and operate under runtime policy that is evaluated at request time. Standards such as NIST CSF 2.0 and emerging AI governance guidance from NIST support this kind of integrated control model, even though there is no universal standard yet for AI release approval design.

  • Require one release gate with multiple required sign-offs, not separate go or no-go paths.
  • Bind approvals to the exact model artifact, dataset version, prompt package, and runtime policy.
  • Verify workload identity and secret scope before production access is granted.
  • Use short-lived credentials and revoke them automatically when the release is rolled back or retired.
  • Log who approved what, with evidence attached to the change record.

This approach aligns with NHIMG guidance on controlling non-human access, especially where identity sprawl and secrets exposure increase deployment risk. The Ultimate Guide to Non-Human Identities highlights how excessive privileges and weak secret hygiene can turn routine operational change into a breach path. These controls tend to break down in fast-moving CI/CD environments where model promotion, infrastructure provisioning, and credential issuance happen on different timelines.

Common Variations and Edge Cases

Tighter approval control often increases release latency, requiring organisations to balance speed against the cost of an unsafe or unaudited deployment. That tradeoff becomes more pronounced when teams run multiple models, experiment frequently, or ship to customer-facing environments on short cycles.

There is no universal standard for this yet, but best practice is evolving toward risk-tiered approvals. Low-impact internal models may need lighter review, while customer-facing, agentic, or data-connected systems should require stronger cross-functional sign-off. The important distinction is not whether MLOps and security use different expertise. It is whether they are allowed to approve the same release independently without a shared control record.

Edge cases also matter. A retrained model with no code change can still alter outputs, access patterns, and downstream automation. A configuration-only release can widen data exposure if it changes token scope or tool permissions. If the system uses external APIs or third-party plugins, approval should include the trust chain around those dependencies as well. NHIMG research on the Hugging Face Spaces breach is a reminder that operational convenience and identity trust are tightly linked in real deployments. In environments with delegated engineering teams and weak change traceability, separate approvals tend to fail because nobody owns the combined risk picture.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Release approvals must cover credential rotation and trust-chain changes.
CSA MAESTRO MAESTRO addresses governance for agentic and model-driven systems at runtime.
NIST AI RMF AI RMF GOVERN and MAP functions fit shared AI release risk decisions.

Tie every model release to secret rotation, scope review, and revocation evidence.