Subscribe to the Non-Human & AI Identity Journal

How should security teams govern MCP workflows that mix models, servers, and users?

Treat MCP as a delegated workflow problem, not just a protocol feature set. Security teams should assign control points for sampling, external authorisation, and structured clarification, then log who approved each transition. The goal is to keep identity, consent, and execution boundaries explicit whenever a server participates more actively in the workflow.

Why This Matters for Security Teams

MCP is not just a transport layer for prompts and tools. When a workflow mixes a model, a server, and a human approver, the security problem becomes delegated execution: who can request action, who can approve it, and what the server is allowed to do on behalf of that request. That is why governance should align with the OWASP Agentic AI Top 10 and the NIST Cybersecurity Framework 2.0, not with a protocol-only checklist.

Current guidance suggests that the most fragile point is not the model itself, but the transition between intent and execution. If the server can sample, ask for clarification, or trigger external authorisation, each step becomes a security boundary that must be explicit, logged, and attributable. NHIMG research on OWASP Agentic Applications Top 10 shows how quickly agentic workflows expand attack surface when control points are vague. In practice, many security teams discover permission drift only after a server has already chained a benign request into an unsafe action.

How It Works in Practice

The safest MCP governance model treats every workflow transition as a policy decision. Security teams should define which party can initiate the request, which server capabilities are in scope, and which actions require human confirmation before the tool call is executed. This is closer to intent-based authorisation than static RBAC, because the risk depends on what the workflow is trying to do at that moment.

In practice, the control design usually includes three checkpoints:

  • Sampling approval, where the server is allowed to inspect or transform context only within a narrowly scoped purpose.
  • External authorisation, where a privileged or irreversible tool action is paused until a separate policy engine or human approves it.
  • Structured clarification, where the server can ask for missing data, but only through logged and bounded prompts that preserve consent.

That design should be paired with workload identity for the server, not just user authentication. Cryptographic identity, short-lived tokens, and request-time policy evaluation help prove what the server is and what it is allowed to do right now. For implementation patterns, the industry is still converging, but the most useful reference points are OWASP Top 10 for Agentic Applications 2026 and the NHI lifecycle guidance in NHIMG’s Ultimate Guide to NHIs.

Telemetry matters as much as access control. Teams should log the approving identity, the server action, the policy decision, and the data scope at each transition so that audits can reconstruct who consented to what. NHIMG’s Analysis of Claude Code Security is a useful reminder that tool-using systems become harder to govern once approvals are implicit instead of recorded. These controls tend to break down when MCP servers are embedded in multi-tenant developer environments because approval context gets lost across sessions, projects, and shared tooling.

Common Variations and Edge Cases

Tighter MCP governance often increases workflow friction, so organisations have to balance safer approvals against developer speed and operator fatigue. Best practice is evolving here, and there is no universal standard for when a server may auto-continue versus when it must stop for consent.

One common edge case is “trusted” internal servers that still have broad tool access. The trust boundary does not disappear just because the server is inside the perimeter; in fact, the risk can be higher when the server can read secrets, call external APIs, or modify downstream systems without a second check. NHIMG’s Top 10 NHI Issues and Ultimate Guide to NHIs both reinforce that auditability and lifecycle control matter whenever delegated execution exists.

Another variation is the approval loop itself. If clarification prompts are too open-ended, the model can steer the human toward granting broader access than intended. If they are too rigid, users bypass the workflow entirely. The practical answer is to constrain the request format, require purpose binding, and retain immutable records of every approval path. That balance becomes especially hard in high-velocity environments such as CI/CD, customer support automation, and mixed human-agent operations, where context changes faster than manual review can keep up.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 A1 Covers unsafe agent actions and tool-use boundaries in MCP workflows.
CSA MAESTRO GOV-2 Addresses governance for multi-step agentic workflows and delegated actions.
NIST AI RMF GOVERN Applies risk management and accountability to model-led workflow decisions.

Define control points for sampling, approval, and clarification across the full MCP workflow.