MCP servers create new NHI governance concerns because they expose application capability to non-human callers through tools and prompts that can be invoked at runtime. That shifts the question from who can reach an API to what an agent is allowed to decide and execute. The governance challenge is preventing the agent from inheriting broader privilege than the user or workflow intended.
Why This Matters for Security Teams
MCP changes the governance model because the risk is no longer limited to API reachability or service authentication. An agent can now discover tools, interpret prompts, and execute actions at runtime, which makes privilege decisions dynamic rather than fixed. That is why static RBAC alone is a poor fit for autonomous workloads. Security teams need to think in terms of intent, context, and short-lived authority, not just whether a caller is authenticated. The governance gap becomes visible when the agent is allowed to compose actions that no single human would have been granted directly. This is a core concern in OWASP Agentic Applications Top 10 and NIST Cybersecurity Framework 2.0, both of which reinforce the need for stronger access governance and continuous control validation.
For NHI programs, MCP servers also expand the secret-handling problem. If tool access is backed by long-lived tokens, service keys, or broad OAuth grants, the agent can inherit more authority than the workflow intended. NHIMG’s Top 10 NHI Issues and Ultimate Guide to NHIs — Lifecycle Processes for Managing NHIs both show why lifecycle control, rotation, and scoped access matter more once non-human callers can act on behalf of users. In practice, many security teams encounter MCP over-privilege only after an agent has already chained tools into an unintended action, rather than through intentional design.
How It Works in Practice
The practical question is not whether an MCP server is “secure” in the abstract, but how it evaluates each request from an AI agent. Best practice is evolving toward intent-based authorisation: the server or policy layer checks what the agent is trying to do, which tool it wants, which data it can touch, and whether the user’s context supports that action. That is a better fit for agents than pre-defined role bundles, because agents are goal-driven and can change tactics mid-task. Guidance from OWASP Agentic AI Top 10 and Analysis of Claude Code Security both point to the same operational pattern: evaluate authority at runtime, not just at login.
That usually means combining several controls:
- JIT credential provisioning so the agent receives authority only for the current task.
- Ephemeral secrets with tight TTLs, so stolen credentials expire before they can be reused.
- Workload identity, such as SPIFFE/SPIRE or OIDC-based proof of workload identity, so the system knows what the agent is rather than only what secret it holds.
- Policy-as-code, such as OPA or Cedar, so the decision reflects current context, requested tool, and data sensitivity.
- Tool-level scoping, so an agent can call one function without inheriting an entire API surface.
This aligns with the governance posture in the Ultimate Guide to NHIs — Regulatory and Audit Perspectives and the agentic control direction in NIST Cybersecurity Framework 2.0. These controls tend to break down when MCP tools are exposed through shared gateways that cannot distinguish one task from another because context is lost at the policy boundary.
Common Variations and Edge Cases
Tighter control often increases operational overhead, requiring organisations to balance safety against developer velocity and agent reliability. That tradeoff becomes sharper in environments where agents must complete multi-step workflows across multiple tools, because every extra approval or token exchange can slow execution. Current guidance suggests using coarse guardrails for low-risk actions and finer-grained runtime policy for high-impact actions, but there is no universal standard for this yet. The emerging consensus in 52 NHI Breaches Analysis is that identity sprawl and weak lifecycle controls are what turn convenience into exposure.
Edge cases appear when MCP servers are embedded inside developer tooling, when agents operate across SaaS boundaries, or when a single agent can broker actions for multiple human users. In those cases, the question is not only “who authenticated?” but also “whose intent is this, and how far may the agent proceed?” That is where Ultimate Guide to NHIs — What are Non-Human Identities is useful, because it frames the identity lifecycle that has to exist before governance can be enforced. For control mapping, OWASP Agentic AI Top 10, NIST Cybersecurity Framework 2.0, and CSA MAESTRO all support the same direction: treat agent authority as temporary, inspectable, and bounded by purpose. The model becomes most fragile when an MCP deployment is treated like a normal service account problem, because agents do not behave like static workloads.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | Agentic tool use needs runtime policy and bounded authority. | |
| CSA MAESTRO | MAESTRO addresses governance for autonomous, multi-step agent workflows. | |
| NIST AI RMF | AI RMF supports accountable governance for unpredictable AI behaviour. |
Apply agentic controls to evaluate tool use at request time and limit autonomous action scope.