Retrieval-based AI returns information, while action-capable AI can trigger changes in external systems. That difference matters because retrieval can usually be governed with data controls, but action requires identity controls, policy checks, and audit logging. Once a model can act, the security question shifts from answer quality to delegated authority.
Why This Matters for Security Teams
Retrieval-based AI and action-capable AI may look similar from the outside, but they create very different risk models. Retrieval is primarily about what information the system can surface, so controls focus on data handling, prompt filtering, and output review. Once an AI can act, the concern shifts to delegated authority: which identities it uses, what systems it can reach, and how quickly those privileges can be revoked. That is where NHI governance becomes decisive.
This distinction is easy to miss when teams treat every AI workload as a chat interface. In practice, action-capable systems can invoke APIs, update records, open tickets, trigger workflows, and chain tools without a human in the loop. Current guidance suggests that those systems should be governed more like autonomous workloads than like search tools. NIST’s NIST Cybersecurity Framework 2.0 is useful here because it anchors the problem in identity, access, and oversight rather than in model quality alone. NHIMG’s Ultimate Guide to NHIs — What are Non-Human Identities frames the same issue from an operational identity perspective.
In practice, many security teams encounter abuse only after an agent has already used legitimate access to make an unauthorised change, rather than through intentional testing of delegated authority.
How It Works in Practice
Retrieval-based AI usually operates inside a bounded read path. It searches indexed content, ranks results, and returns text or citations. Security teams can often govern that with content controls, document permissions, and output moderation. Action-capable AI is different because it must be authorised to do something, not just say something. The identity question becomes central: what workload identity is the agent using, what scopes are attached, and are those scopes issued per task or held indefinitely?
For autonomous or semi-autonomous agents, best practice is evolving toward workload identity, runtime policy checks, and just-in-time credentials. That means the agent receives short-lived access for a specific task, the policy engine evaluates the request in context, and the credential is revoked when the task ends. This approach reduces the blast radius of a compromised prompt, poisoned tool output, or malicious instruction sequence. It also fits the reality that AI agents can chain actions across systems in ways humans do not fully predict. NHI governance should therefore focus on:
- short-lived secrets instead of static API keys
- fine-grained scopes for each tool or API call
- real-time policy evaluation before execution
- immutable audit logs for every external action
- separation between retrieval-only and write-capable workloads
When an AI system is allowed to act, identity becomes the security boundary. That is why the LLMjacking: How Attackers Hijack AI Using Compromised NHIs research is so relevant: once credentials are exposed, attackers move fast, and the exposed identity often matters more than the model itself. The operational pattern mirrors broader supply-chain abuse as described in the NIST Cybersecurity Framework 2.0. These controls tend to break down when an agent has broad write access across multiple SaaS platforms because the same credential can be reused to chain low-risk actions into a high-impact change.
Common Variations and Edge Cases
Tighter action controls often increase operational overhead, requiring organisations to balance speed against containment. That tradeoff is real: a retrieval-only assistant can often tolerate broader data access, while a write-enabled agent usually cannot. Current guidance suggests treating “action-capable” as a spectrum rather than a binary. A system that drafts an email, approves a workflow step, or posts to a ticketing system may be low risk in one context and high risk in another depending on who can approve the final action and what side effects are possible.
Some deployments blur the line further. A retrieval tool that calls external search APIs may already be leaving the local trust boundary, and an agent that “only recommends” an action may still influence a human to execute it. There is no universal standard for this yet, so teams should classify systems by the highest consequence they can cause, not by their intended purpose. This is especially important when one model fronts multiple tools with different trust levels.
NHIMG’s DeepSeek breach coverage is a reminder that security failures around AI often combine data exposure with operational exposure. The practical rule is simple: retrieval can usually be governed as information access, but action requires delegated authority controls, continuous policy evaluation, and revocation paths that work in seconds, not days.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | A01 | Covers unauthorized agent actions and tool misuse in autonomous workflows. |
| CSA MAESTRO | TIC-02 | Addresses agent trust boundaries and delegated execution risks. |
| NIST AI RMF | GOVERN | Supports accountability for AI systems that can influence or execute external actions. |
Classify each tool as read or write, then gate write actions with runtime policy and approvals.
Related resources from NHI Mgmt Group
- What is the difference between managed identities and hardcoded secrets for AI agents?
- What is the difference between human identity governance and AI agent governance?
- What is the difference between workload identity and API keys for AI agents?
- What is the difference between governing human access and governing AI agent access?