Subscribe to the Non-Human & AI Identity Journal

How should security teams handle AI client access to governed data without shared secrets?

Security teams should federate access through the corporate IdP and a token-brokering layer so the data platform receives short-lived credentials tied to an individual user. That preserves auditability, limits token reuse, and avoids creating a separate service-account estate just for AI clients.

Why This Matters for Security Teams

AI clients that read governed data are not just another integration point. They can chain prompts, tool calls, retrieval steps, and retries in ways that make static service accounts a poor fit. The real risk is not only unauthorised access, but the creation of durable credentials that outlive the task and can be reused outside the intended context. That is why modern guidance pushes toward federated identity, short-lived tokens, and policy decisions made at request time, consistent with OWASP Non-Human Identity Top 10 and NIST Cybersecurity Framework 2.0.

NHIMG research shows why this matters operationally: Guide to the Secret Sprawl Challenge documents how secrets fragmentation undermines control, while the Shai Hulud npm malware campaign demonstrates how exposed credentials turn routine automation into a breach path. In practice, many security teams only discover this failure mode after an AI workflow has already cached, copied, or replayed a token beyond the original data request.

How It Works in Practice

The clean pattern is identity federation plus token brokering. The AI workload authenticates to the corporate IdP using workload identity, not a shared password or long-lived API key. The IdP then issues a short-lived credential, often via OIDC, scoped to the user or task context. A token broker can exchange that identity for a downstream token that the data platform accepts, preserving audit trails back to the originating person and the specific action.

This is where intent-based authorisation becomes important. Instead of granting the AI client a broad role, policy evaluates what the agent is trying to do, which dataset is involved, whether the request is read-only, and whether additional approval is required. Current guidance suggests combining this with JIT credential provisioning, so the token exists only for the duration of the task and is revoked or expires immediately after completion. That keeps the control plane closer to ZTA than to legacy perimeter thinking, and it aligns with the spirit of OWASP Non-Human Identity Top 10.

A practical implementation usually includes:

  • Workload identity for the AI service, ideally backed by cryptographic proof rather than a shared secret.
  • Per-request policy evaluation, using policy-as-code to decide access at runtime.
  • Short TTL tokens and automatic revocation on task completion or anomaly detection.
  • Audit logging that records the human principal, the agent action, and the downstream resource.

The CI/CD pipeline exploitation case study and Reviewdog GitHub Action supply chain attack both show how quickly machine credentials become lateral-movement fuel once they are not tightly bound to purpose and time. These controls tend to break down when legacy data platforms only accept static bearer tokens because the platform cannot evaluate context at request time.

Common Variations and Edge Cases

Tighter token scoping often increases integration overhead, so organisations have to balance operational simplicity against stronger containment. That tradeoff is especially visible when multiple AI tools share one backend, or when a vendor-hosted model must access regulated data through a proxy boundary.

There is no universal standard for this yet. Current guidance suggests treating each of these as a separate trust decision: human user, AI workload, tool invocation, and data sensitivity. For high-risk environments, security teams may add step-up approval, row-level filtering, or an isolated broker that can issue different tokens for search, read, and export. This is also where 52 NHI Breaches Analysis is useful, because it shows how identity misuse often starts with overly broad machine access rather than obvious malicious action.

For autonomous or semi-autonomous agents, apply the same logic more strictly. Best practice is evolving toward policy that constrains the agent by intent, not by a standing role. That approach fits the direction of NIST Cybersecurity Framework 2.0 and OWASP Non-Human Identity Top 10, but it becomes difficult in environments that cannot propagate user context into the data plane, especially older warehouse products and SaaS platforms with limited delegation support.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Short-lived, scoped machine credentials reduce exposed NHI secret risk.
NIST CSF 2.0 PR.AC-4 Access rights must be managed and enforced per request, not by standing access.
NIST AI RMF Agent decisions need governance, traceability, and human accountability.

Define accountability, logging, and oversight for AI access decisions before production use.