How should security teams prove who did what in AI systems?

Security teams should bind every meaningful AI action to a named identity, an authorization chain, and a timestamped record that survives audit review. The answer must be reconstructable from logs, not dependent on memory or manual explanation. Without that evidence chain, compliance will usually treat the action as untrusted and the deployment as incomplete.

Why This Matters for Security Teams

Proving who did what in AI systems is harder than in traditional applications because AI actions are often mediated by agents, tool calls, retrieval layers, and background workflows rather than a single user clicking a button. Security teams need evidence that ties each meaningful action to an identity, an authorization decision, and a timestamped record that can survive later review. The NIST Cybersecurity Framework 2.0 is useful here because it reinforces traceability, accountability, and continuous governance, but it does not replace the need for AI-specific evidence chains.

The practical problem is that AI systems can appear to act on behalf of a user while actually executing through service accounts, orchestration layers, or delegated credentials. That creates ambiguity unless logging is designed to preserve the full chain of custody from request to outcome. In NHIMG research on the State of Non-Human Identity Security, inadequate monitoring and logging is cited as a major attack driver, which is a reminder that proof fails when identity and action are separated in the audit trail. In practice, many security teams encounter missing accountability only after an incident review or compliance challenge has already exposed the gap.

How It Works in Practice

Strong attribution in AI systems starts by making every actor machine-verifiable. That means binding the human user, the AI agent, the service identity, and the tool or API call into one traceable record. For autonomous or semi-autonomous systems, the key is not just “who logged in,” but “who authorized this action, under what context, and with what credential or workload identity.” Where possible, use workload identity primitives such as SPIFFE or OIDC-backed tokens so the system can prove what executed, not just what was claimed.

Security teams usually need four layers of evidence:

Identity proof for the initiator, such as SSO session, API token, or agent workload identity.
Authorization proof showing the policy decision at the time of the action.
Action telemetry capturing tool calls, prompts, retrievals, and outbound side effects.
Immutable timestamps and correlation IDs so the event can be reconstructed later.

Current guidance suggests using policy-as-code and evaluating authorization at runtime, especially for agentic workflows where static roles are too blunt. The issue is not just access control, but replayable accountability. An AI system that can invoke tools, chain steps, or delegate sub-tasks needs a log trail that records the full decision path. The DeepSeek breach analysis is a useful reminder that AI risk often becomes visible only after data exposure or uncontrolled behavior has already occurred. This guidance tends to break down in highly distributed environments where logs are split across model hosts, vector databases, proxies, and third-party tools because correlation disappears between systems.

Common Variations and Edge Cases

Tighter attribution often increases operational overhead, requiring organisations to balance forensic certainty against latency, storage, and engineering complexity. That tradeoff is especially visible in agentic AI, where every tool call can generate multiple downstream actions and every extra logging hop can affect throughput.

There is no universal standard for this yet, but best practice is evolving toward layered evidence rather than a single log source. For low-risk internal copilots, request-level audit logs may be sufficient. For systems that can send emails, move money, change records, or trigger production workflows, security teams should preserve prompt context, policy decisions, model outputs, tool invocations, and revocation events. When secrets or delegated tokens are involved, short-lived credentials make attribution stronger because they reduce ambiguity after the fact.

Edge cases usually appear when human approval and agent autonomy overlap. If a person approves a task but the agent later chooses the sequence of tool use, the audit record must separate the human decision from the autonomous execution. That distinction matters for incident response, legal review, and regulatory reporting. The most common failure mode is assuming the model output is the action itself, when in reality the accountable event is the downstream side effect created by the system that consumed it.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
OWASP Agentic AI Top 10	A1	Agentic systems need traceable action chains and accountable execution.
CSA MAESTRO		MAESTRO addresses governance and observability for autonomous AI workflows.
NIST AI RMF	GOVERN	AI governance requires accountability, traceability, and oversight mechanisms.

Implement runtime audit trails that connect human intent to agent execution and downstream actions.

How should security teams prove who did what in AI systems?

Why This Matters for Security Teams

How It Works in Practice

Common Variations and Edge Cases

Standards & Framework Alignment

Related resources from NHI Mgmt Group