By NHI Mgmt Group Editorial TeamPublished 2026-06-24Domain: Agentic AI & NHIsSource: Kong

TL;DR: Generative AI becomes reliable only when successful reasoning is converted into deterministic artifacts, with human oversight, success-path capture, and skill stores turning probabilistic outputs into repeatable workflows, according to Kong. That shift matters because access, auditability, and control must be designed into agentic systems before they are trusted in production.


At a glance

What this is: This is Kong's analysis of how artifact-driven architecture turns probabilistic AI behaviour into repeatable, auditable execution.

Why it matters: It matters to IAM practitioners because the same governance pressure that shapes NHI and human access control now extends to agentic workflows that need validation, traceability, and lifecycle control.

👉 Read Kong's analysis of deterministic execution for agentic AI


Context

Agentic AI reliability breaks when organisations try to manage runtime behaviour with prompts alone. In identity terms, the programme assumes a stable execution path that can be reviewed, approved, and reused, but probabilistic systems do not behave that way unless their actions are converted into governed artifacts.

For IAM, NHI, and AI governance teams, the real issue is not whether the model sounds correct. It is whether the system can prove what happened, who or what executed it, and whether the same action will be repeatable under the same conditions.

Kong frames this as an architecture problem rather than a model-quality problem, and that is the right lens. Once AI can act across tools and data sources, governance has to move from checking outputs to governing execution itself.


Key questions

Q: How should security teams govern reusable AI workflows in production?

A: Security teams should treat reusable AI workflows like privileged machine actions. That means validating the first successful run, storing the approved execution path as an artifact, assigning ownership, and enforcing review and retirement rules. If the workflow can call tools or touch data, it needs the same lifecycle discipline as other governed non-human access.

Q: Why do deterministic AI controls matter for IAM and NHI programmes?

A: Deterministic controls matter because prompts alone do not guarantee repeatable behaviour. IAM and NHI programmes need stable execution paths, clear ownership, and auditable evidence when AI systems act across tools and data sources. Without that, governance can review outputs but cannot reliably govern the action that produced them.

Q: What do teams get wrong about human-in-the-loop AI governance?

A: Teams often treat human review as a final quality check instead of a gate for reuse. In practice, the reviewer is deciding whether a successful path is safe enough to become a deterministic artifact. That distinction matters because once a workflow is stored, it becomes part of the control environment.

Q: How do organisations know if an AI skill store is safe to use?

A: An AI skill store is safe when every stored workflow has a named owner, validated input conditions, version control, and a retirement path. If entries are reused without those controls, the store becomes a hidden execution layer rather than a governed one, which undermines auditability and change management.


Technical breakdown

Why prompts cannot deliver deterministic execution

Prompts influence model output, but they do not create a stable control plane. A generative model can choose different reasoning paths, different tool sequences, and different edge-case handling from one run to the next. Deterministic execution requires the opposite: fixed logic, validated code, and explicit guardrails around when an action is allowed to run. In practice, that means the model becomes the decision router while the critical workflow lives in code or an approved artifact. This is the same architectural shift that separates experimentation from production readiness in identity-adjacent systems.

Practical implication: treat prompts as discovery tools, not governance controls, and move repeatable AI actions into validated execution artifacts.

Human-in-the-loop as a governance checkpoint

Human review in mature agentic systems is not just quality assurance. It is the point at which a probabilistic success path is validated before it becomes a durable operational artifact. Kong's framing is that reviewers catch ambiguity, policy conflicts, and behavioural drift before the workflow is encoded for reuse. That matters because once an AI action is stored as a reusable skill or script, the system stops improvising and starts executing with much higher consistency. Governance therefore shifts from judging one answer to approving an execution pattern.

Practical implication: define human approval gates for first-run success paths before they enter reusable memory or production workflow stores.

Skill stores and artifact-driven AI governance

A skill store is effectively a controlled library of validated execution paths. Instead of asking the model to rediscover how to solve a recurring task, the architecture retrieves a known-good artifact and executes it under known conditions. That improves reliability and auditability, but it also creates a governance obligation: every stored artifact becomes a living control surface that needs ownership, versioning, and retirement rules. For identity teams, this looks a lot like lifecycle management for machine actions, not just for accounts or credentials.

Practical implication: assign ownership, review cadence, and offboarding rules to every reusable AI artifact just as you would to privileged machine access.


NHI Mgmt Group analysis

Deterministic execution is an identity governance problem, not just an AI engineering problem. Once a generative system can call tools, write artifacts, and reuse prior success paths, the core question becomes who governs the action boundary. That boundary is familiar to IAM teams because it looks like privileged execution, only now the executor is probabilistic unless it is forced into deterministic control structures. The implication is that agentic AI cannot be treated as a pure application layer concern.

Artifact-driven AI creates a new lifecycle class for machine behaviour. The same logic that governs NHI provisioning, review, rotation, and offboarding now applies to executable AI artifacts. A reusable script, workflow, or skill store entry is not just code. It is governed machine behaviour that can outlive the context in which it was validated. Practitioners should therefore think in terms of lifecycle ownership for AI actions, not only for credentials.

Success-path capture is the named concept this architecture introduces. It describes the moment when a one-time good outcome is captured, validated, and hardened into repeatable execution. That is useful because it collapses improvisation, but it also concentrates trust into the artifact itself. The implication is that governance moves from supervising model output to supervising the provenance and retirement of stored execution paths.

Human oversight remains the trust anchor, but its role changes from review to authorisation of reuse. In a mature AI architecture, the reviewer is not trying to re-judge every output. The reviewer is deciding whether a behaviour is stable enough to become deterministic infrastructure. That is a different governance decision from traditional model oversight, and it requires clearer criteria for acceptance, version drift, and exception handling.

Agentic AI will force IAM, NHI, and application security to converge around runtime control. The article points to a broader market shift where the security question is no longer only who can authenticate, but what action paths can be invoked and reused by non-human systems. That convergence should push practitioners toward shared control planes for identity, policy, and execution evidence. The field will need it because separate governance silos will not keep up.

From our research:

  • 85% of organisations lack full visibility into third-party vendors connected via OAuth apps, according to The State of Non-Human Identity Security.
  • The same research found that only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs.
  • If you are maturing AI governance, the next step is to pair artifact control with the NHI Lifecycle Management Guide so reusable machine actions do not outlive their approval basis.

What this signals

Success-path capture is likely to become the practical bridge between AI experimentation and governed production use. The organisations that get this right will not simply write better prompts, they will define which AI behaviours can be hardened into reusable execution and which must remain one-off reasoning events.

With 85% of organisations lacking full visibility into third-party vendors connected via OAuth apps, the broader identity lesson is clear: once non-human systems act across tools and delegated access, visibility has to extend beyond accounts into execution artefacts and reuse boundaries.

For teams building controls around agentic AI, the useful lens is closer to the OWASP Agentic AI Top 10 than to traditional application security alone. Runtime action paths, delegated tool use, and reused workflow memory all become governance objects, not just implementation details.


For practitioners

  • Map which AI actions must become deterministic Inventory recurring AI tasks and separate exploration from repeatable execution. Keep novel reasoning in the model, but move stable workflows into approved code, scripts, or policy-backed artifacts before they are reused.
  • Put human approval before artifact promotion Require a human reviewer to validate the first successful path, then decide whether it can enter a skill store or long-term memory. Use the same approval logic for reusable AI actions that you would use for privileged machine access.
  • Apply lifecycle ownership to reusable AI artifacts Name owners, define review cadence, and set retirement rules for each executable AI artifact. Treat stale workflows as governed assets that can drift, just like over-permissioned service accounts or abandoned automation jobs.
  • Log the full execution path, not just the answer Capture tool calls, intermediate steps, inputs, and dependencies so you can audit why a workflow succeeded. Without that record, you cannot prove whether the same action is safe to rerun under similar conditions.

Key takeaways

  • Generative AI becomes governable only when repeatable actions are turned into deterministic artifacts with clear ownership.
  • Human review remains essential, but its job is to approve reuse of a success path, not just score model output quality.
  • IAM and NHI teams should extend lifecycle discipline to AI execution artifacts because reusable machine behaviour becomes part of the control environment.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

FrameworkControl / ReferenceRelevance
OWASP Agentic AI Top 10Agentic workflows and tool use create runtime identity and action risks.
NIST AI RMFGovernance and oversight of AI behaviour are central to this article.
OWASP Non-Human Identity Top 10NHI-03Reusable AI artifacts behave like governed machine identities and need lifecycle control.

Treat reusable AI execution paths as NHI assets and enforce review, ownership, and retirement.


Key terms

  • Deterministic execution: Deterministic execution is the practice of making a system perform the same governed action when the same conditions are met. In AI environments, this usually means moving a validated workflow out of freeform model output and into approved code, rules, or artifacts that can be tested, logged, and reused consistently.
  • Success path capture: Success path capture is the process of recording a successful AI workflow and turning it into a reusable, validated artifact. It includes the reasoning sequence, tool calls, dependencies, and constraints, so the organisation can audit and rerun the workflow without asking the model to improvise again.
  • Skill store: A skill store is a controlled library of validated execution artifacts that an AI system can retrieve when a task matches a known pattern. It reduces repeat planning, but it also creates governance obligations for ownership, versioning, review, and retirement because stored behaviours become part of the operating control surface.
  • Artifact-driven architecture: An artifact-driven architecture is a design pattern in which successful AI behaviour is converted into executable, documented, and governed building blocks. The model remains useful for novel reasoning, but recurring tasks are handed off to stable artifacts so the organisation can control reliability, auditability, and change management.

Deepen your knowledge

NHI governance, agentic AI identity, and machine identity security are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.

This post draws on content published by Kong: Moving from Probabilistic Reasoning to Deterministic Execution. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-06-24.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org