TL;DR: Ciroos says its multi-agent AI SRE system can identify root cause, collect evidence, and generate remediation steps before humans join the incident, while enterprises are already deploying it in production, according to WorkOS. That shifts the governance problem from observability volume to approval boundaries, evidence quality, and delegated action control.
At a glance
What this is: This is a WorkOS interview about Ciroos building AI SRE agents that investigate incidents and propose fixes, with human approval still required for action.
Why it matters: It matters because incident response, CI/CD governance, and NHI controls now have to account for AI systems that can inspect production data, correlate causes, and prepare remediation work across human, machine, and autonomous operating models.
👉 Read WorkOS's interview on Ciroos building AI SRE agents that fix incidents
Context
The core problem is not lack of telemetry. It is that mean time to repair stays high even when teams have logs, dashboards, and observability platforms, because the work still depends on humans stitching evidence together under pressure. In identity terms, that makes incident response a control problem as much as an engineering problem, especially once AI systems are allowed into production workflows.
AI SRE agents change the operational shape of that problem. Instead of waiting for an engineer to interpret signals manually, a multi-agent system can correlate evidence across network, cloud, security, application, and Kubernetes contexts, then propose a next action. That creates a new governance question for IAM, NHI, and platform teams: what access should an AI system have when its job is to investigate before a human arrives?
Key questions
Q: How should teams govern AI SRE agents that investigate incidents?
A: Start by separating investigative access from remediation authority. AI SRE agents should be able to collect evidence, correlate signals, and draft recommendations without being able to change production systems. Human approval should remain mandatory for any action that modifies state, and every proposed fix should be traceable to specific evidence.
Q: Why do AI incident response agents create new IAM risk?
A: They turn observability into a privileged workflow. Once an agent can read telemetry, inspect code, and prepare fixes, its identity is no longer passive. The risk is not just data exposure. It is delegated decision-making that can cross into change authority unless access, approval, and logging are tightly separated.
Q: What breaks when an AI SRE agent can both diagnose and act?
A: The boundary between detection and remediation collapses. A single identity can observe a problem, decide on a response, and carry out the change, which makes it harder to prove who authorised what and why. That is where accountability, rollback, and blast-radius control become much harder to defend.
Q: Should organisations let AI agents move from read-only to autopilot?
A: Only after they can prove that the agent’s actions are bounded, reversible, and fully auditable. The main decision is not whether the model is accurate enough. It is whether the organisation can constrain what the agent may change, verify why it changed it, and recover safely if the change was wrong.
Technical breakdown
Multi-agent incident triage and evidence correlation
A multi-agent SRE design breaks the job into specialist functions. One agent can inspect application signals, another can review cloud configuration, another can assess Kubernetes events, and another can synthesise the likely root cause. The architectural advantage is locality: the agents work where the data already exists instead of pulling everything into a single control plane. That matters because incident diagnosis is often limited by context switching, not raw data volume. If the system can gather evidence quickly, it can shorten the time between alert and credible hypothesis. The trade-off is that each agent becomes an identity-bearing executor with scoped access to sensitive telemetry and operational systems.
Practical implication: define separate access scopes for each investigative agent and log every evidence access path.
Read-only access before remediation authority
The article describes a progression from read-only access to later autopilot behaviour. That is a classic trust-building model, but it also creates a governance boundary: investigation access is not the same as remediation access. Read-only rights let an AI system inspect code, pipelines, and incident signals, while write access lets it alter configurations, create pull requests, or trigger operational change. Those are different risk classes. If the same identity can observe and act, the organisation loses a clean separation between diagnosis and intervention. In NHI terms, the access token becomes more than a sensor credential the moment it can influence state.
Practical implication: keep investigative credentials separate from any credential that can modify production state or generate deployable changes.
Human approval gates as the control boundary
The article makes clear that human-in-the-loop is still the enterprise default. That means the system is not autonomous in the strict sense because execution still depends on human approval gates. This matters for identity governance because approval gates are not a formality. They are the point where responsibility, liability, and rollback decisions remain human-owned. When an AI system prepares a remediation PR or recommends a fix, the technical quality of the recommendation is not enough. What matters is whether the organisation can review the evidence, validate the proposed action, and reject it safely if the diagnosis is wrong.
Practical implication: require explicit approval, evidence review, and rollback ownership before any agent-generated remediation is applied.
Breaches seen in the wild
- Moltbook AI agent keys breach — Moltbook breach exposed 1.5M AI agent keys.
- AI LLM hijack breach — attackers used stolen AWS access keys to hijack Anthropic LLM models on Bedrock.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
AI SRE systems expose a governance gap between observation and action. The article shows a system that can inspect telemetry, identify a likely root cause, and draft remediation before a human joins the incident. That means the old assumption that investigation is passive and intervention is deliberate no longer holds. Practitioners should treat agentic incident tooling as a privileged executor with bounded operational power, not as an enhanced dashboard.
Read-only access is a useful starting point, but it is not the end state. The article describes a trust ramp from read-only access toward autopilot. That ramp is sensible operationally, yet it also reveals how quickly an investigative identity can become a change-making identity. The field needs to distinguish evidence collection rights from change authority more sharply, because the same workflow can cross that line without a major architectural shift.
Autonomous remediation would collapse the assumption that human-paced review is always available. Access review processes were designed for access that persists long enough to be observed, evaluated, and certified. That assumption fails when an AI SRE can inspect, decide, and act inside an incident window faster than the review cycle can register. The implication is that governance for AI operations cannot rely on retrospective review alone.
Identity context now matters inside observability and incident response tooling. The article reflects a broader shift in which operational platforms are no longer neutral sinks for telemetry. They are becoming action-bearing systems that need lifecycle control, privilege separation, and auditable delegation. That aligns incident response with NHI governance rather than treating it as a separate domain. Practitioners should map which investigative identities can only read, which can propose, and which can alter state.
The market is moving toward delegated operations, not just better detection. The real change is not that AI can find anomalies faster. It is that enterprises are willing to use AI to close the repair gap where skilled operators are scarce. That pushes IAM teams to think about incident tooling as part of the identity plane, because operational trust now depends on what the system is authorised to do when no engineer is watching closely.
From our research:
- Organisations maintain an average of 6 distinct secrets manager instances, creating fragmentation that undermines centralised control, according to The State of Secrets in AppSec.
- Only 44% of developers are reported to follow security best practices for secrets management, exposing a significant developer behaviour gap.
- A useful next step is to compare this operational fragmentation with the AI LLM hijack breach, where stolen credentials turned access into direct model abuse.
What this signals
Identity for AI operations is becoming a governance layer inside observability. Once an AI system can inspect incidents and propose fixes, teams need to decide whether that system is a viewer, an advisor, or an executor. The practical signal is that observability platforms will increasingly need identity boundaries that look more like NHI governance than classic monitoring controls, especially when the same workflow touches CI/CD and production metadata.
With 6 distinct secrets manager instances on average, fragmentation is already a control problem in many programmes, according to The State of Secrets in AppSec. Add AI SRE agents into that environment and the question becomes which identity can touch which secrets source, and under what review path. That pressure will push teams toward cleaner delegation, tighter approval boundaries, and more explicit inventory of operational identities.
Operational AI will force teams to separate recommendation quality from authorisation quality. A system can be excellent at finding root cause and still be unsafe to let it touch live infrastructure. Teams should watch for workflows where a helpful recommendation quietly becomes an approved change, because that is where governance drift starts. The NIST Cybersecurity Framework 2.0 remains a useful reference point for response and recovery discipline.
For practitioners
- Separate investigative and remediation identities Give AI SRE agents distinct credentials for evidence gathering and for any action that can modify production state. Keep read-only telemetry access isolated from pull request creation, configuration changes, and deployment triggers.
- Require evidence-backed approval for every proposed fix Treat agent output as a recommendation until a human verifies the evidence chain, root-cause logic, and rollback path. Record who approved the change and which signals justified the decision.
- Limit agent access to the smallest useful operational scope Scope each agent to the domain it actually investigates, such as Kubernetes, cloud, or application logs, and deny lateral access to unrelated systems unless the investigation explicitly requires it.
- Track when agent recommendations become state-changing actions Monitor the point where a proposed remediation turns into a PR, ticket, or deployable change. That transition is where incident tooling stops being advisory and starts carrying governance risk.
Key takeaways
- AI SRE agents are changing incident response from a human-only diagnostic workflow into a delegated identity problem.
- The scale issue is not just faster triage. It is the collapse of the boundary between evidence gathering and state-changing action.
- Teams should keep read access, approval, and remediation authority separate if they want AI-assisted operations to stay governable.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 and OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | AG-03 | Agentic systems need bounded action and approval boundaries for remediation workflows. |
| OWASP Non-Human Identity Top 10 | NHI-05 | AI SRE agents operate as non-human identities with scoped operational access. |
| NIST CSF 2.0 | RS.MI | The article is about incident response and remediation workflows. |
Map AI-assisted response workflows to RS.MI and ensure every change remains traceable and reversible.
Key terms
- AI Sre Agent: An AI SRE agent is a software identity that helps investigate incidents, correlate signals, and recommend or prepare operational fixes. In practice, it behaves like a non-human operator with scoped access, so its permissions, logging, and approval boundaries must be governed as identity controls, not just as tooling settings.
- Delegated Remediation: Delegated remediation is the transfer of incident-response action from a human operator to a software identity that can propose or execute fixes. The key governance issue is not the quality of the recommendation, but whether the delegated actor has the authority to change state, and whether that authority is reversible and auditable.
- Investigation Identity: An investigation identity is a credential or account used to inspect telemetry, read code, and assemble evidence during an incident. It should be treated as a narrow, read-focused non-human identity, because once it can write, deploy, or trigger changes, the identity has crossed into a different risk class.
- Approval Gate: An approval gate is the human or policy checkpoint that must be crossed before an AI system can take a state-changing action. For identity governance, the gate is the point where responsibility stays with the organisation, and it should be enforced separately from the system’s ability to observe or recommend.
Deepen your knowledge
AI SRE governance, delegated incident response, and identity boundaries are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are building controls for AI systems that can inspect or recommend operational fixes, it is worth exploring.
This post draws on content published by WorkOS: Ciroos is building AI SREs that can actually fix things. Read the original.
Published by the NHIMG editorial team on 2026-01-14.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org