TL;DR: Attackers are bypassing chatbot guardrails, SOC teams are using automation to ease alert fatigue, and AI is making social engineering faster and harder to detect, according to Abnormal AI. The deeper issue is that security programmes now have to govern AI-mediated deception as a live operational risk, not a future scenario.
At a glance
What this is: A season recap on how AI is affecting both offensive and defensive security operations, with a focus on chatbot abuse, SOC automation, and AI-driven social engineering.
Why it matters: It matters because identity, access, and response controls now have to account for AI-assisted deception and automation across human, NHI, and emerging agentic workflows.
👉 Read Abnormal AI’s season recap on AI chatbots, SOC automation, and social engineering
Context
AI is changing cybersecurity by compressing the time it takes to create deception, analyse alerts, and pressure defenders into faster decisions. For identity and security teams, the practical question is no longer whether AI can assist attacks or operations, but how governance changes when machines shape both sides of the interaction.
This matters across human IAM, NHI governance, and emerging agentic AI oversight because the same environment can now host legitimate AI tools, automated SOC workflows, and adversarial misuse of chatbots. The governance gap is not limited to one control family. It spans authorisation, monitoring, escalation paths, and the trust assumptions embedded in human and machine-assisted processes.
Key questions
Q: How should security teams govern AI chatbots that can be manipulated by attackers?
A: Security teams should treat chatbot governance as an access and workflow problem, not only a content-safety problem. Identify what the chatbot can reach, what actions it can trigger, and where outputs flow into internal systems. Then limit tool access, log high-risk prompts, and require human review for requests that could expose data or alter privileges.
Q: Why does AI make social engineering harder to stop?
A: AI makes social engineering harder to stop because it lowers the cost of personalised deception. Attackers can iterate messages quickly, match tone to the target, and adapt to responses in ways that look routine. That reduces the warning signs people and filters traditionally relied on, so validation has to move closer to identity and workflow controls.
Q: What should SOC teams automate without losing control?
A: SOC teams should automate repetitive enrichment, correlation, and routing, but keep decisions that change incident status, evidence integrity, or containment authority under human oversight. The right boundary is where automation improves speed without becoming the final decision-maker for ambiguous events.
Q: How do organisations reduce risk from AI-assisted impersonation?
A: Organisations should require stronger verification for requests that move money, reset access, or transfer sensitive data. Use secondary channels, callback procedures, and approval workflows that do not depend on the same communication path the attacker may already control.
Technical breakdown
How attackers bypass chatbot guardrails
AI chatbot guardrails are policy layers, not proof of harmless behaviour. They can be bypassed through prompt manipulation, context steering, and re-framing requests so the system discloses or assists with actions that appear legitimate in isolation. The technical problem is that the model may still operate within its allowed interface while producing outputs that support malicious workflows. That makes the platform itself part of the attack surface, especially when it is connected to tools, memory, or downstream automation. Guardrails reduce obvious abuse, but they do not eliminate abuse that is disguised as normal interaction.
Practical implication: treat chatbot policy as one control layer and assess whether connected tools and outputs create a usable attack path.
SOC automation and alert fatigue
Security operations automation is designed to triage, enrich, and prioritise alerts so analysts can focus on higher-value decisions. In practice, the value comes from reducing repetitive work, not replacing accountability. Automation becomes risky when it is allowed to close the loop on weak signals without strong review criteria, because false positives and false negatives can both scale quickly. The operational challenge is not simply speed. It is ensuring that automated handling does not obscure the evidence needed for incident validation, escalation, and post-incident review. Efficiency gains are real, but only when decision boundaries remain clear.
Practical implication: define which alert actions automation may take and where human review must remain mandatory.
AI-driven social engineering and trust manipulation
AI changes social engineering by making language, timing, and context adaptation much cheaper for attackers. That matters because deception is often about exploiting trust cues, not just technical mistakes. AI can generate more believable pretexts, tailor tone to the recipient, and iterate quickly based on response patterns. The defensive issue is that traditional awareness training assumes human-crafted mistakes that are easier to spot over time. AI-assisted deception can be more coherent and more scalable, which raises the baseline difficulty of verifying intent across email, chat, voice, and workflow channels.
Practical implication: strengthen identity verification and out-of-band validation for requests that carry financial, access, or data-transfer risk.
NHI Mgmt Group analysis
AI-assisted deception is now an identity problem, not just a content problem. When attackers use legitimate AI platforms to manipulate targets, the control failure is not simply bad wording or spam volume. The real issue is that identity assurance is being tested through machine-generated trust signals that look normal at first glance. Security teams need to recognise that deception can be delivered through the same channels used for legitimate collaboration.
Automation reduces SOC noise, but it also raises governance stakes. AI that cuts alert fatigue only helps if the organisation preserves clear accountability for automated triage, escalation, and suppression decisions. Otherwise, the programme trades analyst time for opaque decision paths. The practical implication is that response governance matters as much as detection quality when automation becomes part of operations.
Supercharged social engineering exposes a trust-assurance gap across human and machine workflows. The same AI systems that help defenders can also increase the realism and speed of phishing, impersonation, and workflow abuse. That means identity programmes have to think beyond authentication events and into the quality of trust decisions made inside email, chat, and service workflows. Practitioners should treat trust validation as a cross-channel control, not a user-training afterthought.
AI security strategy now spans human IAM, NHI governance, and emerging agent oversight. This season’s themes show that the boundary between people, systems, and AI-mediated actions is getting harder to separate operationally. Access decisions, response orchestration, and social manipulation now sit in the same risk landscape. Organisations that still manage these as disconnected domains will miss the compounded effect of AI on both attack speed and defensive automation.
From our research:
- Only 44% of developers are reported to follow security best practices for secrets management, exposing a significant developer behaviour gap, according to The State of Secrets in AppSec.
- The average estimated time to remediate a leaked secret is 27 days, despite 75% of organisations expressing strong confidence in their secrets management capabilities.
- For a broader governance lens, the NHI Lifecycle Management Guide shows how provisioning, rotation, and offboarding controls need to be managed as a lifecycle, not a one-time event.
What this signals
AI-assisted trust decisions now sit alongside secrets, access, and response governance. Abnormal AI’s themes point to a programme design problem: teams are still separating chatbot risk, SOC automation, and social engineering when the adversary can chain them together. That argues for a unified review of how AI touches approvals, workflows, and identity validation across the stack.
For practitioners, the next step is to map where AI changes decision speed, decision quality, and the evidentiary trail that supports response. Controls that work on paper can fail when the request, the assessment, and the action all happen inside one AI-shaped interaction.
For practitioners
- Review chatbot-connected workflows for abuse potential Inventory where legitimate AI chatbots can reach sensitive data, tools, or internal systems, then test those paths for prompt manipulation, context leakage, and unintended action escalation.
- Define strict automation boundaries in the SOC Separate low-risk enrichment from actions that suppress, close, or escalate incidents, and require explicit human approval for decisions that change evidence or containment status.
- Harden verification for high-impact requests Use out-of-band checks for payment changes, access resets, vendor onboarding, and data transfers when the request arrives through email, chat, or AI-mediated channels.
- Map AI trust paths across identity domains Document where human, machine, and AI-assisted actions intersect so you can see which approvals, logs, and reviews depend on assumptions that AI can now undermine.
Key takeaways
- AI is turning deception into an identity and workflow risk that reaches beyond traditional phishing controls.
- Automation can reduce SOC pressure, but only if organisations preserve clear review boundaries for high-impact decisions.
- Practitioners need one governance view of human, machine, and AI-assisted trust paths if they want to keep pace with AI-enabled abuse.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
OWASP Agentic AI Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| OWASP Agentic AI Top 10 | AI chatbot misuse and tool abuse map to agentic application risk patterns. | |
| NIST CSF 2.0 | PR.AC-4 | Access control and authorisation boundaries are central to chatbot and workflow abuse. |
| NIST AI RMF | AI-assisted decisioning needs governance, accountability, and monitoring. |
Restrict AI tool access to least privilege and review any action that changes access or data exposure.
Key terms
- AI-assisted deception: The use of AI systems to create or adapt deceptive content at scale. In practice, it shortens the time needed to craft believable lures, mimic tone, and iterate against targets, which makes social engineering more persistent and harder to recognise across channels.
- Chatbot guardrails: Policy and safety controls intended to limit what an AI chatbot will reveal or do. They help reduce obvious misuse, but they do not remove risk when the chatbot is connected to tools, data, or downstream workflows that can still be influenced through normal-looking interactions.
- SOC automation: The use of automated workflows to triage, enrich, route, or suppress security alerts. It improves analyst efficiency when boundaries are clear, but it becomes a governance issue when automation can make final decisions that affect evidence, containment, or incident status.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity security are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or programme maturity, it is worth exploring.
This post draws on content published by Abnormal AI: Season 5 of The Convergence of AI + Cybersecurity. Read the original.
Published by the NHIMG editorial team on 2025-11-17.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org