AI phishing now bypasses signature-based email defenses

By NHI Mgmt Group Editorial TeamPublished 2026-05-14Domain: Governance & RiskSource: Abnormal AI

TL;DR: AI now compresses target research, infrastructure build, and personalised phishing into minutes, while signature-based email security struggles against messages with no bad links, attachments, or known indicators, according to Abnormal AI. The governance problem is that controls built to detect malicious artefacts no longer match attacks that look like routine business requests and can even use real-time voice and video spoofing.

At a glance

What this is: This is an analysis of how AI is accelerating phishing and deepfake-enabled social engineering while eroding the value of signature-based email defense.

Why it matters: It matters because IAM, PAM, and identity governance teams now have to assume that verification cues, not just message content, are part of the attack surface across human and machine-driven workflows.

By the numbers:

One team cut phishing triage from 20 to 40 hours per week to 4 to 5 hours after deploying behavioral AI on top of an existing SEG.
AI now lets attackers compress 40 hours of target research, infrastructure build, and personalized email crafting into minutes on a laptop.

👉 Read Abnormal AI's analysis of AI-generated phishing and deepfake risk

Context

AI phishing has changed the unit economics of social engineering. What used to require hours of research and careful manual tailoring can now be assembled in minutes, which means the attacker can personalise at scale without leaving the usual email artefacts that legacy controls expect to inspect.

For identity teams, the problem is no longer only message filtering. It is the collapse of trust signals across human approval paths, where a believable sender, a routine request, and even a convincing verification call can all be simulated well enough to defeat normal judgement.

The article frames this as a mismatch between the threat model and the control model. That mismatch is typical of organisations that still treat email security as a content-scanning problem rather than an identity and behaviour problem.

Key questions

Q: How should security teams defend against AI-generated phishing that has no malicious links or attachments?

A: They should stop relying on content-only detection and add behavioural controls that evaluate sender history, request context, and workflow fit. AI-generated phishing often looks technically clean, so the control has to ask whether the message is normal for that relationship and business process, not whether it contains a known payload.

Q: Why do real-time deepfakes make callback verification less reliable?

A: Because the attacker can imitate the trusted person in the same decision window that the victim uses to verify the request. If voice or video can be generated convincingly on demand, then the callback no longer proves identity on its own. It becomes one signal among several, not a final assurance step.

Q: What do organisations get wrong about phishing triage when AI is involved?

A: They often treat triage as a manual review problem instead of a control-design problem. If analysts must inspect large volumes of believable mail, the programme is already absorbing the attacker's scale advantage. Effective triage should prioritise behavioural scoring and high-risk workflow protection, not only inbox review.

Q: How do teams know whether their email security controls are keeping up with AI phishing?

A: Look for declining manual triage time, lower reliance on message signatures, and more accurate detection of anomalous sender behaviour. If the team still depends on suspicious links or human callback checks as the main defence, the control model is lagging behind the attack model.

Technical breakdown

Why AI phishing evades signature-based email security

Signature-based email security products look for known-bad indicators such as malicious links, weaponised attachments, and recognisable payload patterns. AI-generated phishing often contains none of those markers. Instead, the attacker produces clean-looking language, context-aware phrasing, and sender-relevant details that pass content inspection because the message is technically ordinary. This shifts detection away from what the email contains and toward whether the message matches the expected behavioural pattern for that sender, recipient, and business process. Practical implication: security teams need controls that evaluate context and behaviour, not only message artefacts.

Practical implication: supplement SEG controls with behavioural detection that can score message legitimacy without relying on malicious indicators.

How real-time deepfakes weaken voice verification and callback checks

Deepfake audio and video remove one of the most common human fallback controls: calling or video-chatting a person to confirm a request. If the attacker can imitate a known executive, vendor, or colleague in real time, then verification no longer proves identity in the way teams assume it does. The weakness is not only technical fidelity but timing. The impersonation can occur inside the same decision window as the request, before suspicion has a chance to trigger escalation. Practical implication: teams should treat voice and video as weak assurance channels and reserve them for corroboration, not sole approval.

Practical implication: redesign approval paths so high-risk requests require independent corroboration beyond a live voice or video check.

Behavioral AI as an identity and communication baseline

Behavioral AI works by learning what normal looks like for each user, mailbox, or communication pattern, then flagging anomalies instead of waiting for known threat intel. That is a different control philosophy from signature detection. It is closer to identity baselining because the question becomes whether the sender's behaviour, message timing, and request pattern fit the established norm. This makes the approach useful against novel phishing, impersonation, and image-only messages that have no known malware signature. Practical implication: behavioural models should sit beside SEG policy, not after an incident review.

Practical implication: baseline normal communication patterns for privileged users and high-risk workflows so anomalies are visible before approval occurs.

Threat narrative

Attacker objective: The attacker aims to win trust fast enough to trigger credential theft, fraudulent payment, or workflow abuse before human verification catches the deception.

Entry occurs when the attacker uses AI to research the target, build infrastructure, and craft a personalised message that appears to come from a legitimate contact.
Escalation happens when the email or deepfake call bypasses traditional filters because it contains no suspicious links, attachments, or known signatures.
Impact follows when the victim approves a fraudulent request or shares sensitive information, allowing the attacker to extend access into identity and business workflows.

DeepSeek breach — DeepSeek breach exposed 1M+ log lines and sensitive secret keys.
LiteLLM PyPI package breach — LiteLLM PyPI supply chain attack, credentials stolen from users.

Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.

NHI Mgmt Group analysis

Content inspection is no longer a sufficient trust model for email security. The article shows that AI-generated phishing can arrive without malicious links, bad attachments, or known signatures, which means the traditional SEG control model is being asked to solve a problem it was not built for. That is not a tuning issue. It is a control-mismatch issue. Practitioners should treat this as evidence that trust decisions are shifting from content artefacts to behavioural legitimacy.

Real-time impersonation breaks the assumption that a call to verify is a stable fallback. The old governance premise was that a human could interrupt a suspicious request and get a trustworthy confirmation from another human. Deepfake audio and video collapse that premise because the verification channel itself can be simulated inside the same request window. The implication is that approval design now has to assume the verifier channel can be compromised alongside the original message.

Behavioral AI creates an identity boundary around communication, not just content. Normal-versus-anomalous baselining is becoming a governance pattern for human identity flows, especially where privileged users, finance processes, or sensitive approvals are involved. That matters because the trust decision is no longer only whether the message is malicious, but whether the sender, timing, and request shape fit the established identity pattern. Practitioners should evaluate email security as part of broader identity governance, not as a standalone inbox problem.

Phishing triage is becoming an identity operations workload, not just a SOC queue. When a team cuts review time from 20 to 40 hours per week to 4 to 5 hours, the hidden story is governance throughput. Too many organisations still route suspicious mail through manual judgement loops that do not scale against AI-generated volume and realism. The practical conclusion is that identity, SOC, and business approvers need a shared operating model for high-trust requests.

AI-assisted attack speed changes the value of control timing. If a complete attack path can be assembled in minutes, then controls that depend on delayed human inspection are increasingly outpaced. That does not mean humans disappear from the loop. It means the loop must be front-loaded with behavioural controls, approval hardening, and tighter verification design. Practitioners should rethink where the first meaningful control actually occurs.

From our research:
70% of organisations grant AI systems more access than they would give a human employee performing the exact same job, according to the 2026 Infrastructure Identity Survey.
Only 44% of organisations have implemented any policies to manage their AI agents, despite 92% agreeing that governing AI agents is critical to enterprise security, according to the 2026 Infrastructure Identity Survey.
That governance gap points forward to OWASP NHI Top 10, where identity, tool-use, and approval boundaries need to be treated as first-class controls.

What this signals

Behavioral trust will matter more than content trust: the next phase of email defence is about proving that a request fits the sender's normal pattern, not proving that the message contains known malicious code. For teams that still depend on SEG-era filters, the risk is a false sense of coverage.

With 70% of organisations already granting AI systems more access than they would give a human employee performing the exact same job, per the 2026 Infrastructure Identity Survey, the broader lesson is that identity controls are already being stretched by automation-heavy workflows. That pressure will spill into approval processes, mailbox trust, and exception handling.

Security leaders should expect phishing operations to become more identity-aware, not less. The practical response is to align inbox protection, privileged workflow controls, and human verification paths so that one compromised communication channel does not become a complete trust failure.

For practitioners

Shift email defense toward behavioural detection Measure sender, message, and workflow anomalies rather than waiting for malicious links or attachments. Use behavioural baselines for privileged users, finance requests, and external partner workflows so unusual timing or phrasing is flagged before approval.
Harden high-risk approval paths Require independent corroboration for payment, access, and account-change requests. Do not allow a live voice or video call to be the only verification step when the request would alter identity, privilege, or funds.
Treat verification channels as attack surface Assume audio and video can be spoofed in real time and design escalation criteria accordingly. Route sensitive requests to secondary approvers or out-of-band checks that are not dependent on the same communication channel.
Use triage metrics to justify control redesign Track time spent on phishing review, false positives, and manual escalation volume. If analysts are spending dozens of hours each week on inbox review, the control model is consuming human attention instead of reducing risk.

Key takeaways

AI-generated phishing compresses the attacker workflow and removes the obvious indicators that legacy email security was built to catch.
Real-time deepfakes weaken callback verification, which means voice and video can no longer be treated as standalone proof of identity.
Behavioural detection and hardened approval paths are becoming the practical controls that matter most for high-risk identity workflows.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
NIST CSF 2.0	PR.DS-1	Email trust failures affect detection and protection controls across user communications.
NIST Zero Trust (SP 800-207)	PR.AC-4	The article shows identity assurance cannot rest on a single communication channel.
OWASP Non-Human Identity Top 10	NHI-08	AI-driven workflows and verification paths create identity trust risk around non-human execution.

Treat callback verification as one signal and require independent authorization for high-risk requests.

Key terms

Behavioral AI: A detection approach that learns normal patterns for a user, mailbox, or workflow and flags deviations. It does not depend on known malicious signatures, so it can surface novel phishing, impersonation, and abuse patterns that look technically clean but are abnormal for the environment.
Signature-Based Email Security: Email defence that scans for known-bad indicators such as malicious links, attachments, and payload patterns. It is effective against familiar threats but weak against AI-generated messages that are language-perfect, context-aware, and free of obvious malicious artefacts.
Deepfake Verification Risk: The risk that voice or video used to confirm identity can itself be fabricated in real time. In practice, this means callback verification may fail as a standalone assurance method because the attacker can impersonate the trusted person during the same approval window.
Identity-Aware Phishing: Phishing that is tailored to the target's role, relationships, and workflow so the request looks normal rather than obviously malicious. It targets trust decisions directly, which makes identity governance, approval design, and behavioural monitoring more relevant than simple content filtering.

Deepen your knowledge

NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or programme maturity, it is worth exploring.

This post draws on content published by Abnormal AI: AI phishing and deepfake-enabled social engineering analysis. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-14.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org