TL;DR: Anticimex blocked more than 40,000 malicious emails that Microsoft missed between February and April, avoiding an estimated $169,000 in losses through AI Security Mailbox automation and graymail filtering, according to Abnormal AI. The bigger lesson is that email defence built for older threat volumes and patterns is no longer keeping pace with AI-accelerated attacks.
At a glance
What this is: This is an Abnormal AI case study showing that AI-driven email attacks can bypass native Microsoft protections at scale while behavioural filtering catches far more malicious messages.
Why it matters: It matters because email remains an identity entry point, and missed messages can turn into credential theft, account compromise, or wider identity risk across human and non-human programmes.
👉 Read Abnormal AI's analysis of AI-powered phishing and email defence gaps
Context
AI-powered email attacks now matter because native controls are often tuned for known patterns, not for the speed and personalisation of modern phishing and business email compromise. When defenders depend on a single platform layer to catch malicious messages, they inherit that platform's blind spots as part of their identity risk.
For IAM teams, email is not just a communication channel. It is a high-trust entry point into human identity, privileged workflows, and sometimes downstream non-human access when attackers use a compromised mailbox to reset passwords, approve requests, or pivot into SaaS and cloud systems.
Key questions
Q: How should security teams improve email defence against AI-generated phishing?
A: They should combine native email security with behavioural detection, mailbox automation, and user workflow controls. The goal is to catch messages that look legitimate but are designed to manipulate trust, approvals, or credential handling. Teams should also review how mailbox events feed into password resets, help desk actions, and SaaS access workflows.
Q: Why do AI-assisted email attacks create identity risk beyond the inbox?
A: Because email is a control plane for human identity. Attackers use mail to trigger password resets, approve fraudulent requests, and impersonate vendors or colleagues, which can lead to account compromise and downstream access to cloud or SaaS systems. The inbox is often the first step in a broader identity attack path.
Q: What breaks when organisations rely only on native Microsoft protections?
A: They miss attacks that are too personalised, too dynamic, or too behaviorally subtle for static filtering to catch. That creates a blind spot where malicious messages reach users, and users become the last line of defence. Over time, that increases both breach probability and operational noise.
Q: How do teams know whether graymail filtering is improving security?
A: They should look for fewer malicious messages reaching users, fewer user-reported phishing events, and reduced false trust in routine mail. If inbox noise drops while malicious-message detection rises, users can focus on the few messages that actually matter. That is a measurable security gain, not just a productivity win.
Technical breakdown
Why native email protections miss AI-accelerated attacks
Native email security controls are usually pattern-driven. They look for known signatures, reputation signals, suspicious URLs, or obvious malware indicators, but AI-assisted phishing often avoids those triggers by generating convincing, context-aware language at scale. That creates a detection gap where the message looks legitimate enough to pass automated checks but still manipulates the recipient into action. In practice, the failure is not that the control is absent, but that its detection model is too static for adaptive content and high-volume targeting.
Practical implication: pair native filters with behavioural detection that scores sender intent, thread context, and anomalous message patterns.
How mailbox automation changes the response model
Mailbox automation shifts defence from inbox-by-inbox triage to policy-driven handling of suspicious and low-value messages. By automating quarantine, cleanup, and graymail suppression, teams reduce the amount of content that reaches users and shrink the opportunity for social engineering. This matters because email security failures are often operational, not just technical. If defenders can only react after users report suspicious mail, the control loop is already too slow for AI-accelerated campaigns.
Practical implication: define quarantine and cleanup workflows that remove repeated attacker contact before users become the detection layer.
Why graymail filtering affects identity risk, not just inbox noise
Graymail is often treated as a productivity problem, but it has identity consequences. The more irrelevant or semi-trusted mail users receive, the harder it becomes to spot malicious messages, and the easier it is for attackers to blend into routine correspondence. In large organisations, that noise also conditions users to click, approve, or ignore messages without close scrutiny. Filtering graymail therefore improves not only usability but also the signal-to-noise ratio that human identity protection depends on.
Practical implication: treat graymail reduction as part of identity attack surface reduction, not as a pure messaging hygiene task.
Threat narrative
Attacker objective: The attacker wants to convert a believable email conversation into account access, privileged workflow abuse, or financial loss.
- Entry begins with AI-generated phishing or impersonation messages that land in user inboxes because native filters do not reliably flag them as malicious.
- Credential theft or workflow abuse follows when a recipient engages with the message, shares secrets, or approves a fraudulent request that gives the attacker account access.
- Impact occurs when the attacker uses mailbox access to extend the compromise into identity resets, internal fraud, or broader cloud and SaaS abuse.
Breaches seen in the wild
- Cisco DevHub NHI breach — IntelBroker exploited exposed Cisco credentials, API tokens and keys in DevHub.
- ASP.NET machine keys RCE attack — 3,000+ exposed ASP.NET machine keys enabled remote code execution.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
AI-powered email defence is now an identity control, not just a messaging control. When malicious mail gets through, the first thing at risk is human identity, because the inbox is where password resets, approvals, and vendor conversations intersect. Anticimex's result shows that native filtering alone may leave too much exposure in place. Practitioners should treat email detection quality as part of identity assurance, not an adjacent convenience layer.
Behavioral filtering exposes the gap between legacy controls and adaptive attack content. The issue is not simply more spam, but more convincing abuse that evades static checks and exploits normal work patterns. AI-generated attacks scale personalisation faster than review processes can adapt, which means the control problem is behavioural as much as it is technical. Security teams should expect the old signature-led model to underperform when the attacker can continuously rewrite the payload.
Mail security now affects downstream NHI and SaaS governance. A compromised mailbox is often a launch point for secret resets, delegated access, and malicious approvals that affect service accounts and cloud workflows. That makes email defence relevant to NHI governance even when the article is framed around Microsoft protection gaps. The implication is that identity teams need one threat view across human inboxes and the non-human systems they can unlock.
Graymail filtering is a signal management control, not a comfort feature. When users are buried in low-value mail, they lose the ability to spot the one message that matters. This article reinforces a wider governance pattern: attack success often depends on degrading attention before it depends on bypassing controls. Practitioners should view inbox noise reduction as part of identity risk reduction, especially where approvals and exceptions still happen by email.
From our research:
- Only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, compared to nearly 1 in 4 for securing human identities, according to The State of Non-Human Identity Security.
- From our research: 85% of organisations lack full visibility into third-party vendors connected via OAuth apps, according to The State of Non-Human Identity Security.
- Forward pivot: The NHI lifecycle problem becomes more urgent when mailbox compromise can trigger secret resets and delegated access, which is why The 52 NHI breaches Report remains a useful next read.
What this signals
The direction of travel is clear. Email security is converging with identity governance because the inbox is where attackers still convert trust into access, and that makes mailbox telemetry relevant to IAM, PAM, and service-account oversight as well as user protection.
Mailbox trust debt: the longer organisations tolerate noisy inboxes and weak behavioural controls, the more they accumulate unseen exposure in approval chains, help desk workflows, and delegated access paths. Teams should expect security metrics to move from message blocking alone to identity outcomes, including password reset abuse and workflow fraud.
For IAM and security architecture teams, the practical question is whether mail controls are feeding the right downstream response logic. If suspicious mail does not trigger identity investigation, then the organisation has a detection tool, not an assurance model.
For practitioners
- Strengthen behavioural email detection Add controls that score intent, thread context, and anomalous sending patterns rather than relying only on signatures and known bad indicators.
- Reduce inbox noise before it reaches users Use graymail filtering and automated triage to remove recurring low-value mail so security teams are not asking users to act as the primary detection layer.
- Tie mailbox alerts to identity workflows Route suspicious email events into password reset, approval, and help desk monitoring so mailbox abuse cannot quietly trigger downstream identity changes.
- Test the gap between native and behavioural controls Run controlled simulations that compare what Microsoft-style native protections catch versus what behavioural AI and quarantine workflows stop in practice.
Key takeaways
- AI-generated phishing turns email security into an identity issue because compromised inboxes often become the launch point for account abuse.
- Anticimex's results show that behavioural filtering can catch large volumes of malicious mail that native controls miss, which changes the economics of email defence.
- Teams should measure whether inbox controls reduce downstream identity events, not just whether they block more messages.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
NIST CSF 2.0, NIST Zero Trust (SP 800-207) and NIST SP 800-63 set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST CSF 2.0 | PR.DS-5 | Email abuse leads to data loss and identity compromise. |
| NIST Zero Trust (SP 800-207) | PR.AC-4 | Suspicious mail can trigger access changes and privileged actions. |
| NIST SP 800-63 | Mailbox compromise often precedes human identity takeover. |
Map mailbox protection to PR.DS-5 and reduce exposure from malicious and low-value messages.
Key terms
- Behavioral Email Detection: A detection approach that evaluates message intent, conversation context, sender behaviour, and anomaly patterns instead of relying only on static indicators. It is designed to catch persuasive attacks that look legitimate at delivery time but become suspicious when judged against normal communication patterns.
- Graymail: Low-value, legitimate mail that is not overtly malicious but still clutters inboxes and conditions users to ignore or skim messages. In identity security, graymail matters because excessive noise lowers vigilance and makes phishing, impersonation, and approval abuse easier to hide.
- Mailbox Automation: Automated handling of suspicious, repetitive, or low-value email events, including quarantine, cleanup, and filtering actions. Used correctly, it shortens exposure windows and reduces dependence on end users as the final control in a phishing chain.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.
This post draws on content published by Abnormal AI: Key Insights from Stockholm on AI-powered email attacks and native protection gaps. Read the original.
Published by the NHIMG editorial team on 2026-06-08.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org