Subscribe to the Non-Human & AI Identity Journal

How should security teams decide whether to keep a legacy SEG with Microsoft 365?

They should decide based on control independence, not habit or procurement history. If the SEG mostly duplicates spam, malware, and URL inspection already present in Microsoft 365, it is adding cost rather than resilience. Keep only the layer that materially improves detection of identity-based and text-only attacks that native controls miss.

Why This Matters for Security Teams

A legacy SEG should not be kept because it is familiar. The decision should hinge on whether it adds a distinct control plane that Microsoft 365 does not already provide. If it only repeats spam filtering, malware inspection, and basic URL checks, then it is paying for overlap instead of resilience. The real question is whether it catches the identity-driven, text-only, and conversation-based attack paths that now dominate modern email compromise.

That matters because email is no longer just a malware delivery channel. Phishing, OAuth abuse, token theft, and impersonation often arrive as clean text and legitimate-looking links that bypass classic gateway logic. NHIMG research on incidents such as the Microsoft Midnight Blizzard breach and the JetBrains GitHub plugin token exposure shows how attackers increasingly target trust relationships, not just attachments. NIST guidance in the NIST Cybersecurity Framework 2.0 reinforces that controls should be assessed by risk reduction, not by layer count.

In practice, many security teams discover a SEG is redundant only after an identity-based phishing campaign has already bypassed the mailbox and moved into session theft or consent abuse.

How It Works in Practice

The strongest way to decide is to map each SEG control to a specific failure mode in Microsoft 365. If the SEG is not independently improving detection, containment, or investigation, it is not earning its place. Start by separating inbound filtering from identity protection, because those are not the same thing. Microsoft 365 can already handle baseline spam, malware, and some impersonation controls, while a SEG may still add value in advanced sandboxing, URL detonation, policy enforcement, or outbound data controls.

A practical review should ask four questions:

  • Does the SEG detect threats Microsoft 365 misses, especially text-only social engineering and lookalike impersonation?
  • Does it provide independent telemetry for hunting, case enrichment, or forensic correlation?
  • Does it reduce blast radius through quarantine, time-of-click re-evaluation, or outbound protection?
  • Does it materially improve resilience against business email compromise, consent phishing, or token theft?

For identity-centered threats, align the review with NHIMG guidance on NHI exposure and lifecycle control in the Ultimate Guide to NHIs. This is important because many email-driven compromises end in secret reuse, API token theft, or unauthorized app consent rather than obvious malware execution. Microsoft 365 and a SEG can also both miss attacks that use legitimate infrastructure, as shown in NHIMG analysis of the Microsoft Azure OpenAI service breach, where the issue was trust and access, not just message content.

Current guidance suggests keeping a SEG only when it independently improves detection or response in one of those areas. These controls tend to break down when the environment assumes the gateway is the main defense, because identity compromise happens after delivery and outside the SEG’s visibility window.

Common Variations and Edge Cases

Tighter email control often increases cost and operational overhead, requiring organisations to balance resilience against complexity and user friction. That tradeoff is especially visible when Microsoft 365 security features are already licensed and tuned. In some environments, a SEG is still justified for regulatory archiving, multi-tenant standardisation, outbound DLP, or an additional isolation layer for high-risk executive mailboxes. In others, it creates duplicate false positives and slows incident response without measurably reducing risk.

There is no universal standard for this yet, but best practice is evolving toward capability testing rather than product loyalty. A legacy SEG may still be useful if it provides stronger protection for external recipient controls, URL rewriting with better telemetry, or mail flow segmentation across subsidiaries and mergers. It is less compelling when it simply mirrors what Microsoft 365 already does while adding another policy stack to maintain. For a broader security posture view, NIST CSF 2.0 is useful because it frames technology as a risk treatment decision, not a procurement default.

Teams should also watch for edge cases where email is only the entry point. If the real risk is OAuth consent abuse, session hijacking, or API key theft, the better investment is often identity governance and NHI controls, not another perimeter filter. That pattern is consistent with NHIMG findings on widespread secret exposure and poor rotation discipline in the State of Non-Human Identity Security.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
NIST CSF 2.0 GV.RM SEG retention should be based on risk reduction and governance outcomes, not legacy procurement.
OWASP Non-Human Identity Top 10 NHI-01 Email attacks often end in secret theft, OAuth abuse, and NHI compromise.
NIST AI RMF Agentic and text-based attacks require context-aware risk evaluation across identity and email channels.

Use AI RMF risk analysis to test whether SEG controls reduce actual attack paths, not just message volume.