TL;DR: DDoS attacks spiked 278% in Q1 2020 and another 31% in Q1 2021, with average incidents lasting about four hours and downtime costs reaching $5,600 per minute, according to DigiCert. Redundancy, secondary DNS, and real-time traffic visibility remain the practical controls, not a single provider or mitigation layer.
At a glance
What this is: This is a DigiCert explainer on DDoS attack prevention that argues DNS redundancy, failover, secondary DNS, and traffic monitoring are the core controls for keeping services available.
Why it matters: It matters because availability controls sit inside broader identity and access programmes, especially where DNS, cloud, and machine identities depend on resilient control planes and uninterrupted routing.
By the numbers:
- 2020, he first quarter of 2020, DDoS attacks spiked 278%.
- One minute of downtime can cost an organization as much as $5,600.
👉 Read DigiCert's blog on DDoS attack prevention and DNS mitigation
Context
DDoS prevention is about keeping services reachable when traffic volume is intentionally pushed beyond what a single endpoint or provider can absorb. In DNS and cloud environments, the problem is not just attack volume, but whether routing, failover, and visibility are designed to keep a service online under stress.
For identity and infrastructure teams, the governance lesson is simple: availability depends on control-plane resilience as much as on protection services. If DNS, monitoring, and backup paths are single points of failure, an attack can turn into an extended outage even when the application itself is intact.
Key questions
Q: How should security teams design DNS redundancy to withstand DDoS attacks?
A: Security teams should design DNS redundancy so that failover, secondary authority, and monitoring are independent of the same provider failure. The goal is not only to survive an attack, but to preserve resolution when one path is saturated or unavailable. Test the design under load, confirm health checks work, and verify that the backup path can actually answer queries during stress.
Q: Why does DDoS mitigation need DNS monitoring as well as traffic filtering?
A: DDoS mitigation needs DNS monitoring because the earliest warning sign is often an abnormal spike in query behaviour, not a finished outage. Traffic filtering helps once the attack is underway, but monitoring tells teams when to act. It also catches misconfiguration errors that can look like attack traffic, which reduces false escalation and improves response quality.
Q: What fails when a domain depends on a single DNS or cloud provider?
A: When a domain depends on a single DNS or cloud provider, an outage or attack against that provider can take the domain offline even if the application itself is healthy. That creates a shared failure domain. The result is longer downtime, slower recovery, and greater exposure to revenue and reputation loss because there is no alternate path for resolution or routing.
Q: Who is accountable when DNS availability fails during a DDoS attack?
A: Accountability usually sits with infrastructure, security, and application owners together, because DNS availability is a shared control-plane risk. NIST Cybersecurity Framework 2.0 is useful here because it ties governance, protection, detection, response, and recovery into one risk model. The practical question is whether the organisation has defined ownership for failover, monitoring, and recovery before the outage happens.
Technical breakdown
How DDoS swarms overwhelm DNS and service availability
A distributed denial-of-service attack uses many compromised devices to flood a target with requests, consuming bandwidth, connection tables, or upstream service capacity. In DNS terms, this can exhaust authoritative servers or make resolution unreliable enough that downstream applications appear offline. The weakness is not only raw volume, but the absence of redundancy and health-aware routing. If traffic cannot shift quickly to another healthy endpoint, the service inherits the full blast radius of the attack.
Practical implication: map DNS and cloud dependencies to every single point of failure before assuming availability controls will hold.
Failover and secondary DNS reduce outage exposure
Failover works by monitoring multiple IPs or hosts and directing traffic to a healthy destination when one fails. Secondary DNS adds another layer by maintaining an alternate authoritative source so resolution can continue if the primary provider is unavailable. These are not the same thing. Failover protects service routing, while secondary DNS protects the authority that answers queries. Together they reduce the chance that one outage, attack, or provider failure takes the domain dark.
Practical implication: verify that failover and secondary DNS are configured independently, tested, and not dependent on the same failure domain.
Traffic anomaly detection turns DDoS into a detectable event
DDoS defence is stronger when teams can see unusual traffic patterns as they begin, not after the outage is complete. Real-time anomaly detection and DNS analytics help separate legitimate spikes from malicious surges and can also surface misconfigurations that create similar load patterns. That matters because mitigation is often a timing problem. The sooner the anomaly is visible, the sooner routing, rate limiting, or provider-level defences can be applied before saturation.
Practical implication: pair DNS analytics with operational alerting so traffic spikes are investigated before service degradation becomes user-visible.
Threat narrative
Attacker objective: The attacker’s objective is to deny access to the target service and create operational disruption, revenue loss, and reputational damage.
- Entry occurs when attackers assemble a botnet of compromised devices and direct it toward a specific server, domain, or network.
- Escalation happens as the flood of requests overwhelms DNS resolution or upstream capacity, especially where redundancy is missing.
- Impact is loss of availability, with the domain or service becoming difficult or impossible to reach for users and applications.
Breaches seen in the wild
- Codefinger AWS S3 ransomware attack — Codefinger used compromised AWS credentials to encrypt S3 buckets via SSE-C.
- DeepSeek breach — DeepSeek breach exposed 1M+ log lines and sensitive secret keys.
Read our 52 NHI Breaches Analysis report for a comprehensive view of breaches impacting Non-Human Identities including AI Agents.
NHI Mgmt Group analysis
DDoS resilience is a control-plane availability problem, not just a traffic problem. The article makes clear that attack volume matters, but the decisive weakness is whether DNS and cloud routing can fail over cleanly under pressure. When a service depends on one provider or one authoritative path, the attacker only needs to concentrate traffic long enough to make reachability fail. Practitioners should treat DNS continuity as part of identity-adjacent infrastructure governance, not as an isolated network concern.
Redundancy is the real boundary control for availability. Failover and secondary DNS are presented as practical safeguards because they reduce the blast radius of both attacks and provider outages. That is the important governance point: availability is preserved by removing single points of failure, not by assuming any one mitigation layer will absorb all forms of disruption. For teams running cloud, DNS, and workload identity services, this is a reminder that resilience has to be designed into the path, not added after an outage.
Traffic visibility is the difference between mitigation and reaction. Real-time anomaly detection gives teams a chance to identify abnormal query patterns before a flood becomes a prolonged outage. In practice, that turns DDoS from an all-or-nothing event into something that can be observed, triaged, and routed around. The operational lesson is to treat analytics as part of the mitigation stack, because without early visibility the response window closes quickly.
DDoS economics change the governance threshold for resilience. The post ties small intervals of downtime to meaningful financial loss, which means availability decisions belong in risk and architecture discussions, not just incident response. When a few minutes of disruption can cascade into material cost, backup DNS, multi-provider design, and monitoring become business controls. Practitioners should frame DDoS resilience as a measurable risk reduction investment.
Identity-adjacent services need the same resilience thinking as core applications. DNS is often treated as plumbing, yet it sits under the reachability of authentication flows, service endpoints, and machine-to-machine dependencies. If resolution fails, downstream identity and access services can fail with it, even when their own systems are healthy. The practical conclusion is that identity programmes should include DNS and availability dependencies in their resilience scope.
From our research:
- 70% of organisations grant AI systems more access than they would give a human employee performing the exact same job, according to The 2026 Infrastructure Identity Survey.
- Only 13% of organisations feel extremely prepared for the reality of agentic AI despite the majority racing toward autonomous adoption.
- For the governance dimension behind this risk, see Ultimate Guide to NHIs , Key Challenges and Risks for how visibility gaps and over-privilege compound outage exposure.
What this signals
Identity programmes should treat resilience as part of access governance. If DNS or routing paths fail, identity-dependent services fail with them, which means access models are only as trustworthy as the availability of the control plane behind them. The governance task is to make failover, monitoring, and recovery visible in the same way as access reviews and entitlement checks.
With 70% of organisations granting AI systems more access than human employees performing the same job, per The 2026 Infrastructure Identity Survey, resilience and privilege design are converging. That convergence makes dependency mapping essential across human, NHI, and agentic workflows.
Identity blast radius: when the availability of DNS, secrets, or workload routing determines whether identities can operate, the failure domain becomes an identity risk. Teams should plan for the point at which a control-plane outage turns into a broader service and governance outage.
For practitioners
- Map DNS single points of failure Document every authoritative nameserver, CDN dependency, and upstream provider path so teams can see where one outage can take a domain offline.
- Test failover under real traffic conditions Run controlled failover exercises that confirm traffic reroutes to healthy servers when one endpoint degrades or disappears.
- Deploy secondary DNS with independent failure domains Use a secondary authoritative setup that does not rely on the same provider, control plane, or network path as the primary service.
- Instrument real-time anomaly detection for DNS traffic Alert on sudden query spikes, unusual source patterns, and misconfiguration-driven surges so the response begins before saturation.
Key takeaways
- DDoS mitigation succeeds when teams remove single points of failure in DNS and cloud routing, not when they rely on a single protective layer.
- The evidence in the article shows that outage costs can escalate quickly, which makes redundancy and anomaly detection business controls as much as technical ones.
- Practitioners should validate failover, secondary DNS, and traffic monitoring together because resilience only works when the full path is tested.
Standards & Framework Alignment
This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.
NIST CSF 2.0, NIST Zero Trust (SP 800-207) and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.
| Framework | Control / Reference | Relevance |
|---|---|---|
| NIST CSF 2.0 | PR.PT-5 | DDoS mitigation depends on resilient protective technology and service continuity. |
| NIST Zero Trust (SP 800-207) | Zero trust assumes continuous verification, but availability still depends on resilient access paths. | |
| NIST CSF 2.0 | DE.CM-1 | Traffic anomaly detection is a monitoring control directly relevant to DDoS detection. |
Treat DNS and service availability as part of the trusted path that zero trust still depends on.
Key terms
- Distributed Denial-of-Service: A distributed denial-of-service attack uses many compromised systems to flood a target with requests until the service cannot respond normally. The goal is disruption rather than theft. In identity and infrastructure programmes, it matters because availability controls, not just authentication controls, determine whether users and workloads can keep reaching the service.
- Failover: Failover is the process of shifting traffic or workload handling to a healthy backup system when the primary one fails. It is only effective when the backup path is independently monitored and reachable. For identity-adjacent services such as DNS and access dependencies, failover reduces outage duration and limits the blast radius of a single failure.
- Secondary DNS: Secondary DNS is an alternate authoritative DNS setup that can continue answering queries if the primary provider is unavailable. It strengthens service continuity by separating the ability to resolve names from a single control point. In practice, it is a resilience control that protects availability when attacks, outages, or provider issues affect the primary path.
Deepen your knowledge
NHI governance, agentic AI identity, and machine identity security are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or governance in your organisation, it is worth exploring.
This post draws on content published by DigiCert: DDoS Attack Prevention and Mitigation. Read the original.
Published by the NHIMG editorial team on 2026-06-17.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org