Subscribe to the Non-Human & AI Identity Journal

What breaks when an organisation has only one DNS provider?

A single DNS provider creates a shared failure point for websites, APIs, authentication services, and internal resolution. If that provider has an outage or configuration issue, the organisation may lose access to critical systems at once. Secondary providers and failover testing reduce that exposure, but only if they are kept current.

Why This Matters for Security Teams

DNS is not just a lookup service. It is the control plane that lets users reach websites, APIs, SaaS logins, and internal applications. When an organisation depends on a single provider, that provider becomes a shared failure domain for availability and, in some cases, security. A misconfiguration, routing fault, or control-plane outage can make healthy systems appear down.

This matters because DNS failures often surface as authentication issues, application errors, or timeouts that look unrelated at first. Teams may investigate identity, firewall, or app code while the actual fault sits in name resolution. The risk is not theoretical: NHI Mgmt Group notes that only 5.7% of organisations have full visibility into their service accounts, which is a reminder that dependency gaps are usually discovered after disruption rather than during planning. The same pattern applies to DNS dependencies in production.

Security and resilience planning should treat DNS as critical infrastructure, not a commodity utility. The NIST Cybersecurity Framework 2.0 places clear emphasis on identifying and managing external dependencies, which is exactly where single-provider DNS risk lives. In practice, many security teams encounter DNS concentration risk only after authentication and application failures have already cascaded across multiple services.

How It Works in Practice

A single DNS provider creates one operational dependency for both public and internal resolution. If that provider fails, clients cannot resolve hostnames even when the target systems are healthy. That affects web traffic, API calls, SMTP delivery, SSO redirects, certificate validation flows, and service-to-service traffic that relies on DNS names instead of fixed IPs.

The practical failure modes usually fall into three buckets:

  • Authoritative outage: the provider cannot answer queries for hosted zones.
  • Recursive outage: users cannot resolve external or third-party names.
  • Configuration error: a bad record, TTL setting, delegation change, or automated update breaks resolution at scale.

Current guidance suggests using a secondary DNS provider, but multi-provider design only helps if the alternate zone is current, the failover path is tested, and registrar settings are correct. The operational lesson is the same one seen in NHI incidents: hidden dependencies become critical when they are never exercised. NHI Mgmt Group’s JetBrains GitHub plugin token exposure shows how a single exposed dependency can cascade into wider access risk when governance is weak. For DNS, the equivalent is a single point of failure in resolution infrastructure.

Good practice is to document which services depend on which zones, validate time-to-live settings, and rehearse failover before an incident. Providers such as NIST Cybersecurity Framework 2.0 support this kind of dependency mapping through resilience and recovery planning. These controls tend to break down in organisations that use one provider for both public DNS and internal service discovery because a single control-plane fault can remove access to the fallback path itself.

Common Variations and Edge Cases

Tighter DNS redundancy often increases operational overhead, requiring organisations to balance resilience against configuration complexity. That tradeoff is real: dual providers, split-horizon setups, and health-checked failover all add maintenance burden.

There is no universal standard for DNS failover design yet, so teams should be explicit about what they are protecting against. A secondary provider helps with provider outage, but not with bad zone data copied into both places, registrar lockout, or application code that hard-codes one resolver path. Best practice is evolving toward testing both authoritative and recursive dependencies, not just keeping a second vendor on paper.

Edge cases matter. Some environments use external DNS for customers but internal DNS for authentication, CI/CD, and service discovery. Others front critical services through CDNs or load balancers, which can mask provider issues until cache expiry or certificate renewal. For organisations that manage many machine identities, DNS loss can also interrupt token exchange and secret retrieval workflows, compounding the impact of a provider event. The most robust posture is to treat DNS as part of business continuity planning, not just network administration.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

NIST CSF 2.0, NIST CSF 2.0 and NIST CSF 2.0 set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
NIST CSF 2.0 ID.BE-4 Single-provider DNS is a critical external dependency that affects resilience.
NIST CSF 2.0 PR.AC-5 DNS outages can disrupt authentication and access flows across services.
NIST CSF 2.0 RS.RP-1 Recovery planning must account for DNS failover and restoration steps.

Validate DNS dependencies in access paths and ensure alternate resolution works before incident day.