Subscribe to the Non-Human & AI Identity Journal

What should procurement teams ask before accepting deepfake resistance claims?

They should ask which independent lab tested the control, against which standard, and at what assurance level. They should also confirm whether the test covered injection attacks as well as presentation attacks, because the two failure modes are not interchangeable. If the answer is vague, the control is unverified rather than proven.

Why This Matters for Security Teams

deepfake resistance claims are often marketed as if a single detector or liveness check can solve identity fraud. Procurement teams need to treat those claims as assurance statements, not feature names. The real question is whether the control can withstand the attack modes that matter in production, including injection attacks, replay, and presentation attacks. That distinction matters because an attacker may never need to “look like” a person if they can manipulate the input path instead.

Current guidance suggests that buyers should anchor evaluation in independent evidence, not self-attestation. The NIST Cybersecurity Framework 2.0 reinforces the need for outcome-based risk management, while NHIMG research on the DeepSeek breach shows how quickly trust claims collapse when hidden exposure is discovered. In practice, many security teams encounter weak deepfake resistance only after fraud or account takeover has already happened, rather than through intentional supplier due diligence.

How It Works in Practice

Procurement should ask for evidence that is specific, repeatable, and relevant to the intended deployment. A credible vendor should be able to name the independent lab, the test method, the standard or benchmark used, the model or sensor conditions tested, and the assurance level achieved. If the product is used for identity proofing or access control, the test scope should also clarify whether the control resists spoofing of audio, video, image, or sensor input, and whether it was evaluated against generated media, replayed media, or injected data streams.

That last point is crucial. A system may perform well against a face presented to a camera, yet fail if the attacker injects synthetic media downstream into the software pipeline. NIST-aligned thinking favors measurable controls, while NHIMG’s coverage of the DeepSeek breach underscores how hidden implementation details can undermine confidence. Procurement teams can make the review more concrete by asking for:

  • the exact lab report and test date, not a summary slide
  • the attack classes covered, including presentation and injection attacks
  • the measured false acceptance and false rejection outcomes
  • the minimum conditions required for the claimed assurance level
  • the customer-side responsibilities needed to preserve that assurance

Where possible, buyers should ask whether the evidence is tied to a published standard or profile, because “tested internally” is not the same as independently verified. These controls tend to break down in high-friction remote onboarding environments because operators compensate for false rejects by weakening the process.

Common Variations and Edge Cases

Tighter verification often increases onboarding friction, so organisations have to balance fraud resistance against user abandonment and support cost. That tradeoff is real, especially in customer-facing flows where false rejects can create revenue loss or operational bottlenecks.

Best practice is evolving, and there is no universal standard for deepfake resistance yet. Some suppliers test only against consumer-grade spoofing, while others include more advanced synthetic media and injection pathways. Procurement teams should treat those as different assurance levels, not interchangeable claims. If a vendor cannot explain the boundary of their test environment, the claim may be valid only for a narrow set of conditions.

One useful procurement check is to ask whether the control remains effective when the attacker has partial access to the workflow, such as a compromised endpoint, browser extension, or API integration. Those scenarios matter because modern fraud often mixes deception with system manipulation. A control that only works in a clean lab environment may not survive deployment in a distributed identity stack.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Agentic AI Top 10 and CSA MAESTRO address the attack and risk surface, while NIST AI RMF set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Agentic AI Top 10 Deepfake risk often becomes an input-manipulation problem in AI-driven flows.
NIST AI RMF AI RMF supports evidence-based evaluation of model and system risk claims.
CSA MAESTRO MAESTRO addresses trust, testing, and runtime assurance for agentic systems.

Demand documented test evidence, scope limits, and residual-risk acceptance for each control.