Subscribe to the Non-Human & AI Identity Journal

How should security teams detect token theft if MFA was already completed?

Focus on how the token is used, not how the user logged in. Look for unusual IP addresses, new geolocations, User-Agent changes, impossible travel, bulk data access, and access to applications or scopes the account rarely uses. Successful logins are not enough to prove legitimacy once a bearer token is stolen.

Why This Matters for Security Teams

MFA completion only proves the user or workload satisfied an authentication step. It does not prove the resulting bearer token has not been copied, replayed, or abused. That distinction is central to incident response, because stolen tokens often bypass login alerts entirely and act as standing access until expiry or revocation. Guidance from NIST Cybersecurity Framework 2.0 and case reporting such as the Salesloft OAuth token breach both reinforce the same point: authenticating a session is not the same as continuously trusting it.

For security teams, the practical problem is that token theft often looks like normal success in authentication logs. The useful signals appear later, in how the token is consumed: access from a new network, a different device fingerprint, unfamiliar user-agent strings, unusual API sequences, or sudden reads across high-value objects. That is why teams need session-level detection, not just sign-in monitoring. The Guide to the Secret Sprawl Challenge is useful here because token exposure rarely stays in one system; it spreads across code, chat, tickets, and integrations. In practice, many security teams encounter token theft only after data access has already occurred, rather than through intentional detection.

How It Works in Practice

The most reliable approach is to baseline normal token behavior and alert on deviation. Start by correlating token use with source IP, geolocation, ASN, device fingerprint, User-Agent, time of day, API scope, and data volume. A token that authenticates from a known office in the morning and then reads thousands of records from a foreign network an hour later is more suspicious than a failed login attempt ever would be. This is especially important for non-human identities, where tokens are often long-lived and embedded into automation. Research from The State of Secrets Sprawl 2026 shows how broadly exposed credentials can become, and why detection must be paired with revocation and rotation.

Operationally, teams should instrument these checks:

  • Compare token use against expected service geography and time windows.
  • Flag new User-Agent strings or sudden shifts from browser traffic to API tooling.
  • Alert on impossible travel, rapid IP churn, or access from hosting providers when that is atypical.
  • Watch for bulk exports, repeated pagination, and scope use outside normal job function.
  • Revoke the token, invalidate refresh chains, and rotate any downstream secrets immediately.

For implementation guidance, NIST Cybersecurity Framework 2.0 supports continuous monitoring and response, while the NHI Lifecycle Management Guide helps teams tie token detection to revocation, ownership, and expiry control. These controls tend to break down in shared service accounts with many legitimate access paths because the baseline becomes too broad to distinguish abuse cleanly.

Common Variations and Edge Cases

Tighter token monitoring often increases alert volume and analyst overhead, so organisations have to balance sensitivity against operational noise. There is no universal standard for this yet, especially for environments with remote work, contractor access, or global automation fleets. In those cases, current guidance suggests layering context rather than relying on any single anomaly. A travel-heavy sales team, for example, may generate frequent IP changes that look suspicious in isolation, while a CI/CD token may legitimately move between runners and regions. That is why token type matters: human sessions, service tokens, and delegated application tokens need different baselines.

Edge cases also include refresh tokens, brokered sessions, and tokens shared across multiple apps. The Top 10 NHI Issues page is relevant because overused identities and duplicated secrets make attribution harder once misuse starts. If an attacker replays a token from a normal cloud region using a valid device posture, network-only detections may miss it. In those cases, scope abuse, request sequence analysis, and downstream data-access patterns become the better indicators. The key takeaway is simple: if a token can be replayed, then MFA is no longer the control boundary that matters. In practice, teams usually discover this after an exposed token is used successfully in a place the original user never intended.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

OWASP Non-Human Identity Top 10 address the attack and risk surface, while NIST CSF 2.0 and NIST Zero Trust (SP 800-207) set the governance and control requirements practitioners need to meet.

Framework Control / Reference Relevance
OWASP Non-Human Identity Top 10 NHI-03 Token theft is mitigated by short-lived credentials and fast revocation.
NIST CSF 2.0 DE.CM-8 Continuous monitoring is required to detect abnormal token use after MFA.
NIST Zero Trust (SP 800-207) PR.AC Zero trust requires continuous trust evaluation beyond initial authentication.

Limit token lifetime, monitor reuse, and revoke exposed NHI credentials immediately.