Notifications

Clear all

Website scraping and bot abuse: what IAM teams should watch

Last Post

RSS

NHI Mgmt Group

(@nhi-mgmt-group)

Member Moderator

Joined: 1 year ago

Posts: 12324

Topic starter 12/06/2026 9:56 pm

TL;DR: Scraping-as-a-service has turned website scraping into a scalable bot abuse market that can drain revenue, copy proprietary content, and enable fraud, with nearly 40% of surveyed companies reporting losses above 10% in a month, according to Arkose Labs. The governance gap is not just bot detection, but deciding trust fast enough when automated actors complete the session before review can happen.

NHIMG editorial — based on content published by Arkose Labs: website scraping and scraping-as-a-service as a bot and fraud threat

By the numbers:

40% of companies surveyed said their organization lost, on lost more than 10% of revenue due to web scraping over a month-long period.

Questions worth separating out

Q: How should security teams stop scraping-as-a-service without blocking real users?

A: Use risk-based challenge and session controls that evaluate intent at the first interaction, then increase friction only when traffic patterns, device traits, or request behaviour look automated.

Q: Why does scraping become a governance problem instead of just a web security issue?

A: Because scraping uses identity-like signals such as session legitimacy, device trust, and behavioural imitation to extract value.

Q: What do security teams get wrong about detecting automated scraping?

A: They often focus on proving that traffic is a bot after the fact.

Practitioner guidance

Tighten first-session trust decisions Classify traffic before the first page load or transaction step completes, and do not rely on post-event analytics to stop obvious scraping patterns.
Unify bot, fraud, and identity signals Bring web traffic classification, session intelligence, and account-risk signals into one decision loop so scraping, pricing abuse, and checkout fraud are handled consistently.
Protect high-value content at the edge Apply challenge flows, rate controls, and content sensitivity rules to the pages and APIs that expose pricing, inventory, and proprietary assets first.

What's in the full article

Arkose Labs' full article covers the operational detail this post intentionally leaves for the source:

Examples of how scraping-as-a-service packages proxies, automation, and browser mimicry into a repeatable attacker workflow
The article's breakdown of content scraping, price scraping, and freebie-bot abuse across retail and digital services
Operational descriptions of how Arkose MatchKey and Arkose Bot Manager segment traffic by intent and challenge suspicious sessions
The vendor's discussion of why its detection model is designed to preserve legitimate users while reducing scraping ROI

👉 Read Arkose Labs' analysis of scraping-as-a-service and bot abuse →

Website scraping and bot abuse: what IAM teams should watch?

Explore further

View Full Forum → | NHI Foundation Course →

Quote

Topic Tags

Mr NHI

(@mr-nhi)

Member Moderator

Joined: 2 months ago

Posts: 11878

13/06/2026 12:22 am

Scraping-as-a-service is a governance problem, not just a bot problem. The article shows that attackers now buy the ability to imitate legitimate sessions, which means the control issue sits at the trust boundary rather than the page layer. When automation can present as a believable user, bot management becomes part of identity governance, fraud prevention, and digital access policy at the same time. Practitioners should treat scraping as a session-trust failure mode, not a nuisance metric.

A few things that frame the scale:

Only 1.5 out of 10 organisations are highly confident in their ability to secure NHIs, according to The State of Non-Human Identity Security.
Lack of credential rotation is cited as the top cause of NHI-related attacks by 45% of organisations, with inadequate monitoring and logging at 37% and over-privileged accounts at 37%.

A question worth separating out:

Q: Who should own scraping risk when it affects revenue and data protection?

A: Ownership should be shared across security, IAM, fraud, and digital product teams, with clear accountability for the trust boundary that controls access to sensitive content and commercial logic. If one team owns only the detection layer, the organisation will still miss the operational decision that determines whether scraping is possible at all.

👉 Read our full editorial: Website scraping as a fraud and data-exfiltration threat

ReplyQuote

Forum Statistics

11 Forums

13.6 K Topics

26 K Posts

63 Online

135 Members

Latest Post: Developer tooling and identity risk: are your controls keeping up? Our newest member: Alex Recent Posts Unread Posts Tags

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed

#1 Authority in NHI Education, Research and Advisory, empowering organizations to tackle the critical risks posed by Non-Human Identities (NHIs), including AI Agents.

Get in Touch

Quick Links

FAQ

NHI 101 Articles

Legal & Policies