Airline scraping attacks expose the limits of bot defence

By NHI Mgmt Group Editorial TeamPublished 2026-05-11Domain: Governance & RiskSource: Arkose Labs

TL;DR: Airlines are seeing web scraping drive booking slowdowns, lost sales, and customer frustration, while the source report says 51% of travel and hospitality companies are concerned about scraping and that 100% of airline attacks in its dataset were bot-driven, according to Arkose Labs. The governance lesson is that defending digital access now requires treating automated traffic as an operational risk, not just a site-performance problem.

At a glance

What this is: This is an airline-focused analysis of web scraping and bot-driven abuse, showing how automated collection can degrade booking performance and erode revenue.

Why it matters: It matters because IAM and security teams must distinguish legitimate automation from abusive automation, then protect customer-facing systems without breaking access for real users.

By the numbers:

51% of companies in the travel and hospitality industry are concerned to a moderate or large extent about web scraping attacks.
100% of attacks on airlines are perpetrated by bots.

👉 Read Arkose Labs' analysis of airline web scraping and bot abuse

Context

Web scraping is automated data extraction from websites, and in airline commerce it becomes an access-control problem as soon as bots start consuming pricing, seat availability, and booking workflows at scale. The issue is not only data theft, but also degraded availability when automated traffic consumes the same resources that legitimate customers need.

For IAM and security teams, this sits at the boundary between customer identity, bot mitigation, and digital fraud controls. The article's core point is that airlines are dealing with a form of automation abuse that affects revenue, service reliability, and customer trust at the same time.

The starting position is typical for consumer-facing digital businesses: visible demand, high-value data, and web endpoints that were not built to absorb hostile automation as a routine operating condition.

Key questions

Q: How should airlines stop web scraping without hurting real customers?

A: Airlines should use layered bot detection that combines behaviour, device, session, and network signals, then apply progressive challenge only when confidence is low. That approach reduces abuse while preserving booking flow for legitimate travellers. The objective is to protect pricing, inventory, and conversion paths without turning security controls into a customer-experience bottleneck.

Q: Why does web scraping create more than data loss for travel companies?

A: Because scraping also consumes application capacity. When bots repeatedly hit search and booking endpoints, they can slow pages, increase errors, and reduce the number of completed bookings. For travel companies, the issue is not only that data is copied. It is that automation can directly interfere with revenue-generating customer journeys.

Q: What signals show that scraping controls are too weak?

A: Watch for rising latency, lower look-to-book ratios, more abandoned booking sessions, and repeated requests that do not follow normal customer behaviour. If those indicators move together, scraping is probably affecting both security and commerce. A good control plane should reduce hostile automation without creating a measurable drop in genuine user throughput.

Q: Who is accountable when bot traffic disrupts airline booking systems?

A: Accountability should sit jointly across security, digital commerce, and platform operations, because the failure affects access, performance, and revenue at the same time. The right governance model assigns owners for detection, response, and business impact measurement. That is the only way to prevent bot controls from becoming either invisible or business-breaking.

Technical breakdown

How scraping bots evade simple detection

Modern scraping traffic is built to look ordinary. The article describes bots that mimic human behaviour, rotate IP addresses, and use residential proxies to avoid obvious blocking rules. That matters because basic controls such as single-signal rate limits or static bot signatures often fail when the attacker can vary timing, source reputation, and request patterns. In practice, the defender is not dealing with one script but with an adaptive collection layer designed to stay within normal-looking thresholds while extracting large volumes of data.

Practical implication: teams need detection that combines behavioural, device, and session signals instead of relying on one brittle indicator.

Why scraping becomes an availability problem

Scraping is often described as data theft, but the operational effect is broader. Heavy bot traffic can slow down search, pricing, and booking functions, and in the article's example it can even crash the site. That is a resource exhaustion problem, where repeated automated requests compete with real users for application capacity. For airlines, the impact is not limited to data replication. It becomes degraded conversion, higher abandonment, and a direct hit to customer experience on the transaction path.

Practical implication: capacity planning and bot controls need to be treated as part of transaction protection, not just security tooling.

What adaptive response changes in bot mitigation

The article points to layered protection that fingerprints traffic beyond the browser, assesses payload risk, and selectively routes suspicious activity for stronger challenge. The technical value is in moving from all-or-nothing blocking to risk-based response, which reduces disruption for legitimate customers while increasing friction for automated abuse. That pattern matters in consumer identity environments because the challenge is not always to stop every bot immediately. It is to preserve access quality, protect data, and raise the cost of collection until abuse is uneconomic.

Practical implication: use step-up challenge and traffic routing for suspicious sessions instead of blunt blocking across the whole site.

NHI Mgmt Group analysis

Bot abuse is now a digital access governance problem, not only an anti-fraud problem. The article shows scraping affecting revenue, availability, and customer experience in the same workflow, which means the control plane spans more than fraud detection alone. When automated collection consumes the same booking paths that legitimate users need, the governance question becomes who and what is allowed to consume digital service capacity. Practitioners should treat bot traffic as an identity and access policy issue at the application edge.

Scraping pressure exposes a broader access asymmetry across consumer platforms. Airlines invest heavily in access for real customers, but hostile automation can still consume the same public-facing journeys at machine speed. That creates a mismatch between business intent and actual service consumption, especially where pricing and inventory data are high value. The implication is that customer experience, anti-bot policy, and digital resilience have to be managed together, not in separate queues.

Identity trust for web commerce now depends on distinguishing productive automation from abusive automation. The article's own numbers, including 51% concern in travel and hospitality and bot-driven airline attacks, point to a structural operating condition rather than an edge case. Automated access asymmetry: the same public endpoint can support legitimate users, authorized partners, and hostile scrapers, but existing controls often cannot tell them apart quickly enough. Practitioners should reframe protection around trust decisions at request time.

Airline scraping shows why availability is part of identity governance. Once bots can degrade the booking path, the consequences move beyond data exposure into service denial and commercial loss. That makes bot defense a governance control over consumption, not just a technical countermeasure. Security teams should expect similar pressure anywhere pricing, inventory, or content has direct economic value.

From our research:
51% of companies in the travel and hospitality industry are concerned to a moderate or large extent about web scraping attacks, according to The State of Secrets in AppSec.
Only 44% of developers are reported to follow security best practices for secrets management, which means weak operational discipline often accompanies automated abuse paths.
Read Top 10 NHI Issues for a broader view of the governance gaps that appear when machine-scale access outpaces control design.

What this signals

Automated abuse is now a service-governance issue for consumer identity programmes. Travel teams that protect only login flows will miss the operational surface where scraping actually hurts, including search, fare retrieval, and booking completion. The practical shift is to measure trust at request time and to treat bot traffic as a first-class access class, not a nuisance category.

With only 44% of developers following security best practices for secrets management, weak control discipline can also spill into exposed data paths and partner integrations, per The State of Secrets in AppSec. That is why scraping defence, secrets hygiene, and transaction protection increasingly belong in the same governance conversation.

Identity trust must now extend to machine behaviour at the edge. Where automation is expected, the programme needs a clear line between authorised integration traffic and hostile collection activity. Teams that can classify those patterns early will reduce both customer friction and commercial leakage.

For practitioners

Map high-value request paths Identify the search, fare, inventory, and booking endpoints that create the highest commercial exposure, then classify them by acceptable traffic patterns and user impact if abused.
Use layered bot detection Correlate browser, network, session, and behavioural signals so that rotating IPs and residential proxies do not become the only basis for trust decisions.
Separate customer friction from bot friction Apply selective challenge and progressive response only when risk rises, so legitimate travellers keep booking while automation faces increasing cost.
Monitor conversion and availability together Track look-to-book ratio, page latency, abandonment, and error rates in one operating view so scraping is measured as both a security and revenue issue.

Key takeaways

Web scraping in airlines is not only a data problem. It is an access, availability, and revenue problem that sits at the edge of identity governance.
The article's data suggests this is already widespread, with travel and hospitality leaders worried about scraping and airline attacks described as bot-driven.
Teams need layered detection, selective challenge, and joint ownership across security and digital operations to keep automation from undermining customer journeys.

Standards & Framework Alignment

This section maps relevant standards and security frameworks to the operational risks and controls described in this guidance.

NIST CSF 2.0, NIST CSF 2.0 and NIST SP 800-63 set the governance and control requirements practitioners need to meet.

Framework	Control / Reference	Relevance
NIST CSF 2.0	DE.CM-1	Continuous monitoring is central to spotting scraping patterns and service abuse.
NIST CSF 2.0	PR.AC-5	Access control should distinguish legitimate users from automated abuse at the edge.
NIST SP 800-63		Consumer identity journeys need friction balanced against abuse prevention.

Reduce account and transaction abuse while preserving usable customer authentication paths.

Key terms

Web Scraping: Web scraping is automated extraction of data from websites, usually at scale and often without the site owner's consent. In security terms, it becomes a governance issue when the automation consumes public application resources, copies sensitive commercial data, or degrades service for legitimate users.
Bot Mitigation: Bot mitigation is the set of controls used to identify, challenge, slow, or block automated traffic that behaves outside acceptable use. It relies on behavioural analysis, device and network signals, and risk-based response so that hostile automation is constrained without unnecessarily disrupting real customers.
Look-to-book Ratio: Look-to-book ratio measures how many fare searches or visits turn into completed bookings. In airline commerce it is a practical signal of whether traffic quality and booking friction are healthy, making it useful for spotting when scraping or other automation is hurting conversion.

Deepen your knowledge

NHI governance, agentic AI identity, and machine identity lifecycle are core topics in our NHI Foundation Level course, the industry's only accredited NHI security programme. If you are responsible for identity security strategy or NHI governance in your organisation, it is worth exploring.

This post draws on content published by Arkose Labs: airline scraping attacks and bot-driven abuse in travel commerce. Read the original.

NHIMG Editorial Note
Published by the NHIMG editorial team on 2026-05-11.
NHI Mgmt Group — the independent authority on Non-Human Identity, IAM, and Agentic AI security. nhimg.org