Imperva Report Claims That 50% Of The World Wide Web Is Now Bots

May 7, 2024

Automation has been a part of the Internet since long before the appearance of the World Wide Web and the first web browsers, but it’s become a significantly larger part of total traffic the past decade. A recent report by cyber security services company Imperva pins the level of automated traffic (‘bots’) at roughly fifty percent of total traffic, with about 32% of all traffic attributed to ‘bad bots’, meaning automated traffic that crawls and scrapes content to e.g. train large language models (LLMs) and generate automated content as well as perform automated attacks on the countless APIs accessible on the internet.

According to Imperva, this is the fifth year of rising ‘bad bot’ traffic, with the 2023 report noting again a few percent increase. Meanwhile ‘good bot’ traffic also keeps increasing year over year, yet while these are not directly nefarious, many of these bots can throw off analytics and of course generate increased costs for especially smaller websites. Most worrisome are the automated attacks by the bad bots, which ranges from account takeover attempts to exploiting vulnerable web-based APIs. It’s not just Imperva who is making these claims, the idea that automated traffic will soon destroy the WWW has floated around since the late 2010s as the ‘Dead Internet theory‘.

Although the idea that the Internet will ‘die’ is probably overblown, the increase in automated traffic makes it increasingly harder to distinguish human-generated content and human commentators from fake content and accounts. This is worrisome due to how much of today’s opinions are formed and reinforced on e.g. ‘social media’ websites, while more and more comments, images and even videos are manipulated or machine-generated.

50 thoughts on “Imperva Report Claims That 50% Of The World Wide Web Is Now Bots”

Ale says:

May 7, 2024 at 4:14 am

Here we welcome our new overlord Bot !

Report comment

Reply
Ostracus says:

May 7, 2024 at 4:33 am

So 50/50 one of us could be a bot.

Report comment

Reply
1. Clovis Fritzen says:
  
  May 7, 2024 at 5:35 am
  
  10/10 best comment of the day
  
  Report comment
  
  Reply
2. Reluctant Cannibal says:
  
  May 7, 2024 at 5:44 am
  
  Most of my comments here are automated using a python script and a LLM. Fortunately, so far, nobody has noticed :)
  
  Report comment
  
  Reply
  1. Truth says:
    
    May 7, 2024 at 6:57 am
    
    I was not going to say anything, but now that you mention it *evil grin*
    
    Report comment
    
    Reply
    1. Reluctant Cannibal says:
      
      May 7, 2024 at 7:14 am
      
      Have you not noticed how banal they usually are?
      
      Report comment
      
      Reply
      1. a_do_z says:
        
        May 7, 2024 at 10:53 am
        
        So, your LLM focused its learning on my web postings. That’s just super.
        I thought I felt something watching me.
        
        Report comment
3. HaHa says:
  
  May 7, 2024 at 1:17 pm
  
  Am I real, or am I trollbot?
  
  Report comment
  
  Reply
  1. Reluctant Cannibal says:
    
    May 7, 2024 at 1:32 pm
    
    Are there any other options?
    
    Report comment
    
    Reply
  2. TG says:
    
    May 7, 2024 at 1:53 pm
    
    That is one that has always bothered me when I see it online.. the assumption that all trolls are bots by definition. Trolling has a long and storied human tradition!
    
    Report comment
    
    Reply
  3. Andrew says:
    
    May 7, 2024 at 2:06 pm
    
    Sadly it’s really you.
    
    At least you use the same account name so it’s easier to ignore.
    
    Report comment
    
    Reply
    1. Totally real person #238239 (Not a bot) says:
      
      May 7, 2024 at 2:13 pm
      
      Thanks for the free advice
      
      Report comment
      
      Reply
      1. Andrew says:
        
        May 7, 2024 at 2:41 pm
        
        Everyone still knows it’s you.
        
        Report comment
h2odragon says:

May 7, 2024 at 4:53 am

If you have a smaller site, 90%+ of your traffic is likely to be automated, and inconsiderate of “robots.txt” or other conventions.

Report comment

Reply
Dan says:

May 7, 2024 at 5:12 am

“Thou shalt not make a machine in the likeness of a human mind”

Report comment

Reply
1. Ostracus says:
  
  May 7, 2024 at 11:40 am
  
  No qualms with making a human mind in the likeness of a machine.
  
  Report comment
  
  Reply
2. TG says:
  
  May 7, 2024 at 1:13 pm
  
  Of course proclamations and constitutions don’t actually have any power to prevent anything from happening without the civic and religious discipline which existed when they were written. That exists in Dune, not on Earth, which is one of the lessons of Dune
  
  Report comment
  
  Reply
  1. Piotrsko says:
    
    May 7, 2024 at 2:36 pm
    
    So youre saying that egbok will occur in another couple hundred years?
    
    Report comment
    
    Reply
    1. TG says:
      
      May 7, 2024 at 3:11 pm
      
      Egbok and wagmi are both right on schedule.
      
      Report comment
      
      Reply
jbx says:

May 7, 2024 at 5:16 am

Blocked (inside htaccess) ip’s from :
– China
– Huawei
– DataCamp
– Amazonaw
– Amazon Data Service
– Microsoft Data Center
– Facebook crawlers

This reduced half of the total requests – and were – most of them – malicious / abusive requests.

Report comment

Reply
1. limroh says:
  
  May 7, 2024 at 10:43 am
  
  So islanding the Internet is the solution after all? :-/
  
  Okay, I spun your solution “just a *little* farther” and I technically agree with it but it’s more ore less just a personal solution not solving “global” problem.
  
  Kinda like all of “us” using addblockers / noscript to “fight against ads” but not the general public.
  
  Report comment
  
  Reply
  1. TG says:
    
    May 7, 2024 at 1:23 pm
    
    Seems like monopolization and internationalization already islanded the internet
    
    Report comment
    
    Reply
2. shod says:
  
  May 7, 2024 at 10:44 am
  
  Lemme guess: you could not block google because 70% of your site(s) are google-tracking your visitor’s asses.
  
  Report comment
  
  Reply
Justin says:

May 7, 2024 at 5:39 am

Are there solutions? I think maybe the only way around it is to remove anonymity. Have hard authentication to link every account with a real person. It still won’t remove the possibility of bots because your account could get hacked and used maliciously. But that would limit the access. Or people might pay to use your account, but you’d still be accountable and maybe lose your access. It’s not a perfect solution, but I think it would stem the tide.

Unfortunately, you lose privacy which is a big problem too. But a lot of accounts can be linked back to you already. Just not as easily as this would make it. I think Facebook originally kept new accounts down, and only allowed certain groups like colleges – you had to prove you were a college student to join. Now they’re a big part of the problem with their own bots.

Report comment

Reply
1. Reluctant Cannibal says:
  
  May 7, 2024 at 5:47 am
  
  Does .htaccess not work any more?
  
  Report comment
  
  Reply
2. Anonymous says:
  
  May 7, 2024 at 9:54 am
  
  >Are there solutions? I think maybe the only way around it is to remove anonymity.
  How about instead of throwing out one of the best aspects of the internet, we just make running bots a criminal offense? Fundamentally this is an issue of human behavior, not technology.
  
  Report comment
  
  Reply
  1. HaHa says:
    
    May 7, 2024 at 1:26 pm
    
    What about eating/shitting/farting/fapping human NPCs?
    
    Making HERP/DERP illegal isn’t a terrible idea, hard to enforce.
    
    Create a blacklist of expressions (e.g. ‘Privatize the gains, socialize the losses’) from the political parties daily talking point emails, but too easy to game.
    Just continue to put ‘moron’ mental checkmark next to anybody posting such.
    
    Report comment
    
    Reply
  2. fhunter says:
    
    May 17, 2024 at 3:53 pm
    
    And this will break the good things – ability for computer to collect and filter data for you.
    Is fetching and filtering RSS feed from this site considered ‘being a bot’ ? What else is?
    
    Report comment
    
    Reply
3. TG says:
  
  May 7, 2024 at 1:21 pm
  
  Yeah that’s one solution if you want to turn the web into just one big chilling effect ruled by the countries with the most biomass signed on to receive an internet connection
  
  Report comment
  
  Reply
craig says:

May 7, 2024 at 5:45 am

This is it.
This is Judgement Day

Report comment

Reply
1. Zamorano says:
  
  May 7, 2024 at 6:10 am
  
  Don’t worry, Imperva also produces an anti-bot solution and they also have a free 30-day trial.
  
  Report comment
  
  Reply
Jouni says:

May 7, 2024 at 6:35 am

Yeah, “research” aka “free marketing”.

The numbers are probably rounded “a bit” up to increase their sales of .. surprise surprise, bot blocking products!

Report comment

Reply
1. Reluctant Cannibal says:
  
  May 7, 2024 at 7:16 am
  
  Basically, a .htaccess script?
  
  Report comment
  
  Reply
  1. fhunter says:
    
    May 17, 2024 at 3:53 pm
    
    I prefer fail2ban ;-)
    
    Report comment
    
    Reply
Jose says:

May 7, 2024 at 6:57 am

I think bots an easily solved simply requiring some money or putting limits to posts, lets say 1$ per account or 10 comments per day…its something that you can affront but a bot farm not.
But bigger than this problem to me its the content recomendation that can filter any news or use reinforcement learning to train a group of people, over years, to do whatever the algorithm has in his reward function.

Report comment

Reply
1. Kevin says:
  
  May 7, 2024 at 7:35 am
  
  I think that would have the opposite effect; big bot farms with corporate or nation-state backing can easily maintain a spam budget, while actual humans would decide it’s not worth it. “Money-is-speech” is already a problem in the US. Don’t make it worse.
  
  Report comment
  
  Reply
  1. limroh says:
    
    May 8, 2024 at 1:51 am
    
    I’m just waiting on the NRA to take that “spending/donating(?) money is free speech” nonsense a step further and make shooting (people) an expression of free speech…
    
    Report comment
    
    Reply
2. TG says:
  
  May 7, 2024 at 1:59 pm
  
  You need to think adversarially with ideas like this. I don’t think you’re trying hard enough to consider abuse vectors. Also, that implementing this rule unevenly would insta-kill any platform which tried it and cause a diaspora of users to platforms which didn’t. I don’t see how you’d enforce it globally.. Similar to ideas for “digital ID”
  
  Report comment
  
  Reply
Beaker says:

May 7, 2024 at 8:03 am

Dead Internet Theory isn’t funny now, is it?

Report comment

Reply
1. TG says:
  
  May 7, 2024 at 1:22 pm
  
  https://www.youtube.com/watch?v=KpbLphGX8P0
  It wouldn’t matter if there were no bots
  
  Report comment
  
  Reply
Edgerock says:

May 7, 2024 at 8:14 am

Sarcasm, comedy, innuendo… three things that I believe are currently irreplicable by AI/LLMs.

Report comment

Reply
1. TG says:
  
  May 7, 2024 at 1:49 pm
  
  I just asked one to say something sarcastic and it was shockingly unfunny. Could use more experimentation on other systems.
  
  Report comment
  
  Reply
  1. ian 42 says:
    
    May 7, 2024 at 3:25 pm
    
    no, you probably asked a real american.. They tend to not get sarcasm.
    
    Report comment
    
    Reply
mp says:

May 7, 2024 at 10:06 am

I worked on IT infra for a company that ran Imperva web-app-firewalls and they are terrible.
Separately Imperva-using-sites freak out when accessed via browsers configured to be security conscious.
Anything Imperva is bad and untrustworthy.

Report comment

Reply
Alex says:

May 7, 2024 at 10:19 am

How can i tell if i’m a bot?

Report comment

Reply
Lee says:

May 7, 2024 at 11:29 am

In the past couple of years i beleive its more like 75% of the internet is crawling with bots.

Report comment

Reply
TG says:

May 7, 2024 at 1:46 pm

“This is worrisome due to how much of today’s opinions are formed and reinforced on e.g. ‘social media’ websites, while more and more comments, images and even videos are manipulated or machine-generated.”

This has always been the role of media, to form, reinforce, and manipulate public opinions. To create the illusion of a herd consensus and put pressure on the mind. Or to “inform,” if you are being overly optimistic. Disinfo and misinfo rarely have any bearing on what is provably true or false, but merely what is antagonistic.

The internet is frightening because for the first time it decentralizes control of this system. Just as during any power shift, this will lead to a reaction by the old guard, trying to scrape back their control and reinforce it, which we have been seeing the past decade or so.

Nick Land is a joke until you realize that this kind of decentralized chaos machine samizdat is actually going on.

Herd consensus has always been an absolutely terrible way to perform decision-making, but it’s strongly encoded in psychology. So media allows a far smaller group of (preferably competent) leaders to simulate it instead—it has worked this way for many centuries. Freelancer or rogue leaders using bot armies (or developing nations with mobile phones) being able to mimic this behavior is interesting. “Interesting” in a very value-neutral way, of course.

Report comment

Reply
Hirudinea says:

May 7, 2024 at 1:48 pm

I wonder if the bots will miss us when we’re gone?

Report comment

Reply
1. DoubleЖ says:
  
  May 8, 2024 at 12:24 pm
  
  “And Web itself, when it woke at dawn Would scarcely know that we were gone”.
  
  Report comment
  
  Reply
Orzel says:

May 10, 2024 at 8:32 am

Shameless plug, that’s how i deal with those : https://freehackers.org/orzel/botfreak
Reports from bad behaviour are aggregated on a database then shared with all servers for blocking at IP level. Bundled with import from few common internet “black list”.

Report comment

Reply

Hackaday

Imperva Report Claims That 50% Of The World Wide Web Is Now Bots

50 thoughts on “Imperva Report Claims That 50% Of The World Wide Web Is Now Bots”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

Picking A CRC

The Merits Of Comment-Driven Development As Counterweight To TDD

NASA Announces Artemis III Crew And Ambitious Goals

Revisiting Using AI Coding Assistants: You’re Holding It Wrong Edition

Hunting Submarines Via Gravity Is A Tough Errand

Our Columns

Hackaday Links: June 14, 2026

Patterns Everywhere

Hackaday Podcast Ep 373: GPS, Danger In Space, And Robby The Robot

This Week In Security: Microsoft On Microsoft, Register Your Domains, Linux On ARM, And FreeBSD Joins The File Cache Club

FLOSS Weekly Episode 870: Open Source Gardening

50 thoughts on “Imperva Report Claims That 50% Of The World Wide Web Is Now Bots”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns