Vibe Check: False Packages A New LLM Security Risk?

A flowchart demonstrating the exploit described.

Lots of people swear by large language model (LLM) AIs for writing code. Lots of people swear at them. Still others may be planning to exploit their peculiarities, according to [Joe Spracklen] and other researchers at UTSA. At least, the researchers have found a potential exploit in ‘vibe coding’.

Everyone who has used an LLM knows they have a propensity to “hallucinate”– that is, to go off the rails and create plausible-sounding gibberish. When you’re vibe coding, that gibberish is likely to make it into your program. Normally, that just means errors. If you are working in an environment that uses a package manager, however (like npm in Node.js, PyPI in Python, or CRAN in R), that plausible-sounding nonsense code may end up calling for a fake package.
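To make the failure mode concrete, here is a purely hypothetical sketch of what that can look like in Python. The package name is invented for illustration and is not, as far as we know, a real project on PyPI.

```python
# Hypothetical vibe-coded snippet. "fastjson_utils" is a made-up package name,
# standing in for the kind of plausible-sounding dependency an LLM can invent.
# Today the import simply fails; if someone registers that name on PyPI with
# malicious code, this same snippet would quietly pull in the attacker's package.
try:
    from fastjson_utils import parse_fast  # hallucinated dependency
except ImportError:
    print("fastjson_utils is not installed (and may not exist at all)")
```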

A clever attacker might be able to determine what sort of false packages the LLM is hallucinating, and register them on the public repository as a vector for malicious code. It’s more likely than you think– while CodeLlama was the worst offender, the most accurate model tested (ChatGPT-4) still generated these false packages at a rate of over 5%. The researchers propose a number of mitigation strategies in their full paper, but this is a sobering reminder that an AI cannot take responsibility. Ultimately it is up to us, the programmers, to ensure the integrity and security of our code, and of the libraries we include in it.
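One cheap line of defense (a sketch of our own, not one of the paper’s mitigations) is to check that every dependency an LLM suggests actually exists on the registry before installing anything. The example below assumes the public PyPI JSON API, which returns a 404 for unknown package names.

```python
# Minimal sketch: verify LLM-suggested package names against PyPI before installing.
# Uses only the standard library and the public endpoint https://pypi.org/pypi/<name>/json

import sys
import urllib.error
import urllib.request

def exists_on_pypi(name: str) -> bool:
    """Return True if `name` is a registered package on PyPI, False if unknown."""
    url = f"https://pypi.org/pypi/{name}/json"
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.status == 200
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return False
        raise  # rate limit, outage, etc. -- don't guess, surface the error

if __name__ == "__main__":
    # Usage: python check_deps.py requests numpy some-suspicious-name
    for pkg in sys.argv[1:]:
        verdict = "found on PyPI" if exists_on_pypi(pkg) else "NOT FOUND -- do not install blindly"
        print(f"{pkg}: {verdict}")
```

Note that this only catches names nobody has registered yet; once an attacker has squatted a hallucinated name, the existence check passes, so you still have to vet what a package actually is before trusting it.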

We just had a rollicking discussion of vibe coding, which some of you seemed quite taken with, while others agreed that ChatGPT is the worst summer intern ever. Love it or hate it, it’s likely this won’t be the last time we hear of security concerns brought up by this new method of programming.

Special thanks to [Wolfgang Friedrich] for sending this into our tip line.

18 thoughts on “Vibe Check: False Packages A New LLM Security Risk?”

      1. I would imagine it is possible, though not easy, to create a Turing-complete machine using known packages. That doesn’t solve the problem; it obfuscates it, which may be even worse because it appears secure.

  1. LLMs are not going far enough. Humans need libraries and levels of abstraction to keep things simple and manageable. But if we were faster and could keep more in our heads, then we would not need libraries. Just write everything in machine code right there and then, or whatever the lowest common deployed level of technology is.

    If the vibe coder is no longer checking the output, then why bother with programming languages and libraries?

  2. Michael Townsen Hicks, James Humphries, and Joe Slater, in a paper in “Ethics and Information Technology,” suggest that the term “hallucination” is an inaccurate description of LLM output. “We … argue that describing A.I. misrepresentations as bullshit is both a more useful and more accurate way of predicting and discussing the behaviour of these systems.”

    1. I’ve read that paper; it makes some good points. The language we use to describe things has pointed subconscious effects on how we perceive things.

      Link to paper for those who are interested:
      https://link.springer.com/article/10.1007/s10676-024-09775-5

      On the language note, “Politics and the English Language”, a short-ish essay by Orwell, has some interesting analysis of how vagueness in language is easily exploited. Also worth a good read.

    2. I’m not a fan of the term “hallucination,” either– personally, I prefer “confabulation”, which is just a polite and high-falutin’ way to say bullshitting. That said, “hallucination” is the term of art employed by the researchers, so we use it here to avoid confusion.

  3. I think we will eventually need some gate-keeping for the package repositories. It seems every few weeks there is a newsworthy typosquatting or hijacked package on npm or pypi. When was the last time a malicious package got into Debian?
