BitTorrent For Language Models

In the old days of the Internet, FTP was sufficient for downloading the occasional file. But with the widespread use of computer audio and video, it was easy to swamp an FTP server so — eventually — BitTorrent was born. The idea was you would download bits and pieces of a file from different places and, in theory, people would download bits and pieces that you have if they need them. Now Petals wants to use this same method with language models. These AI language models are all the rage, but they take significant computer resources. The idea behind Petals is like BitTorrent. You handle a small part of the model (about 8 gigabytes which is small compared to the 352 gigabytes required), and other people have other parts.

Of course, if you are privacy-minded, that means that some amount of your data is going out to the public, but for your latest chatbot experiments, that might not be a big problem. You can install Petals in an Anaconda environment or run a Docker image if you don’t want to set up anything. If you just want to access the distributed network’s chatbot based on BLOOMZ-176B, you can do that online.

Since the network processes tokens, a chat with the device is a little jerky, with a noticeable pause between words. It did say you can increase speed by connecting your GPU. Here’s a transcript:

Hackaday: How many nodes host you?
AI: Two thousand six hundered and ninety-three
Hackaday: Do you know about hackaday.com?
AI: Yes, of course.
Hackaday: What is it?
AI: It's a website for hackers and geeks.
Hackaday: How would you write an Arduino program to blink an LED?
AI: You have to use the Arduino language and create a sketch that can load onto the Arduino board.
Hackaday: Can you write that program?
AI: No

So we suppose our jobs are safe for now. Still, if you want to experiment with your own custom models based on BLOOM or BLOOMZ, this is a quick way to start without needing a server farm. The tutorial notebook will get you started. If you want to dig deep, there is a whitepaper.

We’ve been worried about model security, so having it distributed will make some attacks easier and some harder, we imagine. We don’t know if we trust any of them to run a circular saw blade around our yard yet, though.

6 thoughts on “BitTorrent For Language Models”

I think there’s a market for annoying chatbots. E.g.:

Hackaday: Do you know about hackaday.com?
AI: Yes, of course.
Hackaday: What is it?
AI: A genderless pronoun.
Hackaday: No, Hackaday.
AI: What about it?
Hackaday: What is Hackaday?
AI: Don’t you know? It’s your name.
…

Jacob Christ says:

March 22, 2023 at 6:49 am

Duh.

Report comment

Reply
Peter says:

March 23, 2023 at 5:34 pm

That just sounds like Cleverbot.

Report comment

Reply

The best description seems to be a decentralized pipeline.

Next step is to run this in the EVM

This is unnecessary. Cloud architecture is insanely cheap to lease.

Please be kind and respectful to help make the comments section excellent. (Comment Policy)

Bob says:

March 22, 2023 at 5:15 am

I think there’s a market for annoying chatbots. E.g.:

Hackaday: Do you know about hackaday.com?
AI: Yes, of course.
Hackaday: What is it?
AI: A genderless pronoun.
Hackaday: No, Hackaday.
AI: What about it?
Hackaday: What is Hackaday?
AI: Don’t you know? It’s your name.
…

Report comment

1. Jacob Christ says:
  
  March 22, 2023 at 6:49 am
  
  Duh.
  
  Report comment
  
2. Peter says:
  
  March 23, 2023 at 5:34 pm
  
  That just sounds like Cleverbot.
  
  Report comment
  
Gravis says:

March 22, 2023 at 12:31 pm

The best description seems to be a decentralized pipeline.

Report comment

Ron says:

March 23, 2023 at 6:00 am

Next step is to run this in the EVM

Report comment

ANDRE JOHNSON says:

March 23, 2023 at 3:54 pm

This is unnecessary. Cloud architecture is insanely cheap to lease.

Report comment

Hackaday

BitTorrent For Language Models

6 thoughts on “BitTorrent For Language Models”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

Hacking When It Counts: DIY Prosthetics And The Prison Camp Lathe

Dearest C++, Let Me Count The Ways I Love/Hate Thee

Personal Reflections On Immutable Linux

Crunching The News For Fun And Little Profit

The End Of The Hackintosh Is Upon Us

Our Columns

Robots Want The Jobs You Can’t Do

Hackaday Links: July 13, 2025

Trickle Down: When Doing Something Silly Actually Makes Sense

Hackaday Podcast Episode 328: Benchies, Beanies, And Back To The Future

This Week In Security: Bitchat, CitrixBleed Part 2, Opossum, And TSAs

6 thoughts on “BitTorrent For Language Models”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns