ForceGen: Using A Diffusion Model To Help Design Novel Proteins

Although proteins are composed out of only a small number of distinct amino acids, this deceptive simplicity quickly vanishes when considering the many possible sequences across a protein, not to mention the many ways in which a single 1D protein sequence can fold into a 3D protein shape with a specific functionality. Although natural evolution has done much of the legwork here already, figuring out new sequences and their functionality is a daunting task where increasingly deep learning algorithms are being applied. As [Bo Ni] and colleagues report in a research article in Science Advances, the hardest challenge is designing a protein sequence based on the desired functionality. They then demonstrate a way to use a generative model to speed up this process.

They set out to design proteins with specific mechanical properties, for which they used the known unfolding characteristics of various protein sequences to train a diffusion model. This approach is thus more akin to the technology behind image generation algorithms like DALL-E than LLMs. Using the trained diffusion model it was then possible to generate likely sequences of which the properties could then be simulated, with favorable results.

As a large data set aid, such a diffusion model could conceivably be very useful in fields even beyond protein synthesis, automating tedious tasks and conceivably speeding up discoveries.

One thought on “ForceGen: Using A Diffusion Model To Help Design Novel Proteins”

I was under the impression that a deep learning system had already “figured out” protein folding and now it’s being analyzed to identify the logic behind it’s shape predictions. Whatever the case, we really need a deep learning system that can decode deep learning systems. :)

Please be kind and respectful to help make the comments section excellent. (Comment Policy)

Gravis says:

February 20, 2024 at 10:08 am

I was under the impression that a deep learning system had already “figured out” protein folding and now it’s being analyzed to identify the logic behind it’s shape predictions. Whatever the case, we really need a deep learning system that can decode deep learning systems. :)

Report comment

Hackaday

ForceGen: Using A Diffusion Model To Help Design Novel Proteins

One thought on “ForceGen: Using A Diffusion Model To Help Design Novel Proteins”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

The Curse Of The Everything Device

What One-Winged Squids Can Teach The Airship Renaissance

How Safe Are Old Airbags, Anyway?

Ask Hackaday: Do You Have A Dead Man’s Switch?

The Requirements Of AI

Our Columns

Building An Interactive Climbing Wall

Keebin’ With Kristina: The One With The Uni-body That Does The Splits

Tech In Plain Sight: Projection Clocks

Hackaday Links: February 22, 2026

In Praise Of The Proof Of Concept

One thought on “ForceGen: Using A Diffusion Model To Help Design Novel Proteins”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns