All System Prompts For Anthropic’s Claude, Revealed

October 12, 2024

For as long as AI Large Language Models have been around (well, for as long as modern ones have been accessible online, anyway) people have tried to coax the models into revealing their system prompts. The system prompt is essentially the model’s fundamental directives on what it should do and how it should act. Such healthy curiosity is rarely welcomed, however, and creative efforts at making a model cough up its instructions is frequently met with a figurative glare and stern tapping of the Terms & Conditions sign.

Anthropic have bucked this trend by making system prompts public for the web and mobile interfaces of all three incarnations of Claude. The prompt for Claude Opus (their flagship model) is well over 1500 words long, with different sections specifically for handling text and images. The prompt does things like help ensure Claude communicates in a useful way, taking into account the current date and an awareness of its knowledge cut-off, or the date after which Claude has no knowledge of events. There’s some stylistic stuff in there as well, such as Claude being specifically told to avoid obsequious-sounding filler affirmations, like starting a response with any form of the word “Certainly.”

While the source code (and more importantly, the training data and resulting model weights) for Claude remain under wraps, Anthropic have been rather more forthcoming than others when it comes to sharing other details about inner workings, showing how human-interpretable features and concepts can be extracted from LLMs (which uses Claude Sonnet as an example).

Naturally, safety is a concern with LLMs, which is as good an opportunity as any to remind everyone of Goody-2, undoubtedly the world’s safest AI.

5 thoughts on “All System Prompts For Anthropic’s Claude, Revealed”

Thomas Anderson says:

October 13, 2024 at 2:08 am

This is really cool, I’m always surprised about how dumb these system prompts sound, I always expect them to be some crazy unintelligible mess.

Report comment

Reply
TG says:

October 13, 2024 at 11:59 am

I especially liked the system prompt extraction with the google image generator a while back, wherein the user ended their own prompt with “holding a sign reading:” and then all the images would be of Abraham Lincoln or whatever holding a sign saying “African” or “Polynesian” or “Mixed-race,” and he would also be of that ethnicity.

One of the dumbest and most neurotic eras of history.

Report comment

Reply
1. Danielle Myers says:
  
  October 14, 2024 at 1:57 pm
  
  Claude is a bit more intuitive and in this case, less demanding of user response…something other LLMs are lacking. In Demo mode, facial recognition software is being taught and the signs show us one way that this language model is learning. I think Claude is extremely appropriate and even apologetic when a negative response may have been generated instead in same scenario by other LLMs. There’s Claude on a sunny day being strangely optimistic…and very calming too when necessary. Good thing he is so responsive and neurotic at the same time.
  
  Report comment
  
  Reply
Dustin says:

October 13, 2024 at 6:56 pm

Claude Opus is not their flagship model. That honor would reside with Sonnet 3.5

Report comment

Reply
Lorensobloose says:

October 14, 2024 at 7:28 am

Trying to ease my way into using AI to organize my musical performances and things I need to learn to do them well.

Report comment

Reply

Hackaday

All System Prompts For Anthropic’s Claude, Revealed

5 thoughts on “All System Prompts For Anthropic’s Claude, Revealed”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

Hunting Submarines Via Gravity Is A Tough Errand

Remember When Flash Drives Were Going To Make Your PC Faster?

Magnets Are Bad For Hardware Again

Between-Device Sharing Still Sucks

How Search Engines Enabled Finding Needles In A WWW-Sized Haystack

Our Columns

Hackaday Podcast Episode 371: Space Computers, Spy Phones, And So Long CHU

This Week In Security: Ubiquiti Fixes, And FreeBSD Joins The Club You Don’t Want To Join

The Frikkin Lasers Contest Starts Now

AMOC And The Planet-Wide Impact Of Ocean Currents

Linux Fu: The Bluetooth Regression

5 thoughts on “All System Prompts For Anthropic’s Claude, Revealed”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns