Defcon Day 1 – Lost In Translation – Christian Grothoff

July 29, 2005

Steganography is the art of hiding things in plain sight. When done correctly, an observer shouldn’t be able to tell that there is a hidden message at all, as opposed to cryptography, where it is obvious that something is hidden. To do this using text you usually need a large piece of source material, say all of the works of Shakespeare. Since these works are known to most people, steg can usually be broken using statistical analysis.

Christian’s solution is to use machine translated (MT) texts as the source material. It is hard to make a computer generate consistant semantically and rhetorically correct texts that mimic the original is very difficult. The technique presented here uses MT texts because translation errors are expected and common.

The source text does not even have to be secret for this technique. It begins by running the text through several MT engines, e.g. Babelfish. To increase the number of possible translations, each one is then run through another algorithm that creates more permutations using word replacement and other techniques. These texts are then checked sentence by sentence to confirm that they are still statistically close to the original translation, so that each candidate still looks like a probable translation.
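To make the pipeline a bit more concrete, here is a minimal sketch of the "generate variants, then filter" step. Everything in it is an assumption for illustration: the `SUBSTITUTIONS` table, the function names, and the toy unigram scorer are made up, and the real system's statistical check is certainly more sophisticated. The MT engines themselves are left out; the sketch starts from translations you already have.

```python
import math
from collections import Counter

# Hypothetical substitution table; a real system would draw on a thesaurus
# or similar resource. These entries are made up for illustration.
SUBSTITUTIONS = {"big": ["large", "huge"], "house": ["home"]}


def variants(sentence):
    """Yield the sentence itself, then every single-word replacement of it."""
    words = sentence.split()
    yield sentence
    for i, word in enumerate(words):
        for alt in SUBSTITUTIONS.get(word, []):
            yield " ".join(words[:i] + [alt] + words[i + 1:])


def make_scorer(corpus):
    """Build a toy scorer: mean log-probability per word under a smoothed
    unigram model (an assumed stand-in for the real statistical check)."""
    counts = Counter(word for line in corpus for word in line.split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot for unseen words

    def score(sentence):
        words = sentence.split()
        return sum(math.log((counts[w] + 1) / (total + vocab)) for w in words) / len(words)

    return score


def candidates(base_translations, score, tolerance=0.1):
    """Keep each base translation plus the variants that score close to it."""
    kept = []
    for base in base_translations:
        base_score = score(base)
        for cand in variants(base):
            if cand not in kept and abs(score(cand) - base_score) <= tolerance:
                kept.append(cand)
    return kept


corpus = ["the big house is old", "the large house is old", "a big home"]
for sentence in candidates(["the big house is old"], make_scorer(corpus)):
    print(sentence)
```

In this toy run the "huge" variant is dropped because that word never appears in the corpus, so its score drifts too far from the base translation. The other variants survive and become extra choices for the encoder.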

At this point the message is encoded using Huffman tree encoding. Once this is complete, some post-processing error insertion can be applied. This takes advantage of errors that usually appear in MT: misused articles, misused prepositions, and less commonly known words left untranslated. There’s even a technique called “semantic substitution”. Here’s an example: translate a word from English (EN) to German (DE), then translate that word back to EN, and then back to DE again. If the resulting DE word is a possible translation of the original EN word, they’ll use that DE word. This roundabout translation isn’t as clear to statistical analysis as one-to-one substitution.
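Here is one way the Huffman step could work, as a rough sketch rather than the actual implementation from the talk. The assumption is that each source sentence has a set of weighted candidate translations, a Huffman tree is built over them, and the sender picks the candidate whose codeword matches the next hidden bits. The receiver rebuilds the same trees and reads the codewords back. The function names, the integer weights, and the idea that the message length is agreed in advance are all assumptions.

```python
import heapq
from itertools import count


def huffman_codes(weighted):
    """Map each candidate to a prefix-free bit string; heavier (more
    probable) candidates get shorter codes."""
    if len(weighted) == 1:
        return {next(iter(weighted)): ""}  # a lone candidate carries no bits
    tie = count()  # unique tie-breaker so both sides build the same tree
    heap = [(w, next(tie), {c: ""}) for c, w in sorted(weighted.items())]
    heapq.heapify(heap)
    while len(heap) > 1:
        w0, _, left = heapq.heappop(heap)
        w1, _, right = heapq.heappop(heap)
        merged = {c: "0" + code for c, code in left.items()}
        merged.update({c: "1" + code for c, code in right.items()})
        heapq.heappush(heap, (w0 + w1, next(tie), merged))
    return heap[0][2]


def embed(bits, sentences):
    """Pick one candidate per sentence so the codewords spell out `bits`.
    `sentences` is a list of {candidate: weight} dicts."""
    pos, chosen = 0, []
    for weighted in sentences:
        codes = huffman_codes(weighted)
        longest = max(len(code) for code in codes.values())
        # Zero-pad once the message runs out so exactly one codeword matches.
        window = bits[pos:pos + longest].ljust(longest, "0")
        pick = next(c for c, code in codes.items() if window.startswith(code))
        chosen.append(pick)
        pos += len(codes[pick])
    if pos < len(bits):
        raise ValueError("cover text too short for message")
    return chosen


def extract(chosen, sentences, n_bits):
    """Rebuild the trees, concatenate the codewords, and drop the padding."""
    bits = "".join(huffman_codes(w)[c] for c, w in zip(chosen, sentences))
    return bits[:n_bits]


sentences = [{"a": 2, "b": 1, "c": 1}, {"x": 1, "y": 1}, {"p": 4, "q": 3, "r": 2, "s": 1}]
stego = embed("1011", sentences)
print(stego, extract(stego, sentences, 4))
```

Because likely candidates get short codewords, the most natural-looking translations are chosen most often, at the cost of carrying fewer bits per sentence. That fits the low bitrate of the method.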

There are a couple of disadvantages to this method of steg: the low bitrate and the fact that you have to transmit both the source and the translated text. There are also some attacks that expose this method. If the same sentence appears twice in a text and is translated two different ways, it would set off a red flag. So would inconsistent machine mistakes, such as using the word “foots” in one place and “feets” in another. If someone developed a large statistical model of all MT systems, it would be easy to see that the steg doesn’t fit the mold, but the steg could also use this model to make sure it fits (an arms race).
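The repeated-sentence attack is simple enough to sketch. Since the source and the translation are both transmitted, an observer can line them up and look for a source sentence that came out two different ways. The snippet below assumes the sentences are already aligned into (source, translation) pairs; the function name and the sample sentences are made up for illustration.

```python
from collections import defaultdict


def inconsistent_repeats(pairs):
    """Return each source sentence that was translated in more than one way,
    mapped to the sorted list of its distinct translations."""
    seen = defaultdict(set)
    for source, translation in pairs:
        seen[source.strip()].add(translation.strip())
    return {src: sorted(ts) for src, ts in seen.items() if len(ts) > 1}


pairs = [
    ("Das Haus ist alt.", "The house is old."),
    ("Es regnet.", "It rains."),
    ("Das Haus ist alt.", "The home is old."),  # same source, different output
]
print(inconsistent_repeats(pairs))
```

A deterministic MT engine should give the same output for the same input, so any hit here is suspicious. An encoder could defend against it by remembering which candidate it picked for each source sentence and reusing it, giving up the bits that sentence would have carried.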

The website has a generator on it if you want to play around.


6 thoughts on “Defcon Day 1 – Lost In Translation – Christian Grothoff”

windwaker says:

July 29, 2005 at 11:53 pm

I could have sworn it was hiding files in images, but it varies.

Plus, in some states, steganography is illegal because it’s so safe.

Dan says:

July 30, 2005 at 3:04 am

not to offend anyone or be a jerk, but isn’t it “stenography”? or are “steganography” and “stenography” interchangeable?

Drakonite says:

July 30, 2005 at 5:10 am

not to offend anyone ;) but “stenography” and “steganography” are two different things.

“stenography” means a method of writing rapidly (shorthand) or the act or art of writing in shorthand.

“steganography” means, as the text mentions, hiding a secret piece of information inside of a separate piece of information so as to conceal the secret information.

Dave says:

July 31, 2005 at 10:50 am

not to offend anyone or be a jerk, but isn’t it “stenography”? or are “steganography” and “stenography” interchangeable?

Im offended

Orphrey says:

August 1, 2005 at 3:00 pm

You do realize that there is something stegoed into that post, right? RIGHT?

I mean, you don’t become a popular blogger with sentences like “It is hard to make a computer generate consistant semantically and rhetorically correct texts that mimic the original is very difficult” (Welcome to the department of redundancy department).

And the post is about stegoing texts into other, not very long texts.

First one to crack it wins big accolades.

Eliot Phillips says:

August 2, 2005 at 11:14 am

Yeah, the hidden message is “I’m typing this while on the hallway floor at Defcon and Vince just put us on the wall of sheep, damnit”

I did read that sentence over and over again and still didn’t catch it, grr.
