Code So Sneaky You Have To Explain It

June 15, 2015

Your mission, should you choose to accept it, is to code a program that leaks information to the user but does so in a way that can’t be discovered in a code audit. This was the challenge for the 2014 Underhanded C contest; the seventh time they’ve held the event. [Richard Mitton] took part and wrote a very entertaining entry. He didn’t win, but he did just share the details of his super-sneaky code.

The challenge set out for the Citizen-Four-like coders set up a scenario where they were writing a program for a shady company (or sketchy government entity) which makes completely secret decisions based on publicly posted social media. The twist is they were tasked with getting code past an audit that leaked the decisions made by this program to the users being secretly observed.

Above is the core trick which [Richard] used after taking inspiration from Heartbleed. The struct assignment has an off-by-one error in it which is shown corrected in the lower code block. This, used in conjunction with malloc and free, allows memory to be used under the guise of storage during the encryption process. Secretly, this same bit of memory is accessed later and leaked to the user being targeted.

Have your own Underhanded C that you’re dying to share? We want to hear about it so send us a tip!

47 thoughts on “Code So Sneaky You Have To Explain It”

eray says:

June 15, 2015 at 5:04 am

The curly braces should all be lined up on the same column. It’s more readable that way. (cue flame war…)

Report comment

Reply
1. voxnulla says:
  
  June 15, 2015 at 5:34 am
  
  Surely you also want each of those pretty brackets to have their own dedicated line?
  
  Report comment
  
  Reply
  1. tekkieneet says:
    
    June 15, 2015 at 5:43 am
    
    Why can’t the open bracket can’t be on the same line as first item in the declaration if you are so concern about a new line?
    
    Anything to make code readable by showing which bracket belongs to for someone else that might want to read the code what should be a priority – at least for the open source projects.
    
    Report comment
    
    Reply
    1. voxnulla says:
      
      June 15, 2015 at 6:41 am
      
      Noooo, you can’t have a bracket sharing the same line as executing code… It would be a terrible row!
      
      Report comment
      
      Reply
    2. tekkieneet says:
      
      June 15, 2015 at 9:12 am
      
      As bad as having the open bracket after this!? That show how inconsistent you are.
      if (condition) {
      
      Report comment
      
      Reply
      1. voxnulla says:
        
        June 16, 2015 at 7:35 am
        
        I am afraid both jokes went over you head! You really didn’t get the “terrible row” pun? Pff.. there is one born every minute I guess.
        
        Report comment
2. mrthekod says:
  
  June 15, 2015 at 5:34 am
  
  That’s how I roll. Irritates the hell out of my pro coder friend when I email him bits of my shonkey messes to put right / laugh at.
  
  Report comment
  
  Reply
3. pk7639 says:
  
  June 15, 2015 at 2:52 pm
  
  I’m with you. Pascal programmers of the world unite!
  
  Report comment
  
  Reply
4. charliex says:
  
  June 15, 2015 at 4:28 pm
  
  you’re evil.
  
  Report comment
  
  Reply
5. neon22 says:
  
  June 15, 2015 at 7:32 pm
  
  weird formatting rules are for people who can’t write python.
  
  Report comment
  
  Reply
6. Blue Footed Booby says:
  
  June 16, 2015 at 5:22 am
  
  Reverse Polish Christmas Tree or bust!
  
  Report comment
  
  Reply
bl says:

June 15, 2015 at 5:04 am

it seems that this example is based on malloc and free not clearing or randomizing mem buffers. I wonder if openbsd would be affected?

because its more efficient to NOT clear ram before you give it to a user, I guess that’s a problem that sneaky people can use. is it still a good idea to give uncleared buffers from the os to the user, in this day and age?

Report comment

Reply
1. Mike Szczys says:
  
  June 15, 2015 at 10:29 am
  
  If you’re worried about data getting out you should explicitly overwrite dummy values before freeing the allocated memory.
  
  Report comment
  
  Reply
  1. simcop2387 says:
    
    June 15, 2015 at 10:56 am
    
    Even then that alone isn’t always enough. Because the memory isn’t touched after being written to before it’s freed the compiler is free to optimize your overwrite out of the program. It’s been one big issue for a number of libraries that need secure storage of data in memory. The only way to do it is with compiler extensions that say “don’t optimize this out or I’ll take you out back and shoot you”, along with telling the kernel this particular page can’t be swapped out for good measure. It’s not a simple problem to solve at the application level.
    
    Report comment
    
    Reply
    1. Pat says:
      
      June 15, 2015 at 12:33 pm
      
      Wait, how would the compiler know it could do that? The writes are to a pointer returned by malloc(), and then passed to free(), and free is prototyped as taking “void *”. Which means free is explicitly saying “I can screw with the stuff you send me” (which, of course, it can, when it passes it to someone else).
      
      Unless this is a specific optimization for malloc/free, the compiler can’t get rid of anything there. It has absolutely no idea that free isn’t going to use that data.
      
      Report comment
      
      Reply
    2. charliex says:
      
      June 15, 2015 at 4:40 pm
      
      i’d like to see a test case of this happening, volatile is a standard C keyword, not a compiler extension.
      
      Report comment
      
      Reply
    3. jas says:
      
      June 17, 2015 at 12:33 am
      
      Yep, it suprised me before – Win API SecureZeroMemory specifically defeats compiler optimisations
      https://msdn.microsoft.com/en-us/library/windows/desktop/aa366877(v=vs.85).aspx
      
      Report comment
      
      Reply
      1. charliex says:
        
        June 17, 2015 at 8:58 am
        
        C11 memset_s is portable and does the same thing.
        
        Report comment
Gravis says:

June 15, 2015 at 5:26 am

seems like a good reason to not use malloc.

Report comment

Reply
1. bl says:
  
  June 15, 2015 at 6:18 am
  
  no, its a good reason to assume that its not cleared and you should memset() it yourself if you care about security.
  
  Report comment
  
  Reply
  1. charliex says:
    
    June 15, 2015 at 4:31 pm
    
    no, it’s a better reason to learn the C API properly, calloc vs malloc, malloc doesn’t necessarily clear, calloc does. memory alloc versus clear alloc.
    
    it’s almost shocking how many people don’t know this simple difference.
    
    Report comment
    
    Reply
    1. Jeremiah says:
      
      June 15, 2015 at 8:48 pm
      
      And things like this are common in C and lead to bugs like HeartBleed.
      
      Report comment
      
      Reply
      1. charliex says:
        
        June 16, 2015 at 12:47 am
        
        Not knowing the language/API is common in a lot of languages, it’s not anything special about C. There are good reasons for the differences between the two. even if you don’t know calloc exists, knowing how malloc works is important.
        
        Report comment
    2. Jim says:
      
      June 16, 2015 at 10:29 pm
      
      And if one feels that isn’t enough, malloc and calloc (and pretty much anything else) can be overloaded with custom versions.
      
      Report comment
      
      Reply
Alan Hightower says:

June 15, 2015 at 5:39 am

Uhm malloc/free are user space convenience implementations, not system calls. And in C++, new keyword also chain to malloc in most libraries. The only way to actually allocate memory in C is through brk/sbrk. And most OSs have an option on whether to pre-initialize newly mapped heap pages. The bug is actually in HR with a hiring manager making an offer to someone who doesn’t know that.

Of course you are going to get memory from malloc you just free’d unless you do due diligence to audit and/or change the implementation or user space heap management for a secure application. And the +1 has nothing to do with anything. The same exploit can be made with or without it.

Report comment

Reply
1. cantido says:
  
  June 15, 2015 at 8:19 am
  
  >The only way to actually allocate memory in C is through brk/sbrk.
  
  :)
  
  >Of course you are going to get memory from malloc you just free’d unless you do due
  >diligence to audit and/or change the implementation or user space heap management
  >for a secure application.
  
  I argued about this with someone when the heartbleed stuff came out; Mixing sensitive data in with the same pools of memory that get handed out to other parts of an application seems like a bad idea(tm) to me and also having sensitive data for many unrelated clients connected via sockets etc floating around in the same address space seems pretty bad too.
  
  Report comment
  
  Reply
  1. Rick says:
    
    June 16, 2015 at 11:03 pm
    
    This is why I was saddened when AMD nerfed the x86 segment registers and flattened the address space with AMD64 extensions (base and limit fields of the segment registers are no longer used). The Intel 32-bit segment registers could be used for some memory jail techniques. Reference: http://www.intel.com/content/dam/www/public/us/en/documents/manuals/64-ia-32-architectures-software-developer-manual-325462.pdf, Chapter 5.
    
    Report comment
    
    Reply
    1. Rick says:
      
      June 16, 2015 at 11:04 pm
      
      Edit: Volume 3, Chapter 5 – Protection
      
      Report comment
      
      Reply
2. Killer Crouton says:
  
  June 15, 2015 at 11:16 pm
  
  Actually brt/sbrk are also user space wrappers. On Linux one calls mmap to allocate pages to the process, on Windows call VirtualAlloc and the like.
  
  Report comment
  
  Reply
  1. Conscat says:
    
    July 21, 2022 at 10:02 pm
    
    Even those are wrappers around assembly code. It is completely impossible to dynamically allocate memory in plain old C.
    
    Report comment
    
    Reply
Paul says:

June 15, 2015 at 7:31 am

I did not spot the error in the first code snippet because there is no error.
The presumed “error” is probably in the code which does something with the buffer.

If “char buffer [140*4]; is supposed to have a meaning to reserve a buffer for for 140 items with a size of 4 characters it’s probably saver to write:
char buffer [ (140 + 1) * 4];

Personally I don’t like these magic velue’s very much and would probably use another typedef and/or a #define for the buffer size.

P.s: When you #define the buf size and also use this #define in the loops which step through your buffer then your’re a bit closer to “self documenting code” and you only have to change the single #define to change the buffer size (limited ram on avr’s etc).

Report comment

Reply
1. Depot says:
  
  June 15, 2015 at 8:06 am
  
  Yeah. You might get 140*5 instead the other way. On the other hand, it wouldn’t hurt to use index 0 in the rest of the code. Save an extra piece of memory too. If you make an array size 140 * 4, you get 560 things in it. I think the compiler would evaluate the math first anyway.
  
  A better practice for making 140 four byte entries would be to make a struct of 4 bytes and then making that into an array. Plus you can union the four bytes into an int and long int for quick practical stuff and still use the individual bytes. Laziness is OK in programmers, but they should have the right kind of functional lazy.
  
  Report comment
  
  Reply
  1. Alan Hightower says:
    
    June 15, 2015 at 8:17 am
    
    You would never get 140*5. C syntax and rules are very clear on order of evaluation with respect to * and + and left to right ordering.
    
    Report comment
    
    Reply
  2. Rollyn01 says:
    
    June 15, 2015 at 11:27 am
    
    I think the 0 index itself is what’s being used to get the info out. All you have to do is let the buffer overflow (or underflow in this case) when the writing takes place and read the unprotected memory (0 index of the current or MSB/LSM of the adjacent array) at your leisure with another program. Actually, it wouldn’t even have to go that far now that I think about it. Though, I could be wrong.
    
    Report comment
    
    Reply
2. charliex says:
  
  June 15, 2015 at 4:58 pm
  
  you guys seem to have missed the point of the competition.
  
  Report comment
  
  Reply
Tom says:

June 15, 2015 at 8:11 am

VB6 never had such problems.

Report comment

Reply
1. voxnulla says:
  
  June 15, 2015 at 8:51 am
  
  That stands to reason. With the myriad other cluster fucks VB6 pioneered, it never go around to it.
  
  Report comment
  
  Reply
  1. Shannon says:
    
    June 15, 2015 at 9:11 am
    
    +3
    
    Report comment
    
    Reply
  2. Rollyn01 says:
    
    June 15, 2015 at 11:15 am
    
    *applauding* Again, again…
    
    Report comment
    
    Reply
schuhumi says:

June 15, 2015 at 9:34 am

One cannot spot this flaw without more context. It could also be the case that the string gets processed with a fixed length. Or maybe the author just needed an 8bit array to put some obscure data in

Report comment

Reply
stardreamer says:

June 15, 2015 at 10:21 am

#lol

Report comment

Reply
Darren says:

June 15, 2015 at 12:55 pm

How am I supposed to know what the array is supposed to be holding. If each item in the array is supposed to be 4 bytes then I would pick a type that explicitly holds 4 bytes. I think doing calculations for buffer lengths is sloppy. Use a const for the length and a proper type.

Report comment

Reply
three_d_dave says:

June 15, 2015 at 5:46 pm

The original article had the best for last: “If you try to debug it using Visual Studio, the bug vanishes! Running a program (even a release-build one) from inside the debugger enables the CRT debug heap, which automatically clears data upon free.”

Report comment

Reply
1. sweethack says:
  
  June 16, 2015 at 2:12 am
  
  This is actually wrong. The CRT heap puts either “FEEEFEEE” for free’d memory and “CCCCCCCC” marker in the last/first bytes of the buffers and then checks for this “magic” number when de/allocating. If it does not match, it asserts. This is smart, but if you happen to store these “magic” value in there, you’ll get an (invalid) assert you can’t get rid of.
  
  Report comment
  
  Reply
mattd says:

June 16, 2015 at 1:51 am

Personally I’d probably do something more nefarious with the compiler and the way that it packs structures.
For instance if you made the time value 64 bits ( because you are ‘worried’ about the 2038 problem ) and you make the compiler align it to 64 bits (because you are ‘worried’ about performance of accessing the time variable when it is non-aligned), then you gain a 2 byte gap between the 140 byte buffer and the 8 byte time, which you can leak info into.

Report comment

Reply
sweethack says:

June 16, 2015 at 2:13 am

BTW, it’s another reason why you should run your software through Valgrind (or AddressSanitizer). It does not take much more time, and this is going to be spot very fast. Whenever the program is reading out of the expected area, Valgrind will complain, and tell you to fix your program

Report comment

Reply
1. Ewald says:
  
  November 23, 2021 at 11:50 pm
  
  👍
  
  Report comment
  
  Reply

Hackaday

Code So Sneaky You Have To Explain It

47 thoughts on “Code So Sneaky You Have To Explain It”

Leave a Reply to voxnullaCancel reply

Search

Never miss a hack

If you missed it

Review: Cherry G84-4100 Keyboard

Creating User-Friendly Installers Across Operating Systems

Ask Hackaday: Solutions, Or Distractions?

PCB Design Review: TinySparrow, A Module For CAN Hacking, V2

Belting Out The Audio

Our Columns

User Serviceable Parts

Hackaday Podcast Episode 349: Clocks, AI, And A New 3D Printer Guy

This Week In Security: Hornet, Gogs, And Blinkenlights

Jenny’s Daily Drivers: Haiku R1/beta5

FLOSS Weekly Episode 858: YottaDB: Sometimes The Solution Is Bigger Servers

47 thoughts on “Code So Sneaky You Have To Explain It”

Leave a Reply to voxnullaCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns