Reverse Engineering The ARM ALU

December 21, 2015

[Dave] wanted to learn more about the ARM architecture, so he started with an image of the ARMv1 die. If you’ve had some experience looking at CPU dies, you can make some pretty good guesses about which parts of the chip perform which functions. [Dave], however, went further. He reverse engineered the entire ALU, about 2,200 transistors’ worth.

From the image, he worked out the transistor structures and how they map to gates. Since the ARM is a 32-bit processor, there are 32 separate slices of the ALU, each with about 70 transistors (see below). There are slight differences between certain slices to support zero and carry propagation. A PLA generates control signals that route data through the ALU to perform the desired operation.
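
To make the bit-slice idea concrete, here is a minimal Python sketch of a 32-slice ALU with a rippling carry and a zero flag. It is a simplified model, not [Dave]'s reverse-engineered circuit: the `Control` fields, the operation names, and the simple ripple carry are all assumptions made for illustration and are not taken from the ARM1 PLA.

```python
from typing import NamedTuple

WIDTH = 32  # one slice per bit of the 32-bit datapath


class Control(NamedTuple):
    """Hypothetical control signals (not the real ARM1 PLA outputs)."""
    op: str         # "and", "or", "xor", or "add"
    invert_b: bool  # complement operand B inside each slice
    carry_in: int   # carry fed into slice 0


def alu_slice(a: int, b: int, carry: int, ctl: Control) -> tuple[int, int]:
    """Compute one bit position; returns (result_bit, carry_out)."""
    if ctl.invert_b:
        b ^= 1
    if ctl.op == "and":
        return a & b, carry      # logical ops pass the carry through
    if ctl.op == "or":
        return a | b, carry
    if ctl.op == "xor":
        return a ^ b, carry
    if ctl.op == "add":
        total = a + b + carry    # full adder
        return total & 1, total >> 1
    raise ValueError(f"unknown op: {ctl.op}")


def alu(a: int, b: int, ctl: Control) -> tuple[int, int, bool]:
    """Chain the slices; returns (result, carry_out, zero_flag)."""
    result = 0
    carry = ctl.carry_in
    for i in range(WIDTH):
        bit, carry = alu_slice((a >> i) & 1, (b >> i) & 1, carry, ctl)
        result |= bit << i
    return result, carry, result == 0


ADD = Control("add", False, 0)
SUB = Control("add", True, 1)    # a - b computed as a + ~b + 1
```

In this model the same 32 slices do every operation; only the control signals change. Subtraction, for example, reuses the adder by inverting B and forcing the carry into slice 0.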

[Dave] was inspired by the visual6502 project as well as [Ken Shirriff’s] 8085 reverse engineering, both of which we’ve covered in the past. If FET logic isn’t to your liking, you could always try Minecraft.

21 thoughts on “Reverse Engineering The ARM ALU”

J says:

December 21, 2015 at 1:50 am

Dave certainly has experience at looking at a CPU die… In the year twooo-thousand, in the year twooo-thousaaaaaaand and one

Alphatek says:

December 21, 2015 at 2:09 am

Superb. I’d missed the visual6502/ARM guys’ project too. Still my all-time favourite CPU (well, maybe ARM2 – mul is useful…)

jwrm22 says:

December 21, 2015 at 2:21 am

Interesting write-up. Silicon chips are a work of art. Reverse engineering on the silicon level likewise.

Acorn doesn’t produce any chips themselves. They sell the IP. It’s up to the fabricators to turn the IP into real chips. Some of the IP is available in VHDL or as a “blackbox”. I’ve used ARM cores in an FPGA before. I wonder whether it would be easier to rip the files generated for the FPGA than to reverse engineer the silicon. The output will change with the optimisation settings… So it will not be exactly the same.

Other than that, writing a behavioural model in VHDL would give the same functionality.

1. Artenz says:
  
  December 21, 2015 at 2:35 am
  
  FPGA implementation will be completely different though.
  
2. Julian Skidmore says:
  
  December 21, 2015 at 11:47 am
  
  Arm1, Arm2, Arm250 (used in the Archimedes A3020) all the way to Arm600 precede Arm Ltd and weren’t licensed designs.
  
  https://en.m.wikipedia.org/wiki/List_of_ARM_microarchitectures
  
Artenz says:

December 21, 2015 at 2:55 am

Does anybody know of any ARM1/ARM2 documentation?

1. Alphatek says:
  
  December 21, 2015 at 3:46 am
  
  At what level? There’s plenty of assembler-level documentation around. Not sure about anything lower than that, as at the time it was all Acorn proprietary info.
  
  1. Artenz says:
    
    December 21, 2015 at 3:56 am
    
    At the assembly level, including details on all instructions. Something like the ARM Architecture Reference Manual, but for v1/v2.
    
    1. Alphatek says:
      
      December 21, 2015 at 4:26 am
      
      Online, try http://www.riscos.com/support/developers/asm/
      
    2. Sweeney says:
      
      December 21, 2015 at 5:53 am
      
      This any use to you?
      http://morrow.ece.wisc.edu/ECE353/arm_reference/ddi0100e_arm_arm.pdf
      
      1. Artenz says:
        
        December 21, 2015 at 6:09 am
        
        Thanks. I had looked at that before, but looking again, I see there’s a useful chapter called “Overview of the 26 bit architectures” that explains the v1/v2 differences.
        
2. Julian Skidmore says:
  
  December 21, 2015 at 11:53 am
  
  I have the “Dab Hand Guide to ARM assembler” by Dabs Press. It just covers the simpler 26-bit addressed ARM.
  
  The earlier ARM CPUs had no Thumb mode and only one spare set of registers for FIQ interrupts; they supported only byte and long-word addressing (no half-word access); and the condition codes were combined with the PC, in the top 6 bits, so that a return would always restore them.
  
Yoda says:

December 21, 2015 at 3:45 am

Isn’t it illegal to do this? IP?

1. daid303 says:
  
  December 21, 2015 at 7:03 am
  
  Depends on why you do it. Could be legal for certain reasons in some countries.
  
2. Artenz says:
  
  December 21, 2015 at 8:04 am
  
  The chip layout file was made available in agreement with ARM.
  
  http://blog.visual6502.org/2015/11/the-visual-arm1.html
  
Sergio Costas says:

December 21, 2015 at 4:09 am

Really surprised at how big the barrel shifter is. It occupies a lot of silicon. It must help a lot in increasing the code density…

1. Artenz says:
  
  December 21, 2015 at 4:20 am
  
  With 32 bit operands, you can’t really afford not to have a barrel shifter.
  
  1. Alphatek says:
    
    December 21, 2015 at 4:29 am
    
    You say that, but barrel-shift for free on every instruction was a new (and very useful) thing.
    
    To wring real speed out of the ARM1, you also needed linear code (no cache), so barrel-shift and conditionalisation on every instruction were a major help.
    
jack laidlaw says:

December 21, 2015 at 3:15 pm

This got me thinking a little bit: does anyone know if there is an open hardware processor chip out there? That would be amazing!

1. Dannick Pomerleau says:
  
  December 21, 2015 at 10:31 pm
  
  RISC-V
  
xorpunk says:

December 21, 2015 at 3:40 pm

The day it’s economical to map the states of silicon is the day software isolation and hiding keys in hardware will no longer work.
