Where The Work Is Really Done – Causal Profiling

Once a program has been debugged and works properly, it might be time to start optimizing it. A common way of doing this is profiling: watching a program execute and measuring how much computing time each step takes. This works well for most programs, but it gets complicated when a process executes on more than one core. A profiler may count time one thread spends idly waiting for a process on another core to finish, giving meaningless results. To solve this problem, a method called causal profiling was developed.

In causal profiling, markers are placed in the code and the profiler measures how fast the program reaches them. Since multiple cores are involved and the profiler can’t actually make the code under test run faster, it does the opposite: it slows everything else down and watches how progress toward the markers changes, which simulates a speedup of the code in question. [Daniel Morsig] took this idea and implemented it in Go, with an example used to demonstrate its effectiveness: speeding up a single process by 95% resulted in a 22% speedup of the entire program, while a regular profiler counted only a 3% increase, far less informative than the causal profiler’s 22% measurement.
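As a concrete illustration, a marker (a “progress point”) can be as simple as an atomic counter bumped at the spot whose throughput you care about. This is a minimal sketch in Go; the names requestsServed and handleRequest are ours for illustration, not taken from [Daniel Morsig]’s implementation or any other tool:

    package main

    import (
        "fmt"
        "sync/atomic"
        "time"
    )

    // requestsServed is the "progress point": a counter bumped at the
    // spot whose throughput the causal profiler tries to predict.
    var requestsServed atomic.Uint64

    func handleRequest() {
        time.Sleep(time.Millisecond) // stand-in for real work
        requestsServed.Add(1)        // marker: one unit of progress
    }

    func main() {
        go func() {
            for {
                handleRequest()
            }
        }()
        before := requestsServed.Load()
        time.Sleep(time.Second)
        fmt.Println("throughput:", requestsServed.Load()-before, "per second")
    }

The causal profiler’s question is then how this measured rate would change under a virtual speedup of some other piece of the code.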

We got this tip from [Greg Kennedy], who notes that he hasn’t seen much use of causal profiling outside of the academic world, but we agree that there is likely some usefulness to this method of keeping track of a multi-threaded program’s efficiency. If you know of any other ways of solving this problem, or have seen causal profiling in use in the wild, let us know in the comments below.

Header image: Alan Lorenzo [CC BY-SA 3.0].

13 thoughts on “Where The Work Is Really Done – Causal Profiling”

    1. Basically the idea is this: you can have your regular profiler tell you how long instructions take and point out “slow” paths. But this isn’t always useful, because what is “slow” may be waiting on something even slower, or maybe a “fast” thread actually did its work and went to sleep when it COULD do more work, etc. CPU time and wall-clock time are very different in a multithreaded environment.

      Instead, the question you are trying to ask with causal profiling is: “If I made this part run X% faster, how much faster would the whole program go?” You can’t really do that, since you can’t just make code “go faster”… but you can make all the rest of the code go SLOWER, to simulate the same relative improvement.

      The profiling method simply automates all of that for you and produces a report, which can direct the developer on what to tackle next. A toy version of the delay trick is sketched below.
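      Here is one way that trick could look in Go. This is our own toy, not the actual bookkeeping any real causal profiler does; stageA, stageB, the sleep durations, and the 50% figure are all invented for illustration. To “virtually” speed up stageA by half, the other goroutine absorbs a delay equal to half of the time stageA actually ran, so the change in end-to-end progress approximates a real optimization of stageA:

        package main

        import (
            "fmt"
            "sync"
            "sync/atomic"
            "time"
        )

        // penalty accumulates the delay (in ns) other goroutines must
        // absorb to simulate stageA running faster than it really does.
        var penalty atomic.Int64

        func stageA(speedup float64) {
            start := time.Now()
            time.Sleep(10 * time.Millisecond) // stand-in for real work
            // Bank a delay equal to the simulated fraction of A's runtime.
            penalty.Add(int64(float64(time.Since(start)) * speedup))
        }

        func stageB() {
            time.Sleep(15 * time.Millisecond) // stand-in for real work
            // Absorb the banked delay, making B slower *relative to* A.
            if p := penalty.Swap(0); p > 0 {
                time.Sleep(time.Duration(p))
            }
        }

        func main() {
            const iters = 20
            start := time.Now()
            for i := 0; i < iters; i++ {
                var wg sync.WaitGroup
                wg.Add(2)
                go func() { defer wg.Done(); stageA(0.5) }() // "what if A were 50% faster?"
                go func() { defer wg.Done(); stageB() }()
                wg.Wait() // progress point: one pipeline iteration done
            }
            fmt.Println("time per iteration under virtual speedup:",
                time.Since(start)/iters)
        }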

  1. Isn’t casual profiling what’s done on Tinder? Ahh, you meant causal profiling, and nobody proofreads HaD articles before they are published…

    And I suspect that whoever wrote the short article above also didn’t read (or maybe just didn’t understand) the article he linked to…

  2. Enter routine, pin high; do routine; pin low, exit routine; oscilloscope on the pin. Very useful for looking at interrupt load on a microcontroller. If you would rather have a number, and don’t have an oscilloscope, add an RC filter and measure with a voltmeter. A sketch of the pin-toggle idea is below.
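    A minimal version of this in Go would need TinyGo and a supported board; everything here is an assumption for illustration (GP2 is a free GPIO on an RP2040 board such as the Raspberry Pi Pico, and busyRoutine stands in for the code being measured):

        package main

        import "machine"

        // probe is the GPIO watched on the oscilloscope; its high time
        // is the time spent inside busyRoutine.
        var probe = machine.GP2

        func busyRoutine() {
            probe.High() // entered the routine
            // ... the work being measured ...
            probe.Low() // left the routine
        }

        func main() {
            probe.Configure(machine.PinConfig{Mode: machine.PinOutput})
            for {
                busyRoutine()
            }
        }

    With the RC-filter variant, the averaged voltage is proportional to the pin’s duty cycle, i.e. the fraction of time spent inside the routine.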

  3. I do something similar for quick experiments.
    I slow the code down by a few cycles to see if it slows overall performance. If it does, then it’s a likely candidate for a perf increase if I spend time optimizing it.

    In theory this method won’t work when the threads are perfectly balanced and all take the same amount of time, but in practice it has worked about 95% of the time for me. A quick sketch of the experiment is below.
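    In Go, that experiment can be as blunt as temporarily padding the suspect section with a short sleep and comparing wall-clock runtimes. suspectSection, the 500 µs of fake work, and the 200 µs of padding are all invented here; in a real multithreaded program the interesting outcome is when the delta comes out much smaller than the padding would predict, meaning other threads are hiding the time:

        package main

        import (
            "fmt"
            "time"
        )

        const pad = 200 * time.Microsecond // arbitrary padding for the experiment

        func suspectSection(padded bool) {
            time.Sleep(500 * time.Microsecond) // stand-in for the real work
            if padded {
                time.Sleep(pad) // temporary slowdown; remove after the test
            }
        }

        func run(padded bool) time.Duration {
            start := time.Now()
            for i := 0; i < 1000; i++ {
                suspectSection(padded)
            }
            return time.Since(start)
        }

        func main() {
            base, slowed := run(false), run(true)
            // A delta near 1000*pad means the section is on the critical
            // path; a much smaller delta means other work is hiding it.
            fmt.Println("baseline:", base, "padded:", slowed, "delta:", slowed-base)
        }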

  4. start := time.Now()
     callSlowCode()              // hypothetical stand-in for the code being timed
     delta := time.Since(start) // elapsed wall-clock time, i.e. end minus start

    I’ve been doing this for years. And something that just popped into my head: if you are on a microcontroller, use an external timer that is activated by a pin from the micro you’re debugging.
