DIY Sound Localization Sensor

May 10, 2011

Sound localization is very popular in law enforcement circles due to its accuracy and ability to quickly separate gunshots from other similar noises. These systems don’t come cheap, and after trying to build one himself, [Fileark] knows why.

He thought it would be neat to build a sound localization sensor based on how the human ear determines a sound’s source. Once he got started, however, he realized just how hard it is to get localization right.

He used an LM324N op-amp as a volume comparator, which he says works decently enough, though he figures there are ICs out there that can do a better job. [Fileark] reports that the sound detector works well when the source is within about a foot of the sensors, but performance deteriorates at greater distances. He may consider using an ARM Cortex-M3 as his sound processor if he builds a second version, since the Arduino he used just doesn’t have enough power to sample and run calculations within the 10-50 microsecond window he requires.
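
As a rough software illustration of the volume-comparator idea, the sketch below compares the RMS level of two microphone buffers and picks a direction. It is not [Fileark]'s circuit or code: the function names `rms` and `steer` and the `dead_band_db` threshold are assumptions made up for this example.

```python
import math

def rms(samples):
    """Root-mean-square level of one buffer of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def steer(left, right, dead_band_db=1.0):
    """Return 'left', 'right', or 'center' from the level difference.

    dead_band_db is a hypothetical threshold that keeps the sensor still
    when the two channels are nearly equal.
    """
    l, r = rms(left), rms(right)
    if l == 0 or r == 0:
        # Avoid log(0): a silent channel loses, two silent channels tie.
        return "center" if l == r else ("left" if l > r else "right")
    diff_db = 20 * math.log10(l / r)
    if diff_db > dead_band_db:
        return "left"
    if diff_db < -dead_band_db:
        return "right"
    return "center"

# A 1 kHz test tone that is twice as loud (about +6 dB) in the left channel.
tone = [math.sin(2 * math.pi * 1000 * n / 48000) for n in range(480)]
print(steer([2 * s for s in tone], tone))
```

A dead band like this is one simple way to stop a servo from hunting back and forth when the two levels are close; the 1 dB value is only a placeholder.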

Keep reading to see a video of his sound localization sensor in action.

[youtube=http://www.youtube.com/watch?v=yHyuzKRZFUY&w=470]

32 thoughts on “DIY Sound Localization Sensor”

Ib says:

May 10, 2011 at 2:41 pm

Nice Idea!
I loled when he started shouting in the sensor.
“Hellooo”

lwatcdr says:

May 10, 2011 at 2:48 pm

“He may consider using an ARM Cortex-M3 as his sound processor if he builds a second version, since the Arduino he used just doesn’t have enough power to sample and run calculations within the 10-50 microsecond window he requires.”

Why not a PC? Lots of processing power there and odds are he has one.

ZeroCool42 says:

May 10, 2011 at 2:59 pm

lwatcdr: My guess, he wants to put it on a robot…

lwatcdr says:

May 10, 2011 at 3:01 pm

@ZeroCool42. Ahh good point. I would still prototype it on a PC for ease of development but that is just me.

fm says:

May 10, 2011 at 3:08 pm

Doesn’t the Kinect have a bunch of microphones built in for a similar purpose?

obsoehollerith says:

May 10, 2011 at 3:19 pm

Those cheap FET mics have a fairly consistent omni response, which seems reasonable if you’re trying to mimic the human localization scheme. But how about using tight hypercardioid dynamics that have a response peak in the part of the spectrum that you want to localize, and then measuring the phase relationship to determine your direction? Sure, it’d be expensive, but think of the bragging rights! lol

Fileark says:

May 10, 2011 at 3:27 pm

@lwatcdr, I agree that prototyping sound localization should probably be done on a PC first before moving to a microcontroller.

@fm, Wow, thanks for the Kinect tip. It looks like Microsoft included the microphones to possibly use for sound localization eventually, but has not really implemented it yet. I am looking and can’t find any hackers that have done it yet either. I always wanted an Xbox…

D_ says:

May 10, 2011 at 3:41 pm

I wonder if one could duplicate the human outer ear well enough that this could be done with just 2 microphones? Maybe duplicating the ears of another of Earth’s creatures would be easier? I wonder if time difference of arrival techniques are easily achievable at audio frequencies? Probably not as easily as at RF. An entertaining application would be a robotic head at the side of the roadway that would focus on vehicles as they approach, following them as they pass by.

Retro says:

May 10, 2011 at 4:26 pm

I think HaD has put an Arduino in my brain. I can’t tell you how often I have started some new project or gotten some new weird hardware, and then suddenly a few days later, HaD has a story on it.

Example: Last week, a friend gave me a tranz380 credit card terminal to play with, and then *poof* a story about hacking a tranz330.

And just today I was reverse engineering a phase detector circuit at work to troubleshoot it and started realizing that I could use it to build a sound locater. Drew some schematics on a napkin and everything. I come home (my work thinks HaD is about hacking and banned it. :( ) and get on here, and *poof* there is a story about a sound localizer.

Very odd…

Quin says:

May 10, 2011 at 4:37 pm

@D_ on one hand, audio should be easier than RF, because sound travels through air far more slowly than radio waves do. Using just peaks, like a gunshot, and ignoring echoes, you need about a foot of separation and the ability to measure under 1 ms difference.

The problem with determining direction by phase is that A above middle C has a wavelength of about 2.6 feet, so depending on how close or far apart your two sensors are you could get some bad data. And, since the peak of the phase moves at the same speed as above, with just 1 foot separating the mics, you will need a fast response time.

If you treat the Arduino like just another AVR proto-board, and program low level, the interrupts might be fast enough to trigger under 100us. The Arduino language wrapper, though, will be too slow I think.
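
The timing figures in this comment can be checked with a short calculation. The sketch below is an illustration, not anyone's posted code: it assumes a speed of sound of 343 m/s, a far-field source, and two mics one foot apart. The names `max_delay_s` and `bearing_deg` are invented for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 °C (assumed)

def max_delay_s(spacing_m):
    """Largest possible arrival-time difference between two mics."""
    return spacing_m / SPEED_OF_SOUND

def bearing_deg(delay_s, spacing_m):
    """Far-field bearing from the arrival-time difference.

    0 degrees is straight ahead (equal arrival times); +/-90 degrees is
    along the line joining the two mics.
    """
    x = SPEED_OF_SOUND * delay_s / spacing_m
    x = max(-1.0, min(1.0, x))  # clamp against measurement noise
    return math.degrees(math.asin(x))

spacing = 0.3048  # one foot, in metres
print(round(max_delay_s(spacing) * 1e6), "microseconds at most")
print(round(bearing_deg(100e-6, spacing), 1), "degrees for a 100 us delay")
```

With one foot of spacing the whole range of bearings fits into a delay of roughly 0.89 ms, which agrees with the "under 1 ms" estimate above. Under the same assumptions, a 100 µs timing step near straight ahead corresponds to about 6.5 degrees.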

Terry says:

May 10, 2011 at 5:51 pm

The sound pickup device needs to remain stationary. Eliminate the center pickup. Four microphones, one on each corner, are needed for surround sensing. Place the microphones in PVC tubing for greater directionality. Add some foam to reduce wind noise.

Much of your problem is from overwhelming the sensor with noise from the servo. You might want to consider increasing the gain on the preamp just a tad. What good is it if you need to scream into its ears as if at a loud concert? Heh? What’d you say?! Can’t hear you over the music.

Implement a sample/hold circuit by adding another op-amp to the circuit between the preamp and uP. This will solve the 10-50 µs timing issue by latching the analog signal until the uP has time to service it.

zool says:

May 10, 2011 at 6:39 pm

pretty cool

Marks says:

May 10, 2011 at 6:43 pm

Analog Devices Blackfin DSPs, 400 MHz – 4 dollars

Justin Case says:

May 10, 2011 at 8:59 pm

Have you listened through headphones to what your mics hear?
I wonder if the servo makes too much common noise.

If the two mic signals are summed then used as cancellation feedback into each channel, perhaps that would help.
Then when the sound is equal on both mics, there is no signal, so the unit need not move?
Perhaps high-pass filtering, as high freq is more directional as one decision input.
Then low pass as the other decision input.
Phase and echo may be misleading as data depending on the rule set.
It’s strange how we can also tell above left/right or below left/right as I listen to snoring to my lower-right.
Perhaps in-phase delay could be used for triangulation, as the delay is the vector in the hypotenuse of a triangle??? (+ve) is left, (-ve) is right?
For those who know FFT (not me), would that help here??

monster says:

May 10, 2011 at 11:06 pm

wonder if he thought about moving the ears further apart. that way he’d have a larger timeframe to sample

DudeGuy says:

May 10, 2011 at 11:20 pm

Combine this with the Portal Sentry. Unbelievable amazingness will ensue.

Hackius says:

May 10, 2011 at 11:43 pm

I’m pretty sure this is a job for a DSP

bacchus says:

May 11, 2011 at 1:49 am

I’m not sure that comparisons with gunshot locaters are helpful.

Vehicle mounted ones appear to use at least 5 microphones, and they’re dealing with distinctive sounds, so are presumably optimized for this purpose. On top of this, I’ll bet they throw some hefty processing power at the problem, justified given these things are about saving lives.

Interesting project though – I suspect Hackius is right about using a DSP, particularly if the end product needs to be reasonably small.

Mike says:

May 11, 2011 at 1:59 am

I once built a sound localization program in C# – it actually worked pretty well. All you need is 2 (decent) microphones, cross-correlate the two signals and then make a few simple calculations.
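
For readers wondering what "cross-correlate the two signals" might look like, here is a minimal brute-force sketch in plain Python. It is not Mike's C# program. The function name `best_lag`, the 48 kHz sample rate, and the synthetic test burst are all assumptions chosen for illustration.

```python
import math

def best_lag(left, right, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive result means `right` is a delayed copy of `left`, i.e. the
    sound reached the left mic first.
    """
    n = len(left)
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best, best_score = lag, score
    return best

# Synthetic test: a decaying 2 kHz burst that reaches the right mic
# 12 samples after the left mic.
fs = 48000  # samples per second (assumed)
burst = [math.exp(-n / 40.0) * math.sin(2 * math.pi * 2000 * n / fs)
         for n in range(200)]
left = [0.0] * 100 + burst + [0.0] * 100
delay = 12
right = [0.0] * delay + left[:-delay]

lag = best_lag(left, right, max_lag=50)
print(lag, "samples =", lag / fs, "seconds")
```

The lag divided by the sample rate gives the arrival-time difference, which can then be turned into a bearing with the mic spacing and the speed of sound. On real recordings the peak is less clean than in this synthetic case, and an FFT-based correlation would be much faster for long buffers.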

Fileark says:

May 11, 2011 at 9:41 am

@Mike, Do you still have the C# program? It would be more helpful than just saying “make a few simple calculations”. I would LOVE any kind of code samples from anyone, as all you can find on the web are just detail-less demonstrations of university projects. You can always contact me on my website http://filear.com/index.php/contact

poslathian says:

May 11, 2011 at 1:15 pm

I’ve fooled around with sound triangulation (via timing) before with a Maple (Cortex-M3) board. The build was an X,Y tap-sensitive wall. Turns out, with a fast enough proc, the timing resolution was easy. What was hard was tuning the amplifier on the piezo sensors so that they would spike from a tap (potentially far away), but not from random talking and movement in the room.

Seemed like an easy problem, but really took me a long time to get right!
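
The timing side of a tap-sensitive surface can be sketched as a small grid search over arrival-time differences. This is not poslathian's firmware. It assumes four sensors at the corners of a 1 m square, a made-up wave speed of 500 m/s in the panel, and ideal noise-free timestamps; `locate_tap` is a hypothetical name.

```python
import math

def locate_tap(sensors, arrival_times, speed, size=1.0, step=0.01):
    """Grid-search the point whose predicted arrival-time differences
    (relative to sensor 0) best match the measured ones."""
    measured = [t - arrival_times[0] for t in arrival_times]
    best, best_err = None, float("inf")
    steps = int(round(size / step))
    for ix in range(steps + 1):
        for iy in range(steps + 1):
            x, y = ix * step, iy * step
            d = [math.hypot(x - sx, y - sy) for sx, sy in sensors]
            predicted = [(di - d[0]) / speed for di in d]
            err = sum((p - m) ** 2 for p, m in zip(predicted, measured))
            if err < best_err:
                best, best_err = (x, y), err
    return best

sensors = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]  # panel corners, metres
speed = 500.0  # m/s, a hypothetical wave speed in the panel material

# Simulate the timestamps a tap at (0.30, 0.70) would produce.
true_x, true_y = 0.30, 0.70
times = [math.hypot(true_x - sx, true_y - sy) / speed for sx, sy in sensors]

x, y = locate_tap(sensors, times, speed)
print(round(x, 2), round(y, 2))
```

Only the differences between timestamps are used, so the absolute time of the tap does not need to be known. The amplifier tuning described above, which decides when a sensor counts as triggered, is the part this sketch leaves out.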

Justin Case says:

May 11, 2011 at 9:57 pm

I see lots of people making audio spectrum analyzers out of all sorts of microcontroller/LCD display combinations.
Nice ones too!!
Can’t be that hard to learn…

I would like to learn to program FFT for monkeying with audio signals.
Online suggestions???

ThePostman says:

May 13, 2011 at 6:25 pm

@Justin Case
Have a look at:
http://www.adrianlombard.com/physical-computing/avr-fft-code/

Travis says:

May 16, 2011 at 1:07 pm

Here’s a sound locating van I developed, used for tracking birds:
http://www.flickr.com/photos/traviswiens/3388930393/

shahidali says:

May 19, 2011 at 9:37 pm

hi all,
I’m doing a project on the design and development of 3D audio, in which I should be able to localize sound (find the location of a sound source) using headphones. I need this project in detail: its code, implementation of the code on hardware, hardware/PC, etc.
any help in this will be highly appreciated
thanks

hans says:

May 24, 2011 at 2:57 am

@shahidali, can you be more specific? What will it be used for?

shahidali says:

May 24, 2011 at 9:35 am

hi buddies….
It serves many purposes. This can be used for collision avoidance in aircraft (localizing the warning tone), etc. But my scope is crude, as I have to make a GUI in which I take a tone and give it some phase shift to localize the tone. I’m thinking of using Visual Studio with XNA Game Studio to produce the 2D or 3D effect. Kindly suggest suitable software with examples, code, etc. to produce the effects.
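
The "take a tone and give it some phase shift" idea can be sketched without any particular toolkit. The example below only generates two sample lists, one per headphone channel, with a small time offset between them. The function name `stereo_tone`, the 500 Hz tone, and the 0.5 ms offset are illustrative assumptions, and it is written in Python rather than the XNA setup mentioned above.

```python
import math

def stereo_tone(freq_hz, itd_s, duration_s=0.01, fs=44100):
    """Return (left, right) sample lists for a sine tone.

    A positive itd_s delays the right channel, so the tone reaches the
    left ear first; listeners tend to hear that as a source on the left.
    """
    n = int(round(duration_s * fs))
    left = [math.sin(2 * math.pi * freq_hz * (i / fs)) for i in range(n)]
    right = [math.sin(2 * math.pi * freq_hz * (i / fs - itd_s))
             for i in range(n)]
    return left, right

# 500 Hz tone with the right channel delayed by 0.5 ms (a quarter period).
left, right = stereo_tone(500, 0.0005)
print(len(left), round(left[0], 3), round(right[0], 3))
```

Timing cues like this are generally most effective for low-frequency tones; a fuller 3D effect would also need level differences and ear-shape filtering, which this sketch does not attempt.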

sunwukong says:

December 25, 2011 at 3:45 am

There is an audio spectrum analysis program for ham operators that features a DSP type setup to do RDF. It would probably be feasible to rework the program to solve this problem. It’s called SpecLab by Wolfgang Buescher, see link at
http://www.qsl.net/dl4yhf/spectra1.html

1. sunwukong says:
  
  November 5, 2015 at 2:23 pm
  
  I remember talking to the author of Spectrum Lab and he said it would be doable with his program.
  
Sunwukong says:

October 15, 2012 at 12:31 pm

I’ve gotten good results by taking a stereo harness and hooking up two mics (one mic reversed from the other) with small caps across the mic leads. I record a sound and move around while recording then when I play it back I can “hear” the movement.

It’s interesting to play a single tone over computer speakers, then slowly move around between the two speakers. You can “hear” where the nodes are as the sound gets stronger at a node, then fades.

goosenoose says:

November 3, 2012 at 11:12 am

i could see this being used in wartime applications.. say, soldiers carry a portable tripod with something like this attached to the top.

it would be used in defensive applications.. posted in a visible spot to soldiers and, if ambushed. set down.

soldiers would have to train to be silent to allow it to detect where shots are coming from..

i would invest in research for this ha..

JB says:

April 7, 2013 at 2:25 am

could see this being used in wartime applications.. say, soldiers carry a portable tripod with something like this attached to the top.
it would be used in defensive applications.. posted in a visible spot to soldiers and, if ambushed. set down.
soldiers would have to train to be silent to allow it to detect where shots are coming from..
i would invest in research for this ha..

THIS HAS ALREADY BEEN DONE AND IS ALREADY IN USE…. HOWEVER IT WOULD STILL BE NICE TO GET A DIRECTIONAL FIX ON A RANGE OF SOUND FREQUENCIES AND OR SPECIFIC FREQUENCIES
