Building A Better Kinect With A… Pager Motor?

August 13, 2012

Fresh from Microsoft Research is an ingenious way to reduce interference and decrease the error in a Kinect. Bonus: the technique only requires a motor with an offset weight, or just an oversized version of the vibration motor found in a pager.

Being the first of its kind of commodity 3D depth sensors, the tracking on a Kinect really isn’t that good. In every Kinect demo we’ve ever seen, there are always errors in the 3D tracking or missing data in the point cloud. The Shake ‘n’ Sense, as Microsoft Research calls it, does away with these problems simply by vibrating the IR projector and camera with a single motor.

In addition to getting high quality point clouds from a Kinect, this technique also allows for multiple Kinects to be used in the same room. In the video (and title pic for this post), you can see a guy walking around a room filled with beach balls in 3D, captured from an array of four Kinects.

This opens up the doors to a whole lot of builds that were impossible with the current iteration of the Kinect, but we’re thinking this is far too easy and too clever not to be though of before. We’d love to see some independent verification of this technique, so if you’ve got a Kinect project sitting around, strap a motor onto it, make a video and send it in.

[youtube=http://www.youtube.com/watch?v=CSBDY0RuhS4&w=470]

35 thoughts on “Building A Better Kinect With A… Pager Motor?”

Kaj says:

August 13, 2012 at 11:14 am

This is truly elegant engineering, I love it!

Report comment

Reply
Tinkerer says:

August 13, 2012 at 11:15 am

Not want to be a spoiler, but this is meant to solve interference between *several* Kinects; from the youtube text: “Shake ‘n’ Sense is a novel yet simple mechanical technique for mitigating the interference when two or more Kinect cameras point at the same part of a physical scene”
It does not improve performance for a single kinect, as far as the info of the movie goes.

Report comment

Reply
1. Mythgarr says:
  
  August 13, 2012 at 6:05 pm
  
  It’s true that it doesn’t improve the quality of a single sensor, but it DOES enable multiple sensors to be used on the same scene (stated in the summary and the movie). Multiple inputs can be combined to provide enhanced sensor quality, increased coverage, and increase the number of possibilities for this technology.
  Also, this same method would work fine with improved optics in the Kinect 2, allowing for potential support for multiple sensors.
  
  Report comment
  
  Reply
Chris C. says:

August 13, 2012 at 11:17 am

Who’da thunk it, so simple. I wonder any care needs to be taken to keep the weighted motors out of sync with each other.

Report comment

Reply
1. Kaj says:
  
  August 13, 2012 at 11:28 am
  
  I bet you could manage it for sure just by making all the weights vary in mass slightly.
  
  Report comment
  
  Reply
2. Finger says:
  
  August 13, 2012 at 12:57 pm
  
  It wouldn’t matter because the cameras aren’t from the same viewpoint so the motion of the pattern from one sensor will be vastly different when viewed from another sensor.
  
  Report comment
  
  Reply
  1. Tracy says:
    
    June 2, 2021 at 10:21 am
    
    Doesn’t the Kinect come with an accelerometer too so it knows how to level and calculate it’s view angle? I wonder if the standard driver software will allow the user to bypass the accelerometer so it doesn’t freak out with all the shaking going on. Even better would be if the accelerometer had a fast enough update and fine enough resolution that it knew its deflection at a given moment and can refine where believes it’s point cloud is…. Only one way to find out…(digs kinetic out of closet and steals wife’s ”back massager”) …. Now that I think about it, how did they come up with this solution? O.o
    
    Report comment
    
    Reply
boondaburrah says:

August 13, 2012 at 11:27 am

Look up the “V Motion Project” on vimeo. It was done for an energy drink company, but it uses this principle.

After you’ve been dubstepped out, the writeup is kinda interesting:

http://www.custom-logic.com/blog/v-motion-project-the-instrument/

Report comment

Reply
Mohonri says:

August 13, 2012 at 11:43 am

I imagine this would work for a single Kinect, actually. It’s a physical version of a technique used to increase the number of bits of accuracy for an ADC: introduce noise into your input and average the measurements. Atmel actually has an application note describing it here (PDF)

Report comment

Reply
Leithoa says:

August 13, 2012 at 11:45 am

Dithering on instruments to increase accuracy isn’t particularly new, although application to the kniect is obviously novel. This is definitely one of those things that now that you see it you’re surprised it hadn’t been done before.

Report comment

Reply
Aaron says:

August 13, 2012 at 11:51 am

Basically saccades, innit? Agreeing with those who are saying “How did everyone else not think of that?”

Report comment

Reply
willrandship says:

August 13, 2012 at 11:57 am

I’m surprised the kinect doesn’t have some latency in the IR sensor compared to the camera that causes problems with accuracy.

Report comment

Reply
1. AndroxxTraxxon says:
  
  August 18, 2012 at 7:56 pm
  
  Infrared and visible light both travel at the same speed, so if there is any latency, it’s in the data processing.
  
  Report comment
  
  Reply
ajacks504 says:

August 13, 2012 at 12:07 pm

Dithering…

Report comment

Reply
1. Aaron says:
  
  August 13, 2012 at 7:03 pm
  
  I dunno…well, maybe…but wait, no — ah, whatever you think!
  
  Report comment
  
  Reply
David s says:

August 13, 2012 at 12:28 pm

Is there a way to attenuate the noise so it doesn’t sound like my Xbox is getting an unending stream of text messages?

Report comment

Reply
kuxas says:

August 13, 2012 at 12:42 pm

The vibration motor found in a pager… or for those of us living in 2012, a phone…

Report comment

Reply
Jim McC says:

August 13, 2012 at 1:06 pm

Dithering, true, but motion blur is the more important aspect here.

Basically each infrared projector stops putting out a point cloud and starts putting out a motion-blurred point cloud. Because the infrared camera is undergoing the same motion blur, it sees the points it is projecting as true points still, but each other kinect only sees blurry indistinct IR light, not a point cloud that its trying to measure.

Genius!

Report comment

Reply
wondertest says:

August 13, 2012 at 1:07 pm

This technique works with a single kinect too. I wrote a piece of software that ran the motor up and down in slight increments, and captured depth data only while the lens was moving. It was hell on the motor, but it improved the capture quality significantly.

Report comment

Reply
minipimmer says:

August 13, 2012 at 1:11 pm

Does this have anything to do with the stochastic ressonance phenomena? http://en.wikipedia.org/wiki/Stochastic_resonance

Report comment

Reply
Hirudinea says:

August 13, 2012 at 2:42 pm

If microsoft doesn’t put this into Kinect 2.0 they’re crazy.

Report comment

Reply
Wilcorp70 says:

August 13, 2012 at 3:50 pm

I haven’t had a chance to read the source article, does this work with the stock firmware? Would this be able to improve accuracy using Kinect Games?

Report comment

Reply
burkley says:

August 13, 2012 at 4:02 pm

This makes me think of how a lock-in amplifier works.

Report comment

Reply
Adrian says:

August 13, 2012 at 4:21 pm

This was originally published by Andrew Maimone and
Henry Fuchs at the IEEE VR conference this March, I was there and it was a pretty interesting talk, and the general feeling in the crowd was “why didn’t we think of that?”

Here’s a link to the paper from IEEE VR if anyone is interested in more details:
http://www.cs.unc.edu/~fuchs/kinect_VR_2012.pdf

Report comment

Reply
1. gannon says:
  
  August 13, 2012 at 9:57 pm
  
  And here I was going to mention that :P
  It definitely is a simple solution to the interference issue though. I myself thought that turning the cameras/lasers on/off in sequence rapidly would’ve been the solution people would go for.
  
  Report comment
  
  Reply
2. neon22 says:
  
  August 14, 2012 at 5:49 am
  
  +1
  
  Report comment
  
  Reply
dopetik1911 says:

August 13, 2012 at 5:45 pm

Pretty simple and incredibly powerful!!!
Do you think this idea can help kinect to sense fast motions more accurately?

Report comment

Reply
notmyfault2000 says:

August 13, 2012 at 9:35 pm

What the hell’s a “pager?” And why do my grandparents keep giving me these sheets of paper with all these numbers on them? I think they call them “chaks” or something like that…

Report comment

Reply
1. Tracy says:
  
  June 13, 2021 at 12:13 pm
  
  Ill take care of those for you…. In fact itll be easier dispose of them properly if they just put my name on them….
  
  Report comment
  
  Reply
nah! says:

August 14, 2012 at 12:57 am

i can see many hackaday reader (those who own an xbox) playing xbox without force feedback, because they needed the motor for their kinect

Report comment

Reply
Whatnot says:

August 14, 2012 at 8:37 am

I’m sorry but may I ask that they do the voiceover for their video with an actual microphone and more importantly by a person not having a severe headcold? I found in unbearable and had to turn the sound off.

Report comment

Reply
Ryoku says:

August 14, 2012 at 10:13 am

now if someone can make a case mod for the kenetic that takes advantage of this and the “glasses” research that improves the scan resolution that would be awesome. just another reason for me to get off my butt and go buy a kinetic sadly I’ll have to invest in a more powerful netbook to pull it off while on the go ^^;

Report comment

Reply
Galane says:

August 15, 2012 at 3:32 pm

How about applying super resolution techniques to a multiple Kinect setup?

http://en.wikipedia.org/wiki/Super-resolution

Report comment

Reply
1. ratshit says:
  
  November 18, 2015 at 10:58 am
  
  wow thats cool!
  
  Report comment
  
  Reply
ratshit says:

November 18, 2015 at 10:59 am

any robot thats using a kinect setup is asking to get blinded by interferance. especially if its an interferance attack!

Report comment

Reply