Interactive Dynamic Video

August 26, 2016

If a picture is worth a thousand words, a video must be worth millions. However, computers still aren’t very good at analyzing video. Machine vision software like OpenCV can do certain tasks like facial recognition quite well. But current software isn’t good at determining the physical nature of the objects being filmed. [Abe Davis, Justin G. Chen, and Fredo Durand] are members of the MIT Computer Science and Artificial Intelligence Laboratory. They’re working toward a method of determining the structure of an object based upon the object’s motion in a video.

The technique relies on vibrations which can be captured by a typical 30 or 60 Frames Per Second (fps) camera. Here’s how it works: A locked down camera is used to image an object. The object is moved due to wind, or someone banging on it, or any other mechanical means. This movement is captured on video. The team’s software then analyzes the video to see exactly where the object moved, and how much it moved. Complex objects can have many vibration modes. The wire frame figure used in the video is a great example. The hands of the figure will vibrate more than the figure’s feet. The software uses this information to construct a rudimentary model of the object being filmed. It then allows the user to interact with the object by clicking and dragging with a mouse. Dragging the hands will produce more movement than dragging the feet.

The results aren’t perfect – they remind us of computer animated objects from just a few years ago. However, this is very promising. These aren’t textured wire frames created in 3D modeling software. The models and skeletons were created automatically using software analysis. The team’s research paper (PDF link) contains all the details of their research. Check it out, and check out the video after the break.

12 thoughts on “Interactive Dynamic Video”

RW says:

August 26, 2016 at 11:42 am

Why does this come across as “Video analysis solved…. we say screw it and use a form of sonar” ? :-D

Though good point that understanding an object comes through interacting with it, and babies begin to do it by stuffing things in their mouths… maybe need machines with mouths :-D

Report comment

Reply
RandyKC says:

August 26, 2016 at 11:49 am

Movies are about to get a lot reeler.

Report comment

Reply
Christopher Favreau says:

August 26, 2016 at 11:51 am

freaky….

Report comment

Reply
Rodrigo Loza says:

August 26, 2016 at 12:25 pm

Beautiful. Well thought using the frequency domain of an image to find its moving parts. Of course, there should be antecedents, but i haven’t read any.

Report comment

Reply
Rick Thompson says:

August 26, 2016 at 12:34 pm

i wonder if the software can also tell us where the force or frequency needs to be to get an object to behave in the same way as a simulation.

Report comment

Reply
Braneman says:

August 26, 2016 at 1:19 pm

Now this, I could actually see this having a lot of applications in making animations in video games more realistic. ESPECIALLY in making physics in games and bringing it up to a reasonable level. I could completely see taking a leaf off a plant, doing this to it and then making a 3d model of a plant with the same weights and whatnot to create more realistic animations.

Report comment

Reply
Redhatter (VK4MSL) says:

August 26, 2016 at 3:10 pm

16.7Hz is above what would be the Nyquist frequency when capturing at 30 FPS… so I’m guessing they used 60 FPS here. A question to ask though is, is the 6.2Hz actually 6.2Hz, or is some of it possibly at an image frequency?

Report comment

Reply
1. paul says:
  
  August 27, 2016 at 7:39 am
  
  In one of their video’s they use the “artefacts” from the rolling shutter to increase the FPS.
  So they can analyze multiple kHz frequencies from 50Hz video
  
  Report comment
  
  Reply
Dan#8582394734 says:

August 26, 2016 at 3:26 pm

They should team up with the guys that were using FPGA based systems to extract real-time geometry from stereo camera feeds. I wonder if they can use a third central camera runing at low resolution and 120 fps to extract the dynamics that can then be used to distort the high resolution 3D video?

Report comment

Reply
1. RW says:
  
  August 26, 2016 at 3:49 pm
  
  High FPS low res. sensor in the wiimote immediately jumps to mind.
  
  Report comment
  
  Reply
h4rm0n1c says:

August 26, 2016 at 5:23 pm

Reminds me of myst.

Report comment

Reply
Tectu says:

August 26, 2016 at 5:35 pm

Jesus, this just completely blew my mind. This is very impressive!

Report comment

Reply

Hackaday

Interactive Dynamic Video

12 thoughts on “Interactive Dynamic Video”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

Crunching The News For Fun And Little Profit

The End Of The Hackintosh Is Upon Us

The Hackaday Summer Reading List: No AI Involvement, Guaranteed

Back To The Future, 40 Years Old, Looks Like The Past

Why The Latest Linux Kernel Won’t Run On Your 486 And 586 Anymore

Our Columns

FLOSS Weekly Episode 840: End-of-10; Not Just Some Guy In A Van

Dithering With Quantization To Smooth Things Over

Could Space Radiation Mutate Seeds For The Benefit Of Humanity?

This Week In Security: Anthropic, Coinbase, And Oops Hunting

Hackaday Links: July 6, 2025

12 thoughts on “Interactive Dynamic Video”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns