OpenCV Brings Pinch To Zoom Into The Real World

Gesture controls arrived in the public consciousness a little over a decade ago as touchpads and touchscreens became more popular. The main limitation of gesture controls, at least as far as [Norbert] is concerned, is that they can only manipulate objects in a virtual space. He wanted to use gestures to control a real-world object instead, and created this device which uses them to control an actual, physical picture.

In this unique augmented reality device, not only is the object being controlled in the real world, but the gestures are monitored there as well, thanks to an OpenCV-based computer vision system watching his hand. The position data is fed into an algorithm which controls a physical picture mounted on a slender robotic arm. Now, when [Norbert] “pinches to zoom”, the servo attached to the picture physically moves it closer to or farther away from him. He can also use other gestures to move the picture around.
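The write-up doesn't spell out the exact tracking pipeline, but a minimal sketch of the idea might pair OpenCV's camera capture with a hand-landmark model (MediaPipe is assumed here) to measure the thumb-to-index “pinch” distance and map it to a servo angle sent over serial. The landmark indices, serial port, distance range, and one-angle-per-line protocol below are assumptions for illustration, not details from [Norbert]'s build.

```python
import math

import cv2
import mediapipe as mp
import serial  # pyserial; assumes the servo hangs off a serial-attached microcontroller

# Assumed port, baud rate, and protocol for the servo controller (e.g. an Arduino).
ser = serial.Serial("/dev/ttyUSB0", 115200, timeout=0.1)

hands = mp.solutions.hands.Hands(max_num_hands=1, min_detection_confidence=0.7)
cap = cv2.VideoCapture(0)

try:
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB; OpenCV captures BGR.
        result = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if result.multi_hand_landmarks:
            lm = result.multi_hand_landmarks[0].landmark
            thumb, index = lm[4], lm[8]  # thumb tip and index fingertip (normalized coords)
            pinch = math.hypot(thumb.x - index.x, thumb.y - index.y)
            # Map an assumed pinch range (~0.02..0.30) onto a 0..180 degree servo angle.
            angle = int(max(0.0, min(1.0, (pinch - 0.02) / 0.28)) * 180)
            ser.write(f"{angle}\n".encode())  # assumed one-angle-per-line protocol
        cv2.imshow("pinch", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
finally:
    cap.release()
    cv2.destroyAllWindows()
    ser.close()
```

The microcontroller end would then just read a number per line and write it to the servo; smoothing or rate-limiting the angle would keep the arm from twitching on noisy frames.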

While this gesture-controlled machine is certainly a proof-of-concept, there are plenty of other uses for gesture control of real-world objects. Any robotics platform could benefit from an interface like this, as could something slightly more mundane like an office PowerPoint presentation. Opportunity abounds, but if you need a primer for OpenCV, take a look at this build which tracks a hand in minute detail.

11 thoughts on “OpenCV Brings Pinch To Zoom Into The Real World”

  1. The problem I have with these kinds of interfaces is that you really need the machine to know that you are intending to talk to it. If the businessman running his PowerPoint presentation starts gesturing with his hands to his audience, what is to keep PowerPoint from suddenly closing, or exhibiting some other undesired behavior? We already have interfaces that behave this way. Take speech-to-text assistants: how many times have you heard someone’s Google Assistant or Siri start talking to them unexpectedly? Gestures could be even worse, especially in sensitive applications where mistakes are hazardous.

    1. Maybe eye tracking or UWB could provide additional context clues for the control algorithms? It’s not a given that voice or gesture control has to be inaccurate; it’s just an engineering challenge.

  2. One of the selling points of “cobots” – small robots for use in assembly/manufacturing and distribution working alongside people – is the ability to learn by manually moving the manipulator head through the desired operation and having the device remember (and often optimize) that operation. Using a vision system to simply demonstrate the operation (“pick this up, put it there, but absolutely don’t hit that thing…”) would be an interesting embellishment.

    1. This is a hard problem for machine vision and an easy problem for a 3D position sensor, so do it the easy way. Machine vision technology is mature and stable for 2D applications, not so much for 3D.
