How To Roll Your Own Custom Object Detection Neural Network

Real-time object detection, which uses neural networks and deep learning to rapidly identify and tag objects of interest in a video feed, is a handy feature with great hacker potential. Happily, it’s also possible to make customized CNNs (convolutional neural networks) tailored for one’s own needs, and that process just got easier thanks to some new documentation for the Vizy “AI camera” by Charmed Labs.

Raspberry Pi-based Vizy camera

Charmed Labs has been making hacker-friendly machine vision devices for a long time, and the Vizy camera impressed us mightily when we checked it out last year. Out of the box, Vizy has a perfectly functional object detector application that runs locally on the device, and can detect and tag many common everyday objects in real time. But what if that default application doesn’t quite meet one’s project needs? Good news, because it’s possible to create a custom-trained CNN, and that process got a lot more accessible thanks to step-by-step examples of training a model to recognize hands doing rock-paper-scissors.

Person and cat with machine-generated tags identifying them
Default object detection works well, but sometimes one needs custom results.

The basic process is this: Start with a variety of images that show the item of interest. Then identify and label the item of interest in each photo. These photos (a “training set”) are then sent to Google Colab, which will be used to generate a neural network. The resulting CNN model can then be downloaded and used, to see how well it performs.

Of course things rarely work perfectly the first time around, so at this point it’s pretty common for some refinement to be needed to increase accuracy. Luckily there are a number of tools to help do this without creating a new model from scratch, so it’s just a matter of tweaking until things perform acceptably.

Google Colab is free and the resulting CNNs are implemented in the TensorFlow Lite framework, meaning it’s possible to use them elsewhere. So if custom object detection has been holding up a project idea of yours, this might be what gets you over that hump.

Stadia Controller’s Two Extra Buttons Get Seen With WebHID

The Google Stadia game streaming service relied on a proprietary controller. It was a pretty neat piece of hardware that unfortunately looked destined for landfills when Google announced that Stadia would discontinue. Thankfully it’s possible to use them as normal gamepads, and related to that, [Thomas Steiner] has a developer blog post about how to talk to the Stadia controller via WebHID. Continue reading “Stadia Controller’s Two Extra Buttons Get Seen With WebHID”

RCA’s Clear Plastic TV Wowed Crowds In 1939

In the United States in 1939, television sets still had a long way to go before they pretty much sold themselves. Efforts to do just that are what led to RCA’s Lucite Phantom Telereceiver, which aimed to show people a new way to receive broadcast media.

Created for the 1939 World’s Fair, the TRK-12 Lucite Phantom Telereceiver introduced people to the concept of television. Production models were housed in contemporary wood cabinets, but the clear acrylic (itself also a relatively new thing) units allowed curious potential customers to gaze within, and see what was inside these devices.

One interesting feature is the vertically-mounted cathode ray tube, which reflects off a mirror in the top cover of the cabinet for viewing. This meant that much of the bulk of the TRK-12 could be vertical instead of horizontal. Important, because the TRK-12 was just over a meter tall and weighed 91 kilograms (or just over 200 lbs.)

Clearly a luxury item, the TRK-12 sold for $600 which was an eye-watering sum for the time. But it was a glimpse of the future, and as usual, the future is made available a few ticks early to those who can afford the cost.

Want to see one in person? You might be in luck, because an original resides at the MZTV Museum of Television in Toronto, Canada.

Domino Ring Machine Tips Tiles In A Never-ending Wave

Like to see dominoes fall? [JK Brickworks] has got what you need, in the form of a never-ending ring of falling and resetting tiles. LEGO pieces are the star in this assembly, which uses a circular track and moving ramp to reset tiles after they have fallen. Timed just right, it’s like watching a kinetic sculpture harmoniously generating a soliton wave as tiles fall only to be endlessly reset in time to fall again.

A Mindstorms IR sensor monitors a tile’s state for timing.

It’s true that these chunky tiles aren’t actually dominoes — not only are they made from LEGO pieces and hinged to their bases, they have a small peg to assist with the reset mechanism. [JK Brickworks] acknowledges that this does stretch the definition of “dominos”, but if you’re willing to look past that, it’s sure fun to see the whole assembly in action.

The central hub in particular is a thing of beauty. For speed control, an IR sensor monitors a single domino’s up/down state and a LEGO Mindstorms EV3 with two large motors takes care of automation.

The video does a great job of showing the whole design process, especially the refinements and tweaks, that demonstrate the truly fun part of prototyping. [JK Brickworks] suggests turning on subtitles for some added details and technical commentary, but if you’re in a hurry skip directly to 4:55 to see it in action.

Want to see more automated domino action? This domino-laying robot sets them up for you to knock down at your leisure, and this entirely different robot lays out big (and we do mean BIG) domino art displays.

Continue reading “Domino Ring Machine Tips Tiles In A Never-ending Wave”

With ChatGPT, Game NPCs Get A Lot More Interesting

Not only is AI-driven natural language processing a thing now, but you can even select from a number of different offerings, each optimized for different tasks. It took very little time for [Bloc] to mod a computer game to allow the player to converse naturally with non-player characters (NPCs) by hooking it into ChatGPT, a large language model AI optimized for conversational communication.

If you can look past the painfully-long loading times, even buying grain (7:36) gains a new layer of interactivity.

[Bloc] modified the game Mount & Blade II: Bannerlord to reject traditional dialogue trees and instead accept free-form text inputs, using ChatGPT on the back end to create more natural dialogue interactions with NPCs. This is a refinement of an earlier mod [Bloc] made and shared, so what you see in the video below is quite a bit more than a proof of concept. The NPCs communicate as though they are aware of surrounding events and conditions in the game world, are generally less forthcoming when talking to strangers, and the new system can interact with game mechanics and elements such as money, quests, and hirelings.

Starting around 1:08 into the video, [Bloc] talks to a peasant about some bandits harassing the community, and from there demonstrates hiring some locals and haggling over prices before heading out to deal with the bandits.

The downside is that ChatGPT is currently amazingly popular. As a result, [Bloc]’s mod is stuck using an overloaded service which means some painfully-long load times between each exchange. But if you can look past that, it’s a pretty fascinating demonstration of what’s possible by gluing two systems together with a mod and some clever coding.

Take a few minutes to check out the video, embedded below. And if you’re more of a tabletop gamer? Let us remind you that it might be fun to try replacing your DM with ChatGPT.

Continue reading “With ChatGPT, Game NPCs Get A Lot More Interesting”

Researchers Find “Inert” Components In Batteries Lead To Cell Self-Discharge

When it comes to portable power, lithium-ion batteries are where it’s at. Unsurprisingly, there’s a lot of work being done to better understand how to maximize battery life and usable capacity.

Red electrolytic solution, which should normally be clear.

While engaged in such work, [Dr. Michael Metzger] and his colleagues at Dalhousie University opened up a number of lithium-ion cells that had been subjected to a variety of temperatures and found something surprising: the electrolytic solution within was a bright red when it was expected to be clear.

It turns out that PET — commonly used as an inert polymer in cell assembly — releases a molecule that leads to self-discharge of the cells when it breaks down, and this molecule was responsible for the color change. The molecule is called a redox shuttle, because it travels back and forth between the cathode and the anode. This is how an electrochemical cell works, but the problem is this happens all the time, even when the battery isn’t connected to anything, causing self-discharge.

Continue reading “Researchers Find “Inert” Components In Batteries Lead To Cell Self-Discharge”

Behold A Microscope That Sees By Squashing Things Into It

“Look with your eyes, not your hands” is something many of us have heard while growing up, but that doesn’t apply to the touch-sensitive microscope [Steve Mould] got to play with.

Gel pad removed, exposing lens and multi-directional lighting.

The wand-like device is made by Gelsight, and instead of an optical lens like a normal microscope, it sports a gel pad on the sensing end. By squashing an object into the gel, the device is able to carefully illuminate and image the impression created. By taking multiple images lit from different angles, a lot of information can be extracted.

The result is a high-resolution magnification — albeit a monochromatic one — that conveys depth extremely well. It’s pretty neat clearly seeing tiny specks of dust or lint present on surfaces when [Steve] demonstrates imaging things like coin cells.

Many a hacker knows that the devil is in the details when it comes to executing an idea. Even so, the basic principles of the Gelsight seem simple enough and possibly within the realm of inspired DIY in the same way that we saw a CNC gantry and USB microscope repurposed as an optical comparator.

Watch the Gelsight in action in the video below, embedded below the page break.

Continue reading “Behold A Microscope That Sees By Squashing Things Into It”