Blind Camera: Visualizing A Scene From Its Sounds Alone

June 12, 2023

A visualization by the Blind Camera based on recorded sounds and the training data set for the neural network. (Credit: Diego Trujillo Pisanty)

When we see a photograph or photo of a scene, we can likely imagine what sounds would go with it, but what if this gets inverted, and we have to imagine the scene that goes with the sounds? How close would we get to reconstructing the scene in our mind, without the biases of our upbringing and background rendering this into a near-impossible task? This is essentially the focus of a project by [Diego Trujillo Pisanty] which he calls Blind Camera.

Based on video data recorded in Mexico City, a neural network created using Tensorflow 3 was trained using an RTX 3080 GPU on a dataset containing frames from these videos that were associated with a sound. As a result, when the thus trained neural network is presented with a sound profile (the ‘photo’), it’ll attempt to reconstruct the scene based on this input and its model, all of which has been adapted to run on a single Raspberry Pi 3B board.

However, since all the model knows are the sights and sounds of Mexico City, the resulting image will always be presented as a composite of scenes from this city. As [Diego] himself puts it: for the device, everything is a city. In a way it is an excellent way to demonstrate how not only neural networks are limited by their training data, but so too are us humans.

4 thoughts on “Blind Camera: Visualizing A Scene From Its Sounds Alone”

Andrew says:

June 12, 2023 at 5:46 pm

“When you hear hoofbeats, think horses, not zebras.”

Report comment

Reply
paul shallard says:

June 12, 2023 at 9:53 pm

so, when can we expect the naval version of this
3d underwater sonar imaging would be a game changer
kept waiting for the lidar version a sonar has happened yet, that we know of?

Report comment

Reply
1. Kaiser says:
  
  June 13, 2023 at 2:52 am
  
  Synthetic Aperture Sonar is probably what gives you the best images currently.
  Having that in a cheap to use form would be amazing
  
  Report comment
  
  Reply
jenningsthecat says:

June 14, 2023 at 7:20 am

This sounds like a project that might benefit greatly from something akin to an LLM.

Report comment

Reply