Voice Without Sound

March 15, 2023

Voice recognition is becoming more and more common, but anyone who’s ever used a smart device can attest that they aren’t exactly fool-proof. They can activate seemingly at random, don’t activate when called or, most annoyingly, completely fail to understand the voice commands. Thankfully, researchers from the University of Tokyo are looking to improve the performance of devices like these by attempting to use them without any spoken voice at all.

The project is called SottoVoce and uses an ultrasound imaging probe placed under the user’s jaw to detect internal movements in the speaker’s larynx. The imaging generated from the probe is fed into a series of neural networks, trained with hundreds of speech patterns from the researchers themselves. The neural networks then piece together the likely sounds being made and generate an audio waveform which is played to an unmodified Alexa device. Obviously a few improvements would need to be made to the ultrasonic imaging device to make this usable in real-world situations, but it is interesting from a research perspective nonetheless.

The research paper with all the details is also available (PDF warning). It’s an intriguing approach to improving the performance or quality of voice especially in situations where the voice may be muffled, non-existent, or overlaid with a lot of background noise. Machine learning like this seems to be one of the more powerful tools for improving speech recognition, as we saw with this robot that can walk across town and order food for you using voice commands only.

14 thoughts on “Voice Without Sound”

dlcarrier says:

March 15, 2023 at 11:44 am

The Enders Game series had these, as well as tablet computers. Now all we need is faster-than-light communications and relativistic travel speeds.

Report comment

Reply
1. Ninjaneer says:
  
  March 15, 2023 at 2:40 pm
  
  That’s what I was thinking too. And don’t forget Jane, the AI in the ‘net, that Ender was communicating with.
  
  Report comment
  
  Reply
2. Steve L says:
  
  March 15, 2023 at 2:54 pm
  
  What about a related field, lip reading. HAL did it. But it is “really hard”, even for the human mind.
  
  Report comment
  
  Reply
[EGO] says:

March 15, 2023 at 12:52 pm

A vocabulary word immediately popped into my head when I read this. Fricative. What about the fricatives? Those and other things that are created more in the upper area/lip area. Are they reasonably accurate?

Report comment

Reply
1. Erik Johnson says:
  
  March 15, 2023 at 12:55 pm
  
  Maybe with some self discipline/training, e.g. many of us can speak coherently without moving our mouths if we think about it
  
  Report comment
  
  Reply
  1. Dale A Kaup says:
    
    March 15, 2023 at 6:33 pm
    
    I read an article a few years ago and basically it said people unconsciously form speech silently and that there are decipherable laryngeal movements while thinking. While you may not think you’re doing anything other than thinking, research says otherwise
    
    Report comment
    
    Reply
2. Spazer says:
  
  March 15, 2023 at 3:45 pm
  
  Different subset of language specifically for these devices?
  
  Report comment
  
  Reply
𐂀 𐂅 says:

March 15, 2023 at 1:42 pm

Didn’t NASA nail this problem ages ago and were even able to detect sub vocalizations so that the user didn’t even need to actual make a sound, the tiny changes in electrical activity in the neck muscles was enough?

Report comment

Reply
1. CRJEEA says:
  
  March 15, 2023 at 8:36 pm
  
  The military has had that for a while.
  They tried using it to aim and fire automatic gun turrets and remotely operated vehicles.
  
  Report comment
  
  Reply
Tom says:

March 15, 2023 at 3:30 pm

SottoVoce is such an awesome name!

Named after this, no doubt:

https://youtu.be/o84uUs40ql4

Report comment

Reply
Raidcore says:

March 15, 2023 at 3:58 pm

There have been several devices like this over the past 5 years or so. Still cool and I hope it takes off

https://www.smithsonianmag.com/innovation/device-can-hear-voice-inside-your-head-180972785/

Report comment

Reply
craig says:

March 15, 2023 at 4:25 pm

For ages (think, WWII) the military had throat mike things that just used vibrations instead of sound itself. Called a voiceless mic if I recall correctly. Used for loud aircraft and stuff. Sometime they would be in movies where they pinch their neck when talking. I played with some surplus ones like 30 years ago. But if you want to uncover AI and neural net learning and stuff, fine.

Report comment

Reply
1. CRJEEA says:
  
  March 15, 2023 at 8:40 pm
  
  They had bone conduction lolly pops in the nineties. You could hear music playing through your teeth as you ate it. You just don’t get stuff like that these days.
  
  Report comment
  
  Reply
  1. Giin says:
    
    March 18, 2023 at 6:37 am
    
    Lol I remember those! Man, they made my teeth ache!
    
    Report comment
    
    Reply

Hackaday

Voice Without Sound

14 thoughts on “Voice Without Sound”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

The Hackaday Summer Reading List: No AI Involvement, Guaranteed

Back To The Future, 40 Years Old, Looks Like The Past

Why The Latest Linux Kernel Won’t Run On Your 486 And 586 Anymore

One Laptop Manufacturer Had To Stop Janet Jackson Crashing Laptops

The 2025 Iberian Peninsula Blackout: From Solar Wobbles To Cascade Failures

Our Columns

This Week In Security: Anthropic, Coinbase, And Oops Hunting

Hackaday Links: July 6, 2025

Hackaday Podcast Episode 327: A Ploopy Knob, Rube-Goldberg Book Scanner, Hard Drives And Power Grids Oscillating Out Of Control

Last Chance: 2025 Hackaday Supercon Still Wants You!

FLOSS Weekly Episode 839: I Want To Get Paid Twice

14 thoughts on “Voice Without Sound”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns