When you are young, you take it for granted that you can pick out a voice in a crowded room or on a factory floor. But as you get older, your hearing often degrades to the point where a noisy room merges into a mishmash of sounds. University of Washington researchers have developed what they call Target Speech Hearing. In plain English, it is an AI-powered headphone system that lets you look at someone and pull their voice out of the chatter. For best results, however, you have to enroll their voice first, so it wouldn’t make a great eavesdropping device.
If you want to dive into the technical details, their paper goes into how it works. The prototype uses a Sony noise-cancelling headset. However, the system requires binaural microphones, so additional microphones are attached to the outside of the headphones.
Given training data, we wonder if traditional correlation methods would be just as effective. In other words, you could use facial recognition to figure out who’s talking and pull their voice out using more traditional signal-processing techniques. However, this system can potentially pick up sound from unknown speakers, inferring direction from the binaural microphones, so even if the correlation method worked well on known speakers, the new system is likely superior in new situations.
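For what it’s worth, the “traditional correlation” idea can be sketched in a few lines. This is a toy illustration, not anything from the paper (all names here are made up, and a real system would correlate spectral features, not raw samples): slide an enrolled snippet of the target voice across the mixture and see where it matches best.

```python
import numpy as np

def spot_enrolled_voice(mixture, template):
    """Toy matched-filter speaker spotting: slide an enrolled waveform
    snippet over a mixture and return the best-matching offset and a
    normalised match score in roughly [0, 1]."""
    m = mixture - mixture.mean()
    tpl = template - template.mean()
    # raw cross-correlation of the mixture against the template
    corr = np.correlate(m, tpl, mode="valid")
    # normalise by local energy so loud passages don't dominate
    energy = np.sqrt(np.convolve(m * m, np.ones(len(tpl)), mode="valid"))
    score = corr / (energy * np.linalg.norm(tpl) + 1e-12)
    best = int(score.argmax())
    return best, float(score[best])
```

On raw waveforms this only works if the enrolled snippet appears nearly verbatim in the mixture, which is exactly why real systems move to feature space; the point is just the shape of the approach.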
There’s more to noise-cancelling headgear than you might think. Or you can just go low-tech.
> In plain English, it is an AI-powered headphone system that lets you look at someone and pull their voice out of the chatter. For best results, however, you have to enroll their voice first, so it wouldn’t make a great eavesdropping device.
So we’re at the point where “AI” denotes the use of a directional microphone + some DSP, or am I missing something?
Meh, I could go either way on this. I suppose it is doing DSP, but it’s being done by a neural network that was previously trained on the target voice, and that is an AI technique.
The most remarkable thing about this to me is running a neural network fast enough to do noise cancellation.
Wow, that’s really, really cool! I doubt grandma will be wearing them during family parties (the headphones mess with her hair, and their size messes with her looks), but that’s a “tiny” detail. I love this concept!
I’ve never been able to hear voices above the noise, and now you’re telling me my hearing is going to get worse?! That’s a downer.
I read HaD’s article and skimmed the source a bit -> I’m not sure how the system selects the “source”…
I’d have expected something like
1. two eyetracking cameras
2. triangulation where one is looking (direction, distance)
3. ??? some math ???
4. use three microphones (left, right & top) to enhance the sound from the triangulated source.
But that doesn’t use AI…
There are two microphones. When the user taps a button and looks towards the source, the sound arrives at both microphones at the same time (as does sound from straight behind, above and below you, but most likely the source is the major contributor). Since the source signal is mostly the same for both microphones, the target’s voice can be extracted and analysed by a neural network. The result of that analysis is used by another neural network to isolate the source voice.
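The zero-delay idea described above can be sketched as follows. This is not the paper’s code, just a minimal illustration of the physics: cross-correlate the two ear signals, and a source the wearer is facing shows up near zero inter-channel lag; averaging the channels then reinforces that zero-delay content and partially cancels lateral sources.

```python
import numpy as np

def frontal_component(left, right, fs, max_itd_ms=0.8):
    """Crude sketch: a source the wearer faces arrives at both ears
    with ~zero inter-channel delay. Returns the channel average, the
    estimated inter-channel delay in ms, and whether the dominant
    source is roughly on the front-back axis. (0.8 ms is a guess just
    above the maximum human interaural time difference.)"""
    # find the inter-channel delay of the dominant source
    corr = np.correlate(left - left.mean(), right - right.mean(), mode="full")
    lag = corr.argmax() - (len(right) - 1)
    itd_ms = 1000.0 * lag / fs
    facing = abs(itd_ms) < max_itd_ms
    # averaging reinforces zero-delay (frontal) content
    mono = 0.5 * (left + right)
    return mono, itd_ms, facing
```

The real system, per the description above, feeds that extracted signal to a neural network rather than stopping at a channel average.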
My thought would just be #4 – two or three microphones pointed at where a person talking to you is most likely to be, so all you have to do is point your head at them and be about the right distance from them.
That’s the idea that occurred to me as I watched my brother-in-law fiddling with controls for his hearing aids using his phone.
It seems you ought to be able to use beam forming on three microphones to emphasize sound sources “straight ahead” – just turn your head to look at a source in order to “tune it in.”
That would make usage more natural. People normally turn to look at the person they are listening to.
In “formal” settings, people normally turn to the speaker, yes. If you’re e.g. out walking, probably not. The goal of this thing is to also isolate the sound source when for whatever reason you’re not looking.
Yeah, this is for when you’re at a crowded pub and everyone’s shouting so you have to shout. You could be having a conversation with several people, not just one person whose eyes you can stare into so very constantly and lovingly.
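The delay-and-sum beamforming suggested a few comments up can be sketched like this. It’s a toy far-field version with integer-sample delays, not how the UW prototype works; real beamformers use fractional delays and adaptive weights.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s in air

def delay_and_sum(channels, mic_positions, fs, look_dir):
    """Steer a simple delay-and-sum beamformer toward look_dir.

    channels: (n_mics, n_samples) array; mic_positions: (n_mics, 3)
    in metres; look_dir: vector pointing toward the source. Sound
    from look_dir adds coherently after alignment; sound from other
    directions partially cancels.
    """
    mic_positions = np.asarray(mic_positions, float)
    look_dir = np.asarray(look_dir, float)
    look_dir /= np.linalg.norm(look_dir)
    # arrival-time offsets for a far-field plane wave:
    # mics closer to the source hear it earlier
    delays = -(mic_positions @ look_dir) / SPEED_OF_SOUND
    shifts = np.round((delays - delays.min()) * fs).astype(int)
    n = channels.shape[1] - shifts.max()
    aligned = [ch[s:s + n] for ch, s in zip(channels, shifts)]
    return np.mean(aligned, axis=0)
```

With the mics on a headset, “turn your head to tune it in” just means `look_dir` is always straight ahead in head coordinates, which is exactly why the approach feels natural.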
Don’t show this article to my wife!
There were glasses that accomplished this without AI. They used four mics and amplified sound from the direction you were looking, probably feeding it through a voice bandpass filter, and used the other mics to attenuate sound coming from other directions. They looked much more like a normal pair of glasses.
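If those glasses really do run a voice bandpass after the directional stage (a guess on the commenter’s part, and mine), the filter itself is simple. This sketch assumes SciPy and the classic 300–3400 Hz telephone voice band:

```python
import numpy as np
from scipy.signal import butter, sosfilt

def voice_bandpass(x, fs, lo=300.0, hi=3400.0, order=4):
    """Band-limit a signal to the telephone voice band. The 300-3400 Hz
    range and filter order are assumptions, not the product's actual
    processing chain (which isn't public)."""
    # second-order sections are numerically safer than (b, a) here
    sos = butter(order, [lo, hi], btype="bandpass", fs=fs, output="sos")
    return sosfilt(sos, x)
```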
As a person who has trouble listening in noisy environments, I find that wearing the right ear plugs already helps a lot. Still, it’s great to find uses of AI to help people cope in all kinds of situations.