Building A Smart Speaker From Scratch

July 5, 2019

Smart speakers have proliferated since their initial launch earlier this decade. The devices combine voice recognition and assistant functionality with a foreboding sense that paying corporations for the privilege of having your conversations eavesdropped upon could come back to bite one day. For this reason, [Yihui] is attempting to build an open-source smart speaker from scratch.

The initial prototype uses a Raspberry Pi 3B and a ReSpeaker microphone array. In order to try and bring costs down, development plans include replacing these components with a custom microphone array PCB and a NanoPi board, then implementing basic touch controls to help interface with the device.

There’s already been great progress, with the build showing off some nifty features. Particularly impressive is the ability to send WiFi settings to the device using sound, along with the implementation of both online and offline speech recognition capabilities. This is useful if your internet goes down but you still want your digital pal to turn out the lights at bed time.

It’s not the first time we’ve seen a privacy-focused virtual assistant, and we hope it’s not the last. Video after the break.

6 thoughts on “Building A Smart Speaker From Scratch”

George1984 says:

July 5, 2019 at 4:46 pm

How pray tell does “open source” have anything to do with what is essentially an always on listening device ? I’ve never seen so many people so willingly eager to surrender their privacy in the name of “convenience”. That mindset combined with legislation exploiting all avenues of fear mongering (including the embarassing “students” at a certain Ivy League university enthusiastically signing a petition to abolish the 1st Amendment), the “Patriot Act”, efforts to demonize law abiding gun owners, etc – all point to a very prescient prediction of future life, courtesy of a Mr Orwell.

Reply
1. Yihui says:
  
  July 6, 2019 at 9:37 am
  
  “Open source” means you can audit and change the source code. “always on listening” is not continuously sending audio to the cloud. The audio will be processed locally until a keyword is triggered. You can also make it work offline, which should have no privacy problem.
  
  Reply
2. Ralph says:
  
  July 6, 2019 at 10:19 am
  
  If the device was fully local, no remote corporate backend, and didn’t log anything recorded (except perhaps if YOU CHOOSE to turn on logging of things said just after the “wake word” and wish to store them in a file encrypted with a key only you know for your own later perusal) there’s nothing anti-freedom/anti-privacy left about it. And such a thing can definitely be open-sourced, nothing incompatile there. At that point the only objection becomes the fact that such devices are still a vastly inferior less reliable more buggy way to enter queries and access information than keyboard/command line/mouse and gui/touchscreen/… methods.
  
  Reply
  1. Pat says:
    
    July 8, 2019 at 7:39 am
    
    You don’t need to go that far – at least for Google Assistant, you can do the voice-to-text yourself and send the text directly if you want. At that point you know everything that’s being sent, and the Google Assistant portion is essentially just a smart command line.
    
    That being said the biggest problem with the Google Assistant goal is that the library isn’t anywhere *near* as capable as the actual system is, since it’s unable to do a ton of things due to what Google is willing to provide openly.
    
    Reply
Joel Finkle says:

July 6, 2019 at 7:03 am

I’ll need to take a look at this. Between the death of the Chromecast Audio, the Muzo Cobblestone’s terrible audio output, and wonky support for my ancient Squeezebox touch, I could use something with stereo output for whole house audio.

Reply
Kliment says:

July 14, 2019 at 5:11 am

Totally offtopic but I designed that octocat model back in 2011! Here is the original openscad source if anyone wants to play with it http://kliment.kapsi.fi/octocat/ (server migration in 2017 messed up the timestamps, sorry about that). See also: the horrible print quality we put up with at the time.

Reply

Hackaday

Building A Smart Speaker From Scratch

6 thoughts on “Building A Smart Speaker From Scratch”

Leave a Reply to KlimentCancel reply

Search

Never miss a hack

If you missed it

With Affordable Storage Options Dwindling, Where To Store Our Data?

Ask Hackaday: How Much Compute Is Enough?

WheatForce: Learning From CPU Architecture Mistakes

Improving FDM Filament Drying With A Spot Of Vacuum

Spy Tech: Conflicts Bring A New Number Station

Our Columns

In Space (Probably) Everyone Can Hear You.. Well, You Know

Re-Learning How To Run

Hackaday Podcast Episode 364: Clocks, Cameras, And Free Will

This Week In Security: The Supply Chain Has Problems

Sega Meganet: Online Gaming In 1990

6 thoughts on “Building A Smart Speaker From Scratch”

Leave a Reply to KlimentCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns