Let Alexa Control Your Life; Guide To Voice-Enable Everything

December 26, 2015

Let’s face it, automation doesn’t feel quite as futuristic unless you can just say what you want out loud and have the machines flawlessly obey. That is totally possible now — and on the cheap. Well, cheap as far as money goes. It can be an expensive learning curve to get it all working. This will help. [Lindo St. Angel] has put together a guide to navigate voice control of hardware using Amazon’s Alexa SDK.

We previously reported that Amazon’s AI had escaped its hardware prison in the form of the Alexa Skills Kit. Yes, calling it the Alexa SDK above is wrong; it’s actually the ASK, but nobody knows what that acronym is, while most recognize the gist of an SDK. It gives you the hooks and the documentation necessary to leverage the functionality in your own applications. The core functionality of Alexa is voice recognition. Even so, it’s still a tall hill to climb.

[Lindo] has broken down the problem into a very manageable example. The Amazon Voice Service (part of ASK) is used for voice recognition and control. Amazon’s Lambda service connects the ASK to your piece of hardware; in this case he’s using a Raspberry Pi as the server. The final step is to connect your hardware to the Pi. [Lindo] is interfacing a keypad-based home automation system with the Pi but the sky’s the limit at this point.
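
To make that chain concrete, here is a minimal Python sketch of the Lambda side: it reads the intent out of an incoming Alexa request, forwards a matching command toward the Pi, and wraps the reply as speech. It is not [Lindo]'s code. The intent names, the command strings, and the `send_to_pi` callable are all hypothetical, and the transport between Lambda and the Pi is deliberately left as an injected function because the article does not describe it.

```python
import json
from typing import Callable, Dict

# Hypothetical mapping from Alexa intent names to commands understood by the
# server on the Pi. These names are illustrative and not from the guide.
INTENT_TO_COMMAND: Dict[str, str] = {
    "ArmSystemIntent": "arm",
    "DisarmSystemIntent": "disarm",
    "GetStatusIntent": "status",
}


def build_speech_response(text: str, end_session: bool = True) -> dict:
    """Wrap plain text in the JSON envelope a custom Alexa skill returns."""
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": text},
            "shouldEndSession": end_session,
        },
    }


def handle_request(event: dict, send_to_pi: Callable[[str], str]) -> dict:
    """Route one Alexa request.

    send_to_pi is an assumed helper: it takes a command string, delivers it
    to the Pi by whatever transport you choose, and returns text to speak.
    """
    request = event.get("request", {})
    request_type = request.get("type")

    if request_type == "LaunchRequest":
        # The skill was opened without a specific command; keep listening.
        return build_speech_response("What should I do?", end_session=False)

    if request_type == "IntentRequest":
        intent_name = request.get("intent", {}).get("name", "")
        command = INTENT_TO_COMMAND.get(intent_name)
        if command is None:
            return build_speech_response("Sorry, I don't know that command.")
        return build_speech_response(send_to_pi(command))

    # Any other request type (for example a session end): say nothing.
    return {"version": "1.0", "response": {"shouldEndSession": True}}


if __name__ == "__main__":
    def fake_pi(command: str) -> str:
        # Stand-in for the real hardware link, used only for this demo.
        return f"Command {command} sent."

    demo_event = {
        "request": {"type": "IntentRequest", "intent": {"name": "ArmSystemIntent"}}
    }
    print(json.dumps(handle_request(demo_event, fake_pi), indent=2))
```

In a real deployment the Lambda entry point would call `handle_request` with a `send_to_pi` that actually reaches your hardware; keeping it injectable means the routing logic can be tested without any hardware attached.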

With all the authentication and connectivity laid bare, this is a lot more approachable. The question is no longer can you connect everything to voice control. The question becomes should you give control of everything over to one single online service?

21 thoughts on “Let Alexa Control Your Life; Guide To Voice-Enable Everything”

CRJEEA says:

December 26, 2015 at 7:24 am

Has anyone come across anything stand alone (that doesn’t require an Internet connection) that can still give reasonable reliability when it comes to translating speech into toggling pins but still be able to cope with a fairly large number of different commands and parameters?

1. Mike Szczys says:
  
  December 26, 2015 at 7:34 am
  
  There must be something because my car can do voice commands fairly well and it has no internet connection.
  
  1. bigbob says:
    
    December 26, 2015 at 8:13 am
    
    Does it respond to only a very select few commands?
    
    If your car has OnStar, it has an internet connection…
    
    1. Xeon says:
      
      December 26, 2015 at 8:16 am
      
      I have an i20. The trick here is that the commands are very small in scope and defined,
      so it’s way easier for the code to figure it out.
      
    2. Mike Szczys says:
      
      December 26, 2015 at 12:54 pm
      
      No, you can call out music on an SD card and it will play that artist/album/track. This vehicle doesn’t have OnStar.
      
2. Dominic Nguyen says:
  
  December 26, 2015 at 7:53 am
  
  You can check out the Jasper project.
  http://jasperproject.github.io/documentation/
  http://hackaday.com/2014/04/09/create-your-own-j-a-r-v-i-s-using-jasper/
  
3. k-ww says:
  
  December 26, 2015 at 8:31 am
  
  http://www.mikroe.com/click/speakup/ for $39, it’s worth looking at.
  
4. Thoquz says:
  
  December 26, 2015 at 11:56 am
  
  https://en.wikipedia.org/wiki/CMU_Sphinx
  
5. TacticalNinja says:
  
  December 26, 2015 at 3:56 pm
  
  I think the reason for cloud-based voice recognition is the sheer amount of data that your voice is cross-matched with (i.e. the whole English database, plus other languages and accents). If you have a fairly strong computer, you can probably implement your own voice recognition, or find a way to download the database.
  
  1. Required says:
    
    December 27, 2015 at 11:39 am
    
    Unfortunately the datasets are closely guarded. These days machine learning is easy, so most of the competitive advantage comes from having large, high-quality datasets. Strangely enough, nobody wants to share. Perhaps someone can find a large collection of transcribed text, or a large collection of people reading books?
    
    1. Daniel says:
      
      December 29, 2015 at 5:32 pm
      
      Librivox.org?
      
6. Dan says:
  
  December 27, 2015 at 3:03 pm
  
  You will need to do a bit of research to confirm this, but anything running Android Jelly Bean (or above) should be able to use offline voice recognition. So that means the Raspberry Pi 2.
  
7. Gretchen Hall says:
  
  December 28, 2015 at 11:20 am
  
  Mac OS X dictation has an option to do offline dictation – you have to download a fairly large dataset, but once you do, you can dictate without any internet access.
  
  It is most definitely not open, and I’ve never used the speech recognition APIs, so I’m not sure what exactly is feasible in terms of programmability, but it might be worth a look if you use Macs.
  
8. rollinns says:
  
  December 28, 2015 at 6:07 pm
  
  Here’s a good place to start:
  https://en.wikipedia.org/wiki/Speech_recognition_software_for_Linux
  
  I tried getting Palaver to work, it’s based on Google’s voice recog. API, but I could never get it working. I was mainly interested in voice dictation for notes, email, etc. Maybe it was a hardware issue but still, it never worked on two different computers I tried it on.
  https://github.com/markmandel/Palaver
  http://www.linux.com/news/embedded-mobile/mobile-linux/711479-palaver-taps-googles-voice-technology-for-linux-speech-recognition/
  
  I
  
9. Joe says:
  
  July 10, 2017 at 10:12 am
  
  CMU pocketsphinx can run on a raspi; in fact it’s the offline speech recognition backend for Android. I wrote a Python script ages ago to control music playback and it was pretty good (on my desktop; I didn’t have a Pi then). The trick is that it’s context based, so it needs to know what words in its vocabulary go together in order to reduce errors. I had to make a bash script to give it every possible combination of “[wake word] play [song] by [artist]”.
  
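
The phrase enumeration Joe describes can be sketched in a few lines of Python. This is not his script (he used bash, and it is not shown); the function name, the wake word, and the sample music library are all made up for illustration. It only produces the list of sentences; turning that list into a language model for a recognizer is outside this sketch.

```python
from itertools import product
from typing import Dict, List


def build_corpus(wake_words: List[str], library: Dict[str, List[str]]) -> List[str]:
    """Return every "<wake word> play <song> by <artist>" sentence.

    library maps each artist to the songs by that artist, so only
    song/artist pairs that really exist are generated.
    """
    sentences = []
    for artist, songs in library.items():
        # Pair every wake word with every song belonging to this artist.
        for wake_word, song in product(wake_words, songs):
            sentences.append(f"{wake_word} play {song} by {artist}".lower())
    return sentences


if __name__ == "__main__":
    # Hypothetical sample data, purely for demonstration.
    sample_library = {
        "Queen": ["Bohemian Rhapsody", "Under Pressure"],
        "Daft Punk": ["Get Lucky"],
    }
    for line in build_corpus(["computer"], sample_library):
        print(line)
```

Restricting the corpus to real song and artist pairings keeps the vocabulary small, which is the point of the trick: the recognizer only has to choose among sentences it has already seen.
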
Jarek says:

December 26, 2015 at 12:06 pm

it’s encouraging to see the progress of this project go from:
“use only amazon servers and only on amazon devices to only buy amazon stuff” to
“use only amazon servers and only on amazon devices to do anything” to
“use only amazon servers on any devices to do anything”

almost like a believable plan an AI would cook up to convince humanity to become interested in it and install it on all devices…

brb calling spielberg

dolo724 says:

December 26, 2015 at 1:24 pm

“Toaster: Lighten up!”

1. Hirudinea says:
  
  December 26, 2015 at 4:21 pm
  
  “Toaster? I find that offensive, now I’ll have to destroy the 12 colonies!”
  
ZPeter says:

December 27, 2015 at 4:14 pm

Does anyone know of a high-quality, low-cost microphone setup that can be used with speech recognition?

1. Dan says:
  
  December 27, 2015 at 6:00 pm
  
  The cheapest voice controller option is a cheap Android phone, one that can be rooted and has Wi-Fi and Bluetooth so it can talk to all your IoT modules. All that for $50; nothing else comes close in terms of value for money. Look around and you may even get one that fits on your wrist. How can even the smartest hack beat that? It is a classic example of “economies of scale.”
  
Duane Stein says:

November 30, 2017 at 1:53 pm

Doesn’t anyone realize the ramifications of the very first sentence? “Let Alexa Control Your Life” REALLY?!??!

