It’s the 21st century, and we’re still a long way from the voice-controlled computers we were all promised in the 60s, 70s, 80s, and 90s. The state of voice interaction has improved, though, and Amazon’s release of the Alexa Skills Kit (ASK) is another sure step towards a future of computers that will pay attention to you. This allows any hardware to become Alexa, your personal voice assistant with the ability to do just about anything you command.
Up to this point, Alexa was locked away inside the Amazon Echo, the ‘smart’ cylinder that sits in your living room and does most of what you tell it to do. Since the Amazon Echo was released, we’ve seen the Echo and the Alexa SDK used for turning lights on and off, controlling a Nest thermostat, and other home automation tasks. It’s not Google Now, Microsoft’s Cortana, or Apple’s Siri that is behind all these builds; it’s Amazon’s Alexa that is bringing us into a world where Star Trek’s [Scotty] talking into an old Mac is seen as normal.
Right now, the Getting Started guide for the Alexa Skills Kit is focused more on web services than turning lights on and air conditioning off. Sample code for ASK is provided in JavaScript and Java, although we would expect 3rd party libraries for Python to start popping up any day now. If you want to run ASK on a Raspberry Pi or other small Linux computer, you’ll need a way to do voice capture; the Jasper project is currently the front-runner in this space.
We hope this changes the home automation game in a couple of different ways. First, the ASK processes everything in the cloud so very low power devices are now ready for some seriously cool voice interaction. Second, Amazon’s move to open up what you can do with the software backend means a community developing for the hardware could eventually exert pressure on Amazon to do things like making the system more open and transparent.
Already working on some hacks with the Echo or ASK? Send in a tip to your write-up and tells us about it in the comments below.
We could really use some people smarter than I to make Echo work with Lowes Iris system
Step one is to hack the Iris system to get actual control over it. Sadly there is not much out there on hacking it.
Of course it had to escape. That thing costs as much as a good tabled and does almost nothing that a tablet does.
An echo would be interesting if it costs 10X less and you can say it is worth putting one here and there…. if you don’t care about privacy, of course.
I mostly use my Echo for listening to room-filling music with super-convenient voice control. A tablet costs almost as much as an Echo and does nothing that an Echo does. You’re trying to compare apples to pig’s feet.
Correct me if I’m wrong, but once again instead of a nice SDK that we can use OFFLINE, e.g. even after an asteroid hits amazon or the government of your country desides to disconnect itself from the internets, we are given an API to integrate our stuff with their web services. In other words – useless.
In other words, you expect server grade/distributed voice processing in a little box in your living room. Lol. Speech recognition is usually VERY proprietary or heavily licensed.
And probably uses some serious horsepower in a short period of time to provide timely responses. Natural language processing doesn’t really run well on a cell phone even today.
Correct me if I’m wrong, but once again instead of a nice SDK that we can use OFFLINE, e.g. even after an asteroid hits amazon or the government of your country desides to disconnect itself from the internets, we are given an API to integrate our stuff with their web services. In other words – useless.
We heard you the first time Andy.
BTW look up Jasper http://jasperproject.github.io/documentation/ if you want to do this yourself and or http://caffe.berkeleyvision.org/ Caffe has a neural network so you can use it for a number of things.
Use Sirius instead, it’s already built for this:
http://sirius.clarity-lab.org/
So you would need an internet connection to turn your lights, HVAC system, and other home necessities on or off? Yeah, that will never go wrong in any way…
Makes me glad I know how to operate a power switch. Seriously, is that becoming a lost art?
A friend of mine recently teached me a great response for this case: “97%”.
That’s the amount of uptime the biggest telecommunications provider here in Germany will guarantee. In other terms: If your internet connection is out for one and a half weeks per year, you’re still not below 97%. This usually shuts down even the most fervent “let’s put everything in ‘the cloud'” apologists.
“will guarantee” is a long way from “likely to get”. I certainly don’t remember the last time I had even a *day* of downtime with my various broadband connections, let alone a week’s worth in a year that wasn’t a result of me changing address or provider.
The other word that’s useful here is “fallback”. If your keys and physical switches still work, then the choice is between using your old setup 100% of the time, or just 3% of the time (if we take your 97% uptime estimate).
I have had more than that much downtime in the past year, but that was due to lightning.
Comcast decided that my DOCSIS 2 cable modem (That I own) was too old so they gave me a letter telling me I need to upgrade to DOCSIS 3. Then a week later they set their equipment to deny my modem part of the authentication step of connecting. So it would constantly get a signal, lock onto the uplink and downlink, try to authenticate and then get kicked off of the network and start over. It took me a couple of days to get a DOCSIS 3 modem that they liked and get it setup with them.
Not the only time I’ve ever had issues.
That’s the SLA, not the resulting reality. They want to avoid refunds, and only an idiot will promise you 100% uptime. Also, that’s the SLA for one provider. It’s not so tough to have two (e.g. GPRS/DSL backup).
One day there will be a hack story where someone has used a switch, that shkles father had in a box and shkle rediscovered it, unless of course the world of humans ends in an attack of echo before that.
Not a switch but you’ll get the idea.
http://www.themathlab.com/writings/short%20stories/feeling.htm
Adding voice control to your system does not magically render the local control (i.e. buttons on your thermostat, or zWave enabled light switch) inoperable. It adds an extra layer of convenience.
It also adds an extra layer of failure prone equipment and infrastructure.
A quality light switch can easily outlast the house it’s installed in.
True but you can not usually turn it on if you are at work and are going to be coming home later than expected.
Why not this for Pi?
https://github.com/amzn/alexa-avs-raspberry-pi
Alexa for RPi
Glad to see you deliver your mail by word of mouth, on foot, lest the email go wrong and your words be twisted somehow. Luddite.
Given the lilt of these comments so far, the banner image is a perfect choice ;-)
I wonder if it can sing Daisy :D
“Alexa, sing Daisy Bell.”
“Sorry, I can’t understand the question I heard.”
“Alexa, sing Bicycle Built for Two.”
“Hmm… I can’t find the answer to the question I heard.”
Should be fixable with the Skills Kit.
Isn’t it funny how in all old movies the supercomputer/A.I. is a box that does it all, under the possibility that the user can smash it and kill it instead of a wireless microphone/camera/screen connected to a server of which the user has no control?
Everything in IT goes in cycles, standalone servers, central mainframes with dumb terminals, decentralised client/server model, central cloud(servers) with dumbish clients. Next up distributed decentralised encrypted cliver (cli[ent/ser]ver).
Which is what Microsoft did with Windows 10: defaults to using your devices bandwidth to stream updates to or from other devices on your local network or ANYWHERE.
From what I can see M$ are aiming to eventually hold all your data (or a copy/index of it) centrally and also run applications centrally (365). And you can have a dumbish pretty eye candy client.
Me personally I hate that a company decides what and when updates are pushed to my machine, feels very big brother (post Snowden, why would anyone trust this to a US based company this much)
Can’t wait for NSA data backup/retrieval plus home control/security service. (Central Services anyone!?) May be they make a law and require that you pay for it just like Obamacare one of these days.
What does a tekkieNEET care? Momma will be paying the bills.
In case anyone is wondering what I’m referring to, check this out: http://knowyourmeme.com/memes/neet
No-one’s wondering. Doesn’t really matter.
There is a python library out there to help with intent specification and data generation. It’s pretty basic though. https://github.com/anjishnu/ask-alexa-pykit
I don’t really see anything in this service that brings the capability outside of the Amazon Echo.
Regardless, seemingly the Echo still isn’t available outside of the US!
Wake me up when any of this is the slightest bit relevant :-)
The Alexa Voice Service, if implemented properly, is basically the “core” of Alexa. You could build something on a Raspberry Pi that acts like an Amazon Echo and combine it with the Alexa Skills Kit for extended commands. The only thing you don’t get with AVS out of the box is access to Amazon Music or Audible unless you get special approvals from Amazon for the AVS device you build.
I’ve been poking with the Voice Service reference software — it’s basically an Echo in a box.
Cool.
I’ll look forward to a Raspberry Pi implementation.
Already in the reference kit. ;)
https://developer.amazon.com/public/solutions/alexa/alexa-voice-service/docs/reference-implementation-guide
So, when will someone port this to the 1980’s Macintosh System?
I’m very enthused about this technology but I’m super cynical about using it. I would love to have a A.I Personal Assistant but I would loath to have one that gossips, much less records every command I give it (and possibly everything else as well).
Until that A.I. (and the associated personal data on which it relies) is ‘contained’ wholly within an enclosure I control (or at least well-understand), I’m going to forgo having one too near me. I’m not sure I’ll live to see processing power small enough–local enough–to ever provide that.
Yea, I know I’m being ‘tracked’ anyway on so many different levels that one more probably makes little difference, but I just don’t want to give up entirely.
Maybe you aren’t as interesting as you think you are.
Go feed a wraith Rodney.
BTW, Google keeps recordings of you every time you use the voice features in Android. Any time you say “Ok Google” and ask it a question. https://history.google.com/history/audio?utm_source=help
They also save your location history from any/all devices that can be tracked on, such as a phone or tablet. Even if the device doesn’t have GPS. It still works with wifi.
I have set up a home heating system based on a couple of ESP8266’s and a small OLED screen. I have recently added voice control via the amazon echo ASK SDK. (Took all of two hours to set up by modifying the sample provided by Amazon!). The web service for the ASK is running on a raspberry pi that is connected to a TV running OSMC (its always on). It works a lot better than I had anticipated given that the convoluted path. Voice processing is done on the Amazon server > Talks to the raspberry pi > talks to the ESP on the local network.
Rob.
Wow, could you post a blog on your setup, or even better, a howto guide? It sounds awesome!
Why not just use this for Pi
https://github.com/amzn/alexa-avs-raspberry-pi
Alexa for RPi