Google AIY: Artificial Intelligence Yourself

May 4, 2017

When Amazon released the API to their voice service Alexa, they basically forced any serious players in this domain to bring their offerings out into the hacker/maker market as well. Now Google and Raspberry Pi have come together to bring us ‘Artificial Intelligence Yourself’ or AIY.

A free hardware kit made by Google was distributed with Issue 57 of The MagPi magazine, which is targeted at makers and hobbyists; you can see it in the video after the break. The kit contains a Raspberry Pi Voice HAT, a microphone board, a speaker, and a number of small bits to mount the kit on a Raspberry Pi 3. Putting it all together and following the instructions on the official site gets you a Google Voice Interaction Kit with a bunch of I/Os just screaming to be put to good use.

The source code for the Python app can be downloaded from GitHub and consists of a loop that awaits a trigger. This trigger can be a press of a button or a clap near the microphones. When a trigger is detected, the recorder function takes over and sends the audio stream to the Google Cloud. Speech-to-text conversion happens there, and the response comes back through a text-to-speech engine that lets the system talk back. The repository suggests that the official Voice Kit SD image (893 MB download) is based on Raspbian, so don’t go reflashing a memory card right away; you should be able to add this to an existing install.
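
The trigger loop described above can be sketched roughly as follows. This is not the code from the GitHub repository: `run_assistant_loop` and the four callables it takes (`wait_for_trigger`, `record_audio`, `recognize`, `speak`) are hypothetical names chosen for illustration, and the echo-style reply is only a stand-in for whatever the cloud service actually returns.

```python
from typing import Callable, Optional


def run_assistant_loop(
    wait_for_trigger: Callable[[], bool],
    record_audio: Callable[[], bytes],
    recognize: Callable[[bytes], Optional[str]],
    speak: Callable[[str], None],
) -> int:
    """Run trigger -> record -> recognize -> speak until the trigger source stops.

    All four callables are hypothetical placeholders, not the AIY API.
    wait_for_trigger blocks until a button press or clap and returns True,
    or returns False to shut the loop down.
    Returns the number of requests handled.
    """
    handled = 0
    while wait_for_trigger():
        audio = record_audio()          # capture the spoken request
        text = recognize(audio)         # cloud speech-to-text in the real kit
        if text is None:
            speak("Sorry, I did not catch that.")
        else:
            speak("You said: " + text)  # stand-in for the real response
        handled += 1
    return handled
```

Passing the hardware and network pieces in as functions means the loop can be exercised with simple fakes before a microphone or a cloud account is involved.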

And if you don’t have access to the official kit yet but are just itching to give it a try, look no further. Google was kind enough to put up a guide to adding Google Assistant support to the Raspberry Pi 3. The single-board computer already has a speaker output, and there is a plethora of USB microphones out there that will do the job. USB sound cards work just fine as well, and after you follow the instructions to set up the Google SDK, you’ve got yourself an Assistant.

If you want to complete the Google AIY Kit experience, you will have to do a bit of hacking. Adding a push button to trigger the Assistant Script is pretty simple and if someone wants to add a DIY Clap Trigger instead, go right ahead.
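
For anyone going the DIY clap trigger route, one simple approach is to watch for a sudden jump in loudness. The sketch below is an assumption-laden illustration, not the kit's own detector: it assumes 16-bit signed little-endian mono PCM audio, and the frame size, the threshold, and the names `frame_rms` and `find_claps` are made up for this example and would need tuning against a real microphone.

```python
import math
import struct
from typing import List


def frame_rms(frame: bytes) -> float:
    """Root-mean-square level of a frame of 16-bit signed little-endian samples."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def find_claps(pcm: bytes, frame_samples: int = 256,
               threshold: float = 8000.0) -> List[int]:
    """Return indices of frames where the level rises above the threshold.

    Only the first loud frame of each burst is reported, so one clap that
    spans several frames counts once. The default values are guesses.
    """
    frame_bytes = frame_samples * 2
    claps = []
    was_loud = False
    for index, start in enumerate(range(0, len(pcm), frame_bytes)):
        loud = frame_rms(pcm[start:start + frame_bytes]) >= threshold
        if loud and not was_loud:
            claps.append(index)
        was_loud = loud
    return claps
```

A plain loudness threshold will also fire on a slammed door or a dropped tool, so a real trigger would probably want some extra filtering, such as requiring two claps within a short window.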

35 thoughts on “Google AIY: Artificial Intelligence Yourself”

Joe says:

May 4, 2017 at 8:37 am

So you have to make your own hotword detection

1. RicoElectrico says:
  
  May 4, 2017 at 8:47 am
  
  Are there any reasonable (open-source, easy to use, production quality) libraries for this task?
  
  1. RicoElectrico says:
    
    May 4, 2017 at 8:47 am
    
    Oh, and easy to train.
    
    1. Alex says:
      
      May 4, 2017 at 2:06 pm
      
      The best fully open-source one I know of is Pocketsphinx, which is a full speech recognition engine, or if you’re willing to use a partially open-source one then Snowboy is supposed to be quite good – the only issue is that you have to train the voice model online, but after that the recognition runs completely offline.
      
Haydn says:

May 4, 2017 at 8:47 am

sold out :(

Rog Fanther says:

May 4, 2017 at 8:48 am

I’ll wait for them to launch something that can be installed on the Pi and run disconnected from the cloud.

1. Doc Oct says:
  
  May 4, 2017 at 8:52 am
  
  I’d also prefer to have something that can run outside of Google’s servers and therefore out of their sight. It creeps me out enough that they cache all queries you make via the Google Now voice questions.
  
2. CodeReclaimers says:
  
  May 4, 2017 at 9:01 am
  
  Yeah I’m not a fan of all these AI products that run in a black box on somebody else’s machines. I’ve already got enough devices reporting on my behavior, I don’t need to add more with a hobby project.
  
  Surely there’s a voice recognition approach that’s lightweight enough to run on a Pi or some other hobby-sized machine?
  
  1. Alphatek says:
    
    May 4, 2017 at 2:23 pm
    
    20+ years ago, I saw good voice-independent recognition on an ARM7 @ 30MHz, so you’d hope so
    
  2. Nitori says:
    
    May 4, 2017 at 9:38 pm
    
    Yah I want to see a project where all the software runs inside the actual device or at least on your own server.
    
    1. demokrit011 says:
      
      May 6, 2017 at 4:51 am
      
      Have you heard of https://mycroft.ai/ ? I don’t really know how fast/responsive it is, as I have only read about it, but this would be your host-it-yourself solution. Otherwise there was once Sirius, which has evolved into lucida.ai, I think…
      
3. salec says:
  
  May 4, 2017 at 10:13 am
  
  Perhaps you can use Google cloud to fast train your own ANN? If an embedded machine can encompass it, that is. Feed it broadcasts and Google obtained records of the same, so that you don’t compromise your own privacy.
  
4. Rog Fanther says:
  
  May 4, 2017 at 10:28 am
  
  Not even for privacy, but sometimes you want to install something using this in a place without an internet connection.
  
  Also, about running on the Pi: we have had reasonably workable voice recognition since the OS/2 days. Dragon Dictate and others worked on 386s and 486s. Given the processing power of the Pi, I would hope it is more than capable of running something like this. Maybe if Google would really code something for it, instead of piling some bloated Java libraries onto the task.
  
5. nic0mac says:
  
  May 4, 2017 at 10:39 am
  
  Can’t seem to find how to do it in the instructions, but their developer blog page does say “instructions to build a Voice User Interface (VUI) that can use cloud services (like the new Google Assistant SDK or Cloud Speech API) or run completely on-device.” So I’m guessing that there is a way to run it offline, just with less functionality depending on your device, or maybe I’m wrong…
  
  1. Jklu says:
    
    May 4, 2017 at 12:10 pm
    
    It seems to be in the config:
    https://github.com/google/aiyprojects-raspbian/blob/master/config/voice-recognizer.ini.default#L7
    
    1. nic0mac says:
      
      May 4, 2017 at 1:02 pm
      
      Cool, thanks for not telling me I was wrong; I hadn’t looked that hard yet. This might make a better way to access the security feed of the driveway on the TV, ’cause you know, finding the remote and pressing 3 buttons or just looking out the window when I hear a car pull up is way too hard sometimes.
      
6. Jklu says:
  
  May 4, 2017 at 12:05 pm
  
  You might want to have a look at Jasper ( http://jasperproject.github.io/documentation/configuration/ ) which offers a choice of speech recognition engines including 2 offline variants
  
Ken Quast says:

May 4, 2017 at 8:54 am

More data and information for Google. Yea!

1. Clovis Fritzen says:
  
  May 4, 2017 at 10:07 am
  
  Using our data is their business, that has not been a surprise to anyone since 1998, sir.
  
2. coromd says:
  
  May 4, 2017 at 4:24 pm
  
  It can’t hear you until you press enter or use something to trigger the detection. As of right now the only standalone device that can OK Google or run “always on” is the Google Home.
  
Tweepy says:

May 4, 2017 at 9:23 am

What’s the point of this? How is it different?
As long as GAFAM are in the loop, I won’t be using such a system.

1. ???? ???? says:
  
  May 4, 2017 at 2:33 pm
  
  It is an educational toy, if you need security and privacy such systems are not appropriate. However you can use voice APIs to help train your own voice-to-text-to-action system and then just run the local neural network when it is competent enough.
  
kryptylomese says:

May 4, 2017 at 9:47 am

Please make the kit available to purchase separately from the magazine?

1. notarealemail says:
  
  May 4, 2017 at 9:55 am
  
  https://developers.googleblog.com/2017/05/aiy-projects-voice-kit.html
  
  1. notarealemail says:
    
    May 4, 2017 at 9:58 am
    
    The Voice Kit ships out to all MagPi Magazine subscribers on May 4, 2017, and we’ve published a parts list, assembly instructions, source code and suggested extensions to our website: aiyprojects.withgoogle.com. The complete kit is also for sale at over 500 Barnes & Noble stores nationwide, as well as UK retailers WH Smith, Tesco, Sainsburys, and Asda.
    
    1. haydn says:
      
      May 4, 2017 at 10:37 am
      
      I think they may mean those UK retailers as stocking the magpi magazine, as they don’t stock raspi, or any other electronics stuff. I just returned from a couple that had empty spaces where magpi should have been, and all online places have sold out. £20 start price on ebay.
      
MCenderdragon says:

May 4, 2017 at 10:44 am

So how much data is sent to Google? Is Google only used for speech-to-text, or also for the complete AI task? If it were only the STT, then we could simply use a keyboard as input. But currently this sounds like: speech goes to Google, Google turns it into text and feeds an AI, the AI does its magic, Google sends back text, and text-to-speech on the Raspberry Pi talks to you.

???? ???? says:

May 4, 2017 at 2:29 pm

You can do all this, and a lot more with a $50 Android 6.0 phone, a custom App and any number of ESP8266 enabled “things”. Google’s offering is for children, not hardware hackers.

1. Fred says:
  
  May 5, 2017 at 8:19 am
  
  https://www.hackster.io/bastiaan-slee/nabaztag-gets-a-new-life-with-google-aiy-e9f2c8
  
  Looks like a hacker to me. and looks to be a good hack too.
  
  Now wind your neck in and get back in your hole and stop dumping on cool stuff…
  
  1. ???? ???? says:
    
    May 5, 2017 at 11:21 am
    
    I’m not “dumping” anything you are just kidding yourself about the fact that you can get everything and more already packaged for less money, this is a matter of verifiable facts and only a complete dickhead would deny it.
    
Low Grade Source says:

May 4, 2017 at 3:59 pm

There are a ton of speech recognition module(s) (<- search it) on Aliexpress. They are mostly limited to phrase recognition and have hard limits of around 20 to 170 phrases for ~$15-$80. Probably good enough for most uses but definitely not an AI interface. Need more phrases? get a second board ;)

Steve says:

May 4, 2017 at 4:26 pm

Got a Mag Pi and the kit this afternoon at an out-of-the-way WH Smiths near where I was working. All works quite nicely.
Was very easy to get it up and running.

The Voice Hat is really interesting though – lots of other breakouts on it – not just a speaker output, mic input and button input. The kit comes with a header strip to let you populate :

I2C and SPI breakouts

What look like 6 servo outputs and 4 ‘Drivers’ (motor drivers?). There are unpopulated DC-in barrel jack pads (the MagPi article boards show the jack populated) and some other jumpers too.

Not bad for a giveaway with a GBP £5.99 magazine.

1. Steve says:
  
  May 4, 2017 at 4:30 pm
  
  Meant unpopulated headers – not jumpers.
  
oliv4945 says:

May 7, 2017 at 1:28 am

Open source Jarvis project can avoid working with Google :-)
https://github.com/alexylem/jarvis

Shandy says:

May 9, 2017 at 2:08 am

All good, but the limitation of having to allow a connection to Google’s cloud is a “pain in the ass”, especially when you want to make use of it on a phone in a country back lane; it can’t even send a text without being online.

They definitely need to make an offline version.


Hackaday
