Siri Controls Your PC Through Python And Gmail

April 27, 2017 by Lewin Day 8 Comments

Voice-based assistants are becoming more common on devices these days. Siri is known for being particularly good at understanding natural language and delivering snarky responses. In comparison, Google’s Assistant is only capable of the most obvious commands, and this writer isn’t even sure Microsoft’s Cortana can understand English at all. So it makes sense then, if you want voice control for your PC, to choose Siri as your weapon of choice. [Sanjeet] is here to help, enabling Siri to control a PC through Python.

The first step is hooking up the iPhone’s Notes app to a Gmail account. [Sanjeet] suggests using a separate account for security reasons, as you’ll need to place the username and password in a Python script. The Python script checks the Gmail account every second, looking for new Notes from the iPhone. Then, it’s as simple as telling Siri to make a Note (for example, “Siri, Note shutdown”) and the Python script can then pick up the command, and act accordingly.
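The polling idea described above can be sketched in a few lines of standard-library Python. This is a minimal sketch and not [Sanjeet]’s actual script: it assumes the synced Notes show up in a Gmail IMAP mailbox named "Notes", that the note text arrives as the message subject, and that IMAP access is enabled on the account. The `COMMANDS` table and all function names are hypothetical.

```python
import email
import imaplib
import subprocess
import time

# Hypothetical mapping from the text of a note to a command line to run.
# These argument lists assume Windows; adjust them for your platform.
COMMANDS = {
    "shutdown": ["shutdown", "/s", "/t", "0"],
    "lock": ["rundll32.exe", "user32.dll,LockWorkStation"],
}


def note_to_command(raw_message):
    """Return the command for a raw RFC 822 note, or None if unrecognized."""
    msg = email.message_from_bytes(raw_message)
    # Assumption: the first line of the note is stored as the Subject header.
    subject = str(msg["Subject"] or "").strip().lower()
    return COMMANDS.get(subject)


def fetch_new_notes(conn):
    """Yield the raw bytes of each unread message in the selected mailbox."""
    status, data = conn.search(None, "UNSEEN")
    if status != "OK":
        return
    for num in data[0].split():
        # Fetching the full message also marks it as read on the server.
        status, parts = conn.fetch(num, "(RFC822)")
        if status == "OK" and parts and isinstance(parts[0], tuple):
            yield parts[0][1]


def run_forever(user, password, interval=1.0):
    """Poll the (assumed) "Notes" mailbox and run any recognized command."""
    conn = imaplib.IMAP4_SSL("imap.gmail.com")
    conn.login(user, password)
    conn.select("Notes")
    while True:
        conn.noop()  # give the server a chance to report new messages
        for raw in fetch_new_notes(conn):
            command = note_to_command(raw)
            if command:
                subprocess.run(command, check=False)
        time.sleep(interval)
```

Calling `run_forever("throwaway@example.com", "password")` would start the loop. Keeping the lookup in `note_to_command` separate from the network code means the command table can be tested without touching a mailbox, and anything not in the table is silently ignored.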

It’s a quick and easy way to get Siri to do your bidding. There are other fancy ways to do it, too, like capturing Siri’s WiFi data on your home network.

A Smart Wand For All Us Muggles

November 26, 2016 by Rich Hawkes 10 Comments

Arthur C. Clarke said that “any sufficiently advanced technology is indistinguishable from magic.” Even though we know that something isn’t “magic”, it’s nice to see how close we can get. [Dofl] and his friends, big fans of the magic in Harry Potter, thought the same thing, and decided to create a magic wand that they could use themselves.

The wand itself is 3D printed and has a microcontroller and WiFi board, a voice recognition board, a microphone, and a vibrating motor stuffed inside. The wand converts the voice into commands, and since it is connected to WiFi, those commands can be used to communicate with your WiFi connected lights (or your WiFi connected anything, really). Five voice commands are recognized, covering turning music and the lights on and off, plus a “summon” command, which is used in the video to request a hamburger from delivery.com. For feedback, the motor vibrates when a command is recognized.

There’s not much technical information in the original article, but I’m sure our readers could figure out the boards used and could suggest some alternatives to get the wand’s form factor down a bit. Over the years, other wands have appeared on our pages, using some different technologies. It’s a fun way to interact with the environment around you, even if you know the “magic” involved is just boring old technology.

Seeed Studio’s ReSpeaker Speaks All The Voice Recognition Languages

August 25, 2016 by Moritz Walter 22 Comments

Seeed Studio recently launched its third Kickstarter campaign: ReSpeaker, an open hardware voice interface. After their previous Kickstarted IoT hardware, such as the RePhone, mostly focused on connectivity, the electronics manufacturer from Shenzhen now tackles another highly contested area of IoT: voice recognition.

The ReSpeaker Core is a capable development board based on Mediatek’s MT7688 WiFi module and runs OpenWrt. Onboard is a WM8960 stereo audio codec with integrated 1W speaker/headphone driver, a microphone, an ATMega32U4 coprocessor, 12 addressable RGB LEDs and 8 touch sensors. There are also two expansion headers with GPIOs, I2S, I2C, analog audio and USB 2.0 and an onboard microSD card slot.

The latter is especially useful to feed the ReSpeaker’s integrated speech recognition engine PocketSphinx with a vocabulary and audio file library, enabling it to respond to keywords and commands even when it’s not hooked up to the internet. Once it’s online, ReSpeaker also supports most of the available cloud based cognitive speech recognition services, such as Microsoft Cognitive Service, Amazon Alexa Voice Service, Google Speech API, Wit.ai and Houndify. It also comes with an SDK and Python API, supports JavaScript, Lua and C/C++, and it looks like the coprocessor features an Arduino-compatible bootloader.

The expansion header accepts shield-like hardware add-ons. Some of them are also available through the campaign. The most important one is the circular, far-field microphone array. Based on 7 XVSM-2000 digital microphones, the extension board enhances the device’s hearing with sound localization, beam forming, reverb and noise suppression. A Grove extension board connects the ReSpeaker to Seeed’s current lineup of ready-to-use sensors, actuators and other peripherals.

Seeed also cooperates with the Meow King Audio Electronic Company to develop a nice tower-shaped enclosure with built-in speaker, 5W amplifier and battery. As a portable speaker, the Meow King Drive Unit (shown on the right) certainly doesn’t knock your socks off, but it practically turns the ReSpeaker into an open source version of the Amazon Echo — including the ability to run offline instead of piping everything you say to Big Brother.

According to Seeed, the freshly baked hardware will ship to backers in November 2016, and they do have a track record of shipping Kickstarter rewards on schedule. At the time of writing, some of the Crazy Early Birds are still available for $39. Enjoy the campaign video below and let us know what you think of this hardware in the comments!

Talking Star Trek

June 8, 2016 by Al Williams 34 Comments

Speech generation and recognition have come a long way. It wasn’t that long ago that we were in a breakfast place and endured 30 minutes of a teenaged girl screaming “CALL JUSTIN TAYLOR!” into her phone repeatedly, with no results. Now speech on phones is good enough you might never use the keyboard unless you want privacy. Every time we ask Google or Siri a question and get an answer it makes us feel like we are living in Star Trek.

[Smcameron] probably feels the same way. He’s been working on a Star Trek-inspired bridge simulator called “Space Nerds in Space” for some time. He decided to test out the current state of Linux speech support by adding speech commands and response to it. You can see the results in the video below.

You Speak, Your Scope Obeys

March 29, 2016 by Elliot Williams 17 Comments

We’ve been scratching our heads about the various voice-recognition solutions out there. What would you really want to use one for? Turning off the lights in your bedroom without getting up? Sure, it has some 2001: A Space Odyssey flair, but frankly we’ve already got a remote control for that. The best justification for voice control, in our mind, is controlling something while your hands or eyes are already busy.

[Patrick Sébastien Coulombe] clearly has both of his hands on his oscilloscope probes. That’s why he developed Speech2SCPI, a quick mash-up of voice recognition and an oscilloscope control protocol. It combines the Julius open-source speech recognizer project with the Standard Commands for Programmable Instruments (SCPI) syntax to make his scope obey his every command. You’ve got to watch the video below the break to believe how well it works. It even handles his French accent.
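To give a feel for the glue between a recognizer and a scope, here is a minimal sketch of turning recognized phrases into SCPI strings. It is not Speech2SCPI’s code. The phrase table is hypothetical: `*IDN?` and `*RST` are IEEE 488.2 common commands, while `:RUN` and `:STOP` are assumed here and differ between scope vendors. Sending over a raw TCP socket on port 5025 is also an assumption that depends on the instrument.

```python
import socket

# Hypothetical phrase-to-command table. "*IDN?" and "*RST" are IEEE 488.2
# common commands; ":RUN" and ":STOP" are assumed and vary by scope vendor.
PHRASES = {
    "identify": "*IDN?",
    "reset": "*RST",
    "run": ":RUN",
    "stop": ":STOP",
}


def phrase_to_scpi(phrase):
    """Translate a recognized phrase into a SCPI command, or None."""
    return PHRASES.get(phrase.strip().lower())


def send_scpi(host, command, port=5025, timeout=2.0):
    """Send one command over a raw TCP socket; return the reply for queries."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command.encode("ascii") + b"\n")
        if command.endswith("?"):
            # Queries end in "?" and produce a one-line response.
            return sock.recv(4096).decode("ascii", "replace").strip()
    return None
```

With a scope on the network, `send_scpi("192.168.1.50", phrase_to_scpi("identify"))` would return its identification string (the address is made up). Unknown phrases map to `None`, so a misheard word never reaches the instrument as long as the caller checks for it.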

The Voice Recognition Typewriter

February 17, 2016 by Brian Benchoff 29 Comments

Typewriters with voice recognition have existed for over one hundred years; they were called secretaries. Robots are taking all the jobs now, and finally dictation and typing is a job that can be handled by a computer. [Zip Zaps] used an old Smith Corona typewriter to automate the process of turning dictation into print. Like a secretary hunched over an anachronistic IBM Selectric in the first season of Mad Men, this robot will take dictation and accept the overt sexism of a 1960s Manhattan ad agency.

Instead of the machinations of a few biological actuators, this typewriter is controlled with an array of servos driven by a Pololu Maestro servo controller. There are twelve servos that move a small actuator down onto the keys, and another twelve servos that move the others above the correct row of the keyboard. The carriage return lever is actuated by a stepper motor, linear rail, and giant plastic lever.

While a robot that can use a typewriter is impressive, the real trick is getting it to take dictation. [Zip Zaps] used the built-in voice recognition found in Windows for this, streaming characters over a serial port to the Arduino-based electronics.
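As a rough illustration of the host side, the sketch below looks up which key a character lives on and streams printable characters to a serial-like object one byte at a time. Everything here is an assumption: the four-row layout, the function names, and the idea that a column index selects one of the twelve key servos are ours, not [Zip Zaps]’ design. In practice the `port` argument could be a pyserial `serial.Serial` object; an in-memory buffer stands in for it here.

```python
import io

# Assumed key layout: four rows of up to twelve keys, so a character maps to
# one row position and one of twelve key-pressing servos. The real machine's
# layout and wiring are not described in the article.
ROWS = ["1234567890-=", "qwertyuiop", "asdfghjkl;'", "zxcvbnm,./"]


def locate_key(char):
    """Return (row, column) for a character, or None if it has no key."""
    char = char.lower()
    for row, keys in enumerate(ROWS):
        col = keys.find(char)
        if col != -1:
            return row, col
    return None


def stream_text(port, text):
    """Write typeable characters one at a time; return how many were sent."""
    sent = 0
    for char in text:
        # Spaces and newlines have no entry in ROWS but are still typeable.
        if char in " \n" or locate_key(char) is not None:
            port.write(char.encode("ascii"))
            sent += 1
    return sent


buffer = io.BytesIO()  # stands in for a serial port object
stream_text(buffer, "Hello, world\n")
```

Characters with no key in the table are dropped on the host rather than sent, which keeps the microcontroller side simple.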

Does it work? Yes, surprisingly it does. Is it useful? Well, typewriters naturally have a cleaner, more analog tone about them, and you can’t replicate the typing experience of an old Smith Corona typewriter with a digital format. This build is just the natural extension of what digital electronics are capable of these days, and we look forward to seeing someone with this amazing device in our local Starbucks.

Turning A Page With Your Voice

May 19, 2015 by Brian Benchoff 20 Comments

[Justin]’s friend [Steve] injured his spine a while ago, and after asking what would make [Steve]’s life simpler, the answer was easy. [Steve] missed reading books. Sure, e-readers exist, but you still need to turn the page. Now [Steve] can do that with his voice thanks to some microcontrollers, Bluetooth modules, and a voice recognition module.

A voice-activated page turner wasn’t the first attempt at giving [Steve] the ability to turn a page on a Kindle. The first prototype was a big blue button that sent a keyboard code for ‘right arrow’ over Bluetooth, turning a book one page at a time. This worked well until multiple pages turned, and with no back button it was a major nuisance.

After playing with the voice recognition in an Amazon Echo, [Steve] and [Justin] wondered if the same voice recognition technology could be applied to page turns on a Kindle. With a voice recognition Arduino shield from SparkFun it was easy to detect a ‘page down’ command. A Bluetooth module sends HID commands to a Kindle, allowing [Steve] to read a book with only his voice.
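For the curious, a keyboard HID “command” is just a small report. The sketch below builds the standard 8-byte boot-protocol keyboard report for a key press, followed by the all-zero release report. The usage IDs come from the USB HID Usage Tables; the command names and function names are hypothetical, and how the bytes get handed to a particular Bluetooth module is module-specific and not shown.

```python
# Keyboard usage IDs from the USB HID Usage Tables (Keyboard/Keypad page).
KEY_PAGE_DOWN = 0x4E
KEY_RIGHT_ARROW = 0x4F

RELEASE = bytes(8)  # all zeros: no modifiers and no keys pressed


def key_report(keycode, modifiers=0):
    """Build an 8-byte boot-protocol keyboard report with one key pressed."""
    # Layout: modifier bits, reserved byte, then up to six key codes.
    return bytes([modifiers, 0, keycode, 0, 0, 0, 0, 0])


def page_turn_reports(command):
    """Return the press and release reports for a recognized voice command."""
    # Hypothetical command names; the build's actual vocabulary may differ.
    keys = {"page down": KEY_PAGE_DOWN, "next": KEY_RIGHT_ARROW}
    keycode = keys.get(command.strip().lower())
    if keycode is None:
        return []
    return [key_report(keycode), RELEASE]
```

Sending the release report right after the press matters: without it the host sees the key as held down, which is one way to end up with several pages turning at once.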

[Justin] put all the design files for this build up on Github.