“Glasses” That Transcribe Text To Audio

March 19, 2025

Glasses for the blind might sound like an odd idea, given the traditional purpose of glasses and the issue of vision impairment. However, eighth-grade student [Akhil Nagori] built these glasses with an alternate purpose in mind. They’re not really for seeing. Instead, they’re outfitted with hardware to capture text and read it aloud.

Yes, we’re talking about real-time text-to-audio transcription, built into a head-worn format. The hardware is pretty straightforward: a Raspberry Pi Zero 2W runs off a battery and is outfitted with the usual first-party camera. The camera is mounted on a set of eyeglass frames so that it points at whatever the wearer might be “looking” at. At the push of a button, the camera captures an image, and then passes it to an API which does the optical character recognition. The text can then be passed to a speech synthesizer so it can be read aloud to the wearer.

It’s funny to think about how advanced this project really is. Jump back to the dawn of the microcomputer era, and such a device would have been a total flight of fancy—something a researcher might make a PhD and career out of. Indeed, OCR and speech synthesis alone were challenge enough. Today, you can stand on the shoulders of giants and include such mighty capability in a homebrewed device that cost less than $50 to assemble. It’s a neat project, too, and one that we’re sure taught [Akhil] many valuable skills along the way.

10 thoughts on ““Glasses” That Transcribe Text To Audio”

shinsukke says:

March 19, 2025 at 1:39 am

Very cool project, and by an eighth grader no less!

I am really out of touch with machine vision libraries etc. In my mind, they are still hard to use and out of reach, and OCR is unobtainable by an individual. Glad to be reminded I’m wrong though.

Report comment

Reply
1. ono says:
  
  March 19, 2025 at 3:39 am
  
  They´re not that scary or hard to get started with. You can even mandate some AI to help you design some (crude, to test and correct) examples how to use computer vision and ocr (python opencv rulez)
  
  Report comment
  
  Reply
Cheese Whiz says:

March 19, 2025 at 3:40 am

Kudos to Akhil, certainly far more impressive than anything I was doing in the eighth grade!

Report comment

Reply
GJ MOBEY says:

March 19, 2025 at 3:40 am

Probably more useful for dyslexics than the blind, but seriously cool.

Report comment

Reply
Mystick says:

March 19, 2025 at 4:15 am

Getting into Lobot territory.. but you keep it up, kid!

Report comment

Reply
Jan says:

March 19, 2025 at 7:37 am

Only thing is if your blind you dont know where to turn your glasses?

Report comment

Reply
1. James Davidson says:
  
  March 19, 2025 at 3:15 pm
  
  Blindness can be a big spectrum. There are plenty of people who have enough vision to be able to tell the shape of a book or a sign, but not enough to be able to clearly make out the letters.
  
  It’s a common misconception that blindness means total darkness.
  
  Report comment
  
  Reply
DC says:

March 19, 2025 at 12:17 pm

Very cool! I could see this expanding from not only reading out text, to also using object recognition so the wearer can understand their surroundings. AI tools could certainly be useful in identifying objects, pathways, etc.
Examples:
– “Closed door 30 degrees to your left, doorknob on righthand side of door”
– “Small object on ground ahead”
-“Turn right 10 degrees to stay on sidewalk/path

And many of the most commonly communicated instructions (such as navigation instructions) could be abbreviated, or even turned into distinct sounds the wearer would learn. You could also play these sounds/communications to the left ear, right ear, or both as a way of communicating if the object is in the left/right/center of the field of view.

Report comment

Reply
Hirudinea says:

March 19, 2025 at 2:17 pm

Maybe interface these with google translate, on the fly translation of foreign texts.

Parlez-vous français ? Non, mais je lis le français.

Report comment

Reply
I Alone Possess The Truth says:

March 19, 2025 at 2:36 pm

This could be good for legally blind folks and if it connected to Google Translate one could read Chinese, Arabic and so forth without having to type the text.

Report comment

Reply

Hackaday

“Glasses” That Transcribe Text To Audio

10 thoughts on ““Glasses” That Transcribe Text To Audio”

Leave a ReplyCancel reply

Search

Never miss a hack

If you missed it

A History Of Pong

Supersonic Flight May Finally Return To US Skies

The Death Of Industrial Design And The Era Of Dull Electronics

Power Grid Stability: From Generators To Reactive Power

Why Apple Dumped 2,700 Computers In A Landfill In 1989

Our Columns

Be More Axolotl: How Humans May One Day Regrow Limbs And Organs

Hackaday Links: July 27, 2025

Personalization, Industrial Design, And Hacked Devices

Hackaday Podcast Episode 330: Hover Turtles, Dull Designs, And K’nex Computers

This Week In Security: Sharepoint, Initramfs, And More

10 thoughts on ““Glasses” That Transcribe Text To Audio”

Leave a ReplyCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns