Make Your Bookshelf Clickable

February 15, 2024

We’ll confess that we have a fondness for real books and plenty of them. So does [James], and he decided he needed a way to take a picture of his bookshelves and make each book clickable to find more information. This is one of those things that sounds fairly simple until you decide to do it. You can try an example of the results and then go back and read about the journey it took to get there.

There are several subtasks involved. First, you want to identify each book’s envelope. It wouldn’t do to click on the Joy of Cooking and get information about Remembrance of Things Past.

The next challenge is reading the title of the book. This can be tricky. Fonts differ. The book could be upside down. Some titles go cross the spine, but most go vertically. The remainder of the task is fairly easy. If you know the region and the title, you can easily find a link (for Google Books, in this case) and build an SVG overlay that maps the areas for each book to the right link.

The optical character recognition is done with GPT-4. The prompt used is straightforward:

Read the text on the book spine. Only say the book cover title and author if you can find them. Say the book that is most promiment. Return the format [title] [author], with no punctuation.

With that information, a Google API will look up the book for you, and the rest is straightforward. You can grab the code on GitHub. We wonder how this method of OCR for difficult text would compare to more conventional methods. After all, OCR isn’t a hard problem. The complex problem is making it work well.

21 thoughts on “Make Your Bookshelf Clickable”

CJay says:

February 15, 2024 at 4:32 am

“After all OCR isn’t a hard problem”

Oh sweet summer child…

OCR isn’t hard *now* because we have incredible amounts of computing power and hi-res image capture but by god it was difficult when PCs still had RAM measured in MB and on 72 or even 30 pin modules.

What was *incredibly* impressive was the OCR and handwriting recognition technology employed by an organisation I occasionally visited back in the early 2000s

Reply
1. CJay says:
  
  February 15, 2024 at 4:37 am
  
  (I am of course well aware that Al knows already and is as old, if not older than I am)
  
  Reply
  1. Ostracus says:
    
    February 15, 2024 at 5:01 am
    
    The man dated carbon. As for the article that’s why I keep a bar code scanner around.
    
    Reply
2. Ostracus says:
  
  February 15, 2024 at 4:52 am
  
  ABBY Finereader, expensive but capable.
  
  Reply
  1. RunnerPack says:
    
    February 15, 2024 at 5:05 am
    
    Tesseract, free but workable.
    
    Reply
3. Lord Nothing says:
  
  February 15, 2024 at 5:22 am
  
  im really impressed by how you can take microwave scans of ancient scrolls while still rolled up (because unrolling them would destroy them), and still manage to scan in every character in ancient disused dialects and dead languages and turn it into actual text, and then translate it into english.
  
  in all of my tech life i never had to manually copy a single page of text. though fixing scan errors is a different matter. im old enough to remember that.
  
  Reply
  1. CJay says:
    
    February 15, 2024 at 5:25 am
    
    I’ve spent many hours over my career and personal life correcting OCR documents, it’s often a blessing but, like the output from the various “AI”, you really need to proof read it and understand what it’s saying.
    
    Reply
4. J_B says:
  
  February 17, 2024 at 11:32 pm
  
  Depends on the circumstances.
  If you have context, it is much easier than random text.
  That’s why snailmail in the late 60’s could be processed with OCR pretty well and extremely fast, even with limited computational resources.
  
  Reply
Wes says:

February 15, 2024 at 6:55 am

James P. Hogan predicted the scanning of books you couldn’t open in his 1977 Sci-Fi novel Inherit the Stars.
https://www.baen.com/inherit-the-stars.html

Reply
1. Gravis says:
  
  February 15, 2024 at 7:11 am
  
  Scanning of books you couldn’t open? How is it that you can’t open a book?
  
  Reply
  1. Shannon says:
    
    February 15, 2024 at 8:51 am
    
    There was a ‘hack chat’ about some books that couldn’t be opened just a few days ago.
    https://hackaday.com/2024/01/22/x-ray-investigations-hack-chat/
    
    TL;DR: some scrolls were entombed by the 79AD eruption of Vesuvius, they were recovered 250 years ago and have been waiting for the technology that might read them.
    
    Reply
  2. vancouverizer says:
    
    February 16, 2024 at 8:32 am
    
    In the story it is for books that are old and would be damaged of opened. The scifi machine suggests an MRI-like device with scanner and ocr, similar to how archeologists are doing it today, 37 years later.
    
    Quote: The image on the Trimagniscope tube was an enlarged view of one of the pocket-size books found on the body, which Dancheckker had shown them on their first day in Houston three weeks before. The book itself was enclosed in the scanner module of the machine, on the far side of the room. The scope was adjusted to generate a view that followed the change in density along the boundary layer of the selected page, producing an image of the lower section of the book only; it was as if the upper part had been removed, like a cut deck of cards. Because of the age and condition of the book, however, the characters on the page thus exposed tended to be of poor quality and in some cases were incomplete. The next step would be to scan the image optically with TV cameras and feed the encoded pictures in the Navcomms computer complex. The raw input would then be processed by pattern recognition techniques and statistical techniques to produce a second, enhanced copy with many of the missing character fragments restored.
    
    Reply
  3. Dr Hohn says:
    
    February 22, 2024 at 3:36 pm
    
    Brewing texts from the early 1800’s would crumble when opened (acid paper). In the 90’s I was scanning OCR/PDF to do research – only way to study the content.
    
    Reply
R Jay Rishel says:

February 15, 2024 at 8:02 am

next match this up with a projector and have it highlight the physical book when you click on it.

after that, have it tell you where to move it on the shelf to get the books sorted by genre and alphabetized by author name, using the projector to highlight where you need to move the book to.

Reply
Kaiser says:

February 15, 2024 at 8:25 am

Neat. I will have to look deeper into this.

I recently had a similar idea. But more to find specific books at second hand markets which sometimes are wildely unsorted. So with a few modifications, from what i have seen, it should work. have a list of books you want to buy. take a picture of an unsorted bookshelf. highlight positions of identified books :D

Reply
Nik says:

February 15, 2024 at 3:16 pm

I wil I can take a picture of the DVD movie shelf at Goodwill store and find what I need.
Let’s say I make a list of DVD’s what I am looking for and snapping the picture of the shelf will notify me about the DVD on the shelf to reduce time browsing all titles.

Or maybe a particular screw in my junk drawer.

Reply
1. Always in silicon says:
  
  February 15, 2024 at 6:17 pm
  
  There should not be a need to pre-make a list of titles you’re looking for.
  AI should be able to sort through all the available DVDs on the shelf, categorize them and then match them to your genre/actor/director known likes and probable likes (and of course then highlight where they are on the shelf).
  
  Reply
2. alanrcam says:
  
  February 16, 2024 at 4:22 pm
  
  Take a photo of a bunch of jigsaw puzzle pieces. Let the AI number them, and show which pieces fit together.
  You might want to limit the number of pieces per photo, to make it easier for the AI and yourself.
  
  Reply
FluffyGhostKitten says:

February 15, 2024 at 4:04 pm

There used to be an app for this. But they got bought and shuttered by Rakuten.

Reply
1. KingFishR says:
  
  February 18, 2024 at 12:27 am
  
  What was the name of this App?
  
  Reply
Diemer says:

February 17, 2024 at 7:07 pm

This is something I’ve wanted for at 15 years, but it was a lot harder to solve that problem back then. Hurrah!!!!

Reply

Hackaday

Make Your Bookshelf Clickable

21 thoughts on “Make Your Bookshelf Clickable”

Leave a Reply to J_BCancel reply

Search

Never miss a hack

If you missed it

Forced E-Waste PCs And The Case Of Windows 11’s Trusted Platform

Remotely Interesting: Stream Gages

Hands-On: EufyMake E1 UV Printer

A Brief History Of Fuel Cells

Big Chemistry: Fuel Ethanol

Our Columns

Pulling Back The Veil, Practically

Hackaday Podcast Episode 323: Impossible CRT Surgery, Fuel Cells, Stream Gages, And A Love Letter To Microcontrollers

This Week In Security: CIA Star Wars, Git* Prompt Injection And More

Researchers Are Slowly Finding Ways To Stem The Tide Of PFAS Contamination

FLOSS Weekly Episode 834: It Was Cool In 2006

21 thoughts on “Make Your Bookshelf Clickable”

Leave a Reply to J_BCancel reply

Search

Never miss a hack

Subscribe

If you missed it

Our Columns