How Search Engines Enabled Finding Needles In A WWW-Sized Haystack

When the World Wide Web surged into existence during the 1990s, we were introduced to the problem of how to actually find something in this ever-ballooning construction zone that easily outpaced even the fastest post-WW2 urban sprawl. Although domain names provided a way to find servers using DNS rather than having to mash in IP addresses, you still somehow had to know the relevant URL.

A range of solutions were thought up over time, ranging from printed Yellow Pages type guides, to online curated lists of resources, as well as things like web rings where one website would link to a relevant similar website. This was the time when word-of-mouth was also very relevant, with people proudly announcing their own website on Geocities or other hosting service.

Search engines already existed long before the WWW became the hot new thing during the 1990s, but it was the WWW that would really push them to their limits. As anyone who used search engines for the WWW can attest, they had many issues. Often you’d end up using multiple search engines to find something, and despite fierce competition between web search engines to become the starting page for their browser, actually finding things on the WWW remained a tough problem.

Since a web search engine ‘just’ has to index the WWW and match a search query against the results, why was this such a hard problem that persisted until Google apparently cracked the code?

Continue reading “How Search Engines Enabled Finding Needles In A WWW-Sized Haystack”

Pre-Web Hardware Connects To The Web

We’re not quite to the 25th anniversary of the world wide web, but that doesn’t mean the greatest innovation in information distribution since [Gutenberg]’s press can’t be celebrated a bit early, does it? [Suhayl] is throwing some of his hardware into the ring, and loading up the first web page with a modem from the mid 1960s and a teletype from the mid 70s. No, no sane person would have ever done this 25 years ago, but it’s neat to watch in any event.

The hardware [Suhayl] is using includes a Livermore Data Systems modem. It’s a finely crafted wooden box with an acoustic coupler on top, and a DB-25 connector on a side that connects to a terminal or computer via RS-232. If that Livermore Data Systems acoustic coupler modem looks familiar, you might be right. This modem was demoed back in 2009 by [phreakmonkey]. It’s an impressive little box that can connect to a remote system at up to 300 baud.

The I/O is handled by an ASR-33 teletype. This was the standard way to connect to computers and mainframes before we were all blessed with video terminals and TV typewriters. The whole setup connects to a Unix system with a much more familiar Hayes modem, runs a text-only browser, and retrieves the first web page as it was served up at CERN some 25 years ago.