IndustrySearch Memories

Search Memories

Veterans of AltaVista, Excite and Infoseek reminisce about the early days of web search, and opine about Google, then and now.

Veterans of AltaVista, Excite and Infoseek reminisce about the early days of web search, and opine about Google, then and now.

A special report from the Search Engine Strategies conference, August 2-5, 2004, San Jose, CA.

In January 1962, Dick Rowe earned his place in popular music history, by becoming “the guy who turned down the Beatles.”

As head of Decca records at the time, he had final say on whether they should sign up the Liverpool based combo, or London based outfit Brian Poole and the Tremeloes. He chose the latter and the rest, as they say… is history!

That story came flashing back to me as I sat in the audience for the Search Memories session at SES San Jose in August. Doug Cutting, Primary Developer, Nutch (formerly Senior Architect, Excite), Steve Kirsch, founder, Propel Software (formerly founder, Infoseek) and Louis Monier, head of R&D, eBay (formerly chief technical officer, AltaVista) together, reminisced about the early years of search engines.

And one by one, they each openly admitted that they had been approached by two students with a new idea for a search engine. “Go pound sand I told them” said Steve Kirsh. “I wasn’t impressed with their demo at all” said Doug Cutting. “I didn’t have the authority to sign a check anyway” said Louis Monier.

The two students which they all turned away, of course, were Larry Page and Sergey Brin. Their new idea for search was called PageRank, and would eventually become the power behind Google.

It was good to see that they could all share a laugh about it. Although I’m certain that, when they saw the expected valuation of Google for the impending IPO (as it was at the time of the conference), a little weeping and gnashing of teeth may have occurred behind closed doors.

This was a most revealing session for me. Not about the inner workings of early search engines, more about the similarities in the stories they each had to tell.

Doug Cutting explained that they were thrilled in 1995 to boast about having access to an index of a whopping 1.2 million pages. Eventually this would extend to 2 million pages. But only to find themselves overshadowed by the launch of the mighty AltaVista, hitting the ground running with super power and an index of 16 million pages. As moderator Chris Sherman commented, this must have been the start of the “size wars” at search engines.

And so it must have been, with Doug Cutting then receiving instructions to get Excite up to 50 million pages! For a while, Excite was the biggest thing in search (if size really does matter). Only to be eclipsed once again by the mighty AltaVista.

Yes, the mighty AltaVista with the huge power of Digital behind them…

I have to tell you, there was nobody in the room more surprised than me to discover that “the mighty AltaVista” at the time consisted of Louis Monier and about a half dozen other people that he had access to – and that was it. In fact, there was so little support, it was his wife who designed the company logo. The company was “just the bare bones”.

However, judging by the press around the launch of AltaVista it seemed as if there must have been hundreds of people involved. “AltaVista was lost inside a dying hardware company that was in complete free fall” Louis explained. Digital research labs were recovering from what he called “a couple of black eyes” and had just been sued twice losing $100 million!

But none of this prevented the press guys trying to squeeze the maximum PR possible out of the launch of AltaVista. Louis lamented the fact that the press guys at Digital at the time, had such little knowledge of what AltaVista was all about. “You enter your query and within 24 hours you get a response they wrote!”

So what were the innovations in search during those early days?

We had enhanced relevance with a basic Google approach at Infoseek said Steve Kirsch. It was kind of like a poor man’s Google back then. We’d look at the number of web pages that had hyperlinks containing the query term and try and rank the page with the most links at the top. And Infoseek was the first to have phrase searching.

Meanwhile at AltaVista Louis was in perplexed state once again as the strategy shifted from search to portal. “The attempt to turn AltaVista into a portal was just a freakin’ disaster” but speed and universal access were top of his own list of priorities for search. “I once spent a week trying to iron out a bug in Lynx 0.9 even though there were only about ten thousand people using it.”

Doug Cutting was proud of a “very cool” feature which Excite added. “It was like an automatic thesaurus. But it was really a technique from statistics called discrimination analysis. Where we found words that weren’t in your query, but could help to disambiguate it, so we’d put three or four of these words at the top of the results page to help.”

And what was it, did they think, that Google brought to the table?

Steve Kirsh seemed to think that, beyond technology, building the Google brand was exceptionally important. But also added that, when he’d put this to Google cofounder Larry Page some years earlier, he’d dismissed brand and put it down to operations. Just as Coke has a brand, Larry Page said that their success was due more to their operation i.e. you can get Coke anywhere in the world. And the same applies to Google, which manages to copy a multi-terabyte index around the world on a regular basis. Also noting that in the early days of Infoseek, they had trouble just trying to copy 50 gigabytes and get it over to AOL!

Doug believed that they had all missed “the big trick” with Google and that was creating the link graph and using it for ranking purposes. But he also pointed out that since Google entered the space with a clean, uncluttered white interface and a link graph, not a lot had changed since then.

And whereas AltaVista had been founded upon huge powerful machines, Louis Monier commented on the more flexible and low cost scaling method which Google uses in its hardware set up. “They’ve built something that scales like you wouldn’t believe and it’s really, really cheap.”

AltaVista had also built a link table, but it was more for estimating the size of the web. “We could tell from new links discovered just how fast and big the web was growing.”

Steve Kirsch couldn’t resist telling the audience that they knew about this at Infoseek so they had placed a web page with the words “This is the last page on the web – there’s no more after this” so the AltaVista crawler would find it and just stop there!

Finally Chris Sherman rounded up the session by asking the panel to take a look into their crystal balls and talk about the future of search.

Doug took the opportunity to talk about his new project Nutch, an open source search engine similar to Google in its operation. “It’s good for niche web search and also for searching intranets and university campus when you have ten servers and you want to index them all. But we’ve demonstrated it on hundreds of millions of web pages and we think it will go as far as billions.”

“Search is not sticky,” said Louis. “That’s the direction Google is going right now though. Look at how quickly they just turned Gmail on. That’s the next thing for the future. Getting people to stick with you… Until two guys working in a garage come out with something new and…”

The final word went to Steve Kirsch. “I’m not qualified to predict the future really. But one thing I could certainly predict is that, the next time some Stanford student comes to me with a new idea… I may not be so quick to tell him to go pound sand again!

Want to discuss or comment on this story? Join the Infoseek to Google Founders: Go Pound Sand discussion in the Search Engine Watch forums.

Mike Grehan is CEO of Smart Interactive and author of Search Engine Marketing: The essential best practice guide.

Search Headlines

NOTE: Article links often change. In case of a bad link, use the publication’s search facility, which most have, and search for the headline.

Bush & Kerry Battle It Out In Search
Search Engine Round Table Oct 20 2004 11:32AM GMT
Open Season: News Sites Add Outside Links, Free Content
Online Journalism Review Oct 20 2004 11:24AM GMT
In search of TV via the Web It’s only a matter of time before tomorrow’s Internet is about searching effortles
CBS MarketWatch Oct 20 2004 9:42AM GMT
Google translation status report of main site
Addict3d.org Oct 20 2004 7:34AM GMT
Overture, Google Reach Out to Agencies
ClickZ Today Oct 20 2004 4:17AM GMT
Reviewing 10 Affiliate Networks
iMedia Connection Oct 20 2004 2:11AM GMT
Speculation on What’s Next For Google Desktop
Searchblog Oct 19 2004 11:00PM GMT
The International Type Index
ResearchBuzz Oct 19 2004 10:52PM GMT
ICANN Whois Policies Update
CircleID Oct 19 2004 6:53PM GMT
The Search Mailbag: Climbing the Ranks
Media Post Oct 19 2004 6:48PM GMT
Championing a Wiki World
Business Week Oct 19 2004 6:38PM GMT
Google Desktop Search Doesn’t Threaten Security
eWeek Oct 19 2004 6:09PM GMT
Google Deposes Overture on AOL Europe
ClickZ Today Oct 19 2004 4:34PM GMT
Viapoint Offers New Approach To Desktop Search
Desktop Pipeline Oct 19 2004 2:56PM GMT
Wotbox Undercuts The Competition On Paid Inclusion
InternetWeek Oct 19 2004 2:19PM GMT

Resources

The 2023 B2B Superpowers Index
whitepaper | Analytics

The 2023 B2B Superpowers Index

8m
Data Analytics in Marketing
whitepaper | Analytics

Data Analytics in Marketing

10m
The Third-Party Data Deprecation Playbook
whitepaper | Digital Marketing

The Third-Party Data Deprecation Playbook

1y
Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study
whitepaper | Digital Marketing

Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study

1y