Google Debuts 200-Year News Archive Search
"The goal of this service is to allow people to search and explore how history unfolded," said Anurag Acharya, Google distinguished engineer, who played a major role in shepherding the new product.
Google has partnered with news organizations including Time, The Wall Street Journal, The New York Times, the Guardian and the Washington Post, and aggregators including Factiva, LexisNexis, Thomson Gale and HighBeam Research, to index the full text of content going back 200 years.
Archived news results can be found in three ways. You can search the news archives directly through a new News Archive Search page. News archive results are also returned when you search on Google News or do a general Google web search and your query has relevant historical news results.
Archive Search includes both free and fee-based content, drawn from both publishers and aggregators. Search results available for a fee are labeled "pay-per-view" or with a specific price indicated. Google does not host this content; clicking on a link for fee-based content takes you to the content owner's or aggregator's web site, where you must complete the transaction before gaining access to the content.
Search results look similar to those produced by a search on Google News, with a few additional time-related features.
"Much like news, we are grouping related articles together from a given time period," said Acharya. "The ranking here, as you may expect from a Google service, is based entirely on relevance," with no precedence given to fee-based vs. free content. The mix of fee vs. free links will also vary depending on your query.
On the left side of search results are links to drill down into content from specific time periods. A blue arrow icon points to a "period of particular interest," when an event occurred or "something special happened," said Acharya.
One of the most interesting features of the new service is how it automatically creates a timeline that shows how an event or topic played out over time. Clicking the "timeline" link reorders results in chronological order; you can then drill down to get content from specific dates simply by browsing. There's also an option to limit search results to a single day via the advanced search page, according to Acharya.
This is a fantastic feature for people interested in seeing how a particular historic event played out over time. But it's also useful for simply keeping up with the progress of contemporary events. "We usually see history as a view of the past many years later," said Acharya. "Now we can enable you to search for anything and everything as it unfolds."
The service is rolling out with a U.S. English interface, but there's already a lot of non-English content available. "Our coverage is the deepest in English, but our plan is to expand into other languages fairly soon," said Acharya.
Google has no plans to become a content aggregator itself, or to even offer a streamlined payment system where you can use your Google account to pay for content, according to Google content partnerships director Jim Gerber. "At this point we are focusing on trying to make the content easily searchable and navigable," he said.
Are Google's partners worried about potential future competition? "The response from our partners has been overwhelmingly positive," said Gerber, because Google is currently only providing a link to partner sites where users log in and pay. "They see this as a great source of free, very targeted traffic." Content owners and aggregators not currently in the Google News Archives program can contact Google and request to be included, Acharya added.
As ZDNet blogger Garett Rogers and former SEW news editor Gary Price have pointedly noted, much of the fee-based content in Google Archive Search is available at no charge via many public libraries that subscribe to fee-based services and provide free access to patrons.
Google itself does something similar to this by permitting university users to access fee-based content licensed by the university in Google Scholar results. But for now, Google has no plans to build gateways to content through public libraries.
"Today users can't find this information on Google so we're just making sure we get it into the index," said Gerber.
Don't want to pay a fee for archived news? Check out Topix.net's one-year archive of news that Danny reviewed last week.
Search Headlines
NOTE: Article links often change. In case of a bad link, use the publication's search facility, which most have, and search for the headline.
From The SEW Blog...
- Google Updates Terminology Of Last Visit Date In Cache Results
- New Look YellowPages.ca Comes Out Of Beta
- Netscape Search Inserts Netscape News Above Web Results
- Google Opens Tesseract OCR Software
- Speakers Wanted For SES Multimedia & Mobile Edition 2006
- Bringo Click To Call Service Attempts To Help Consumers Foil Voice Response Systems
- Google To Fingerprint Voices With PC Microphones
- Yahoo Answers Launches In The UK
- YouTube Hires Yahoo's Treasurer, Gideon Yu
- Google Says They Will Give Brazil Orkut Data
- New Engine 'ChaCha' Offers Real-Time Answers From Live 'Guides'
Headlines & News From Elsewhere
- Battle brews over Flickr deletions, News.com
- Another Tag Search Engine, ResearchBuzz
- Bloglines focus group..., Jason Calacanis
- ChaCha, Yahoo Answers In The UK & Searching With Humans; Foiling The Phone Tree, New Netscape Search & More!, Daily SearchCast
- Nielsen: Web Ad Spend Outpaces All Other Media, ClickZ
- Accipiter Buys BidClix, ClickZ
- The Sullivan Show, DMNews.com
- ChaCha’s Lesser Known Bookish Cousins, Greg Sterling
- With Google's Formal Entry, Pay-Per-Call Set to Grow, ClickZ
- Jatalla, Phil Bradley
- How You, Too, Can Use YouTube, ClickZ
- Google to tap Indian talent pool, Rediff
- Very Early Look at Synthasite’s Ajax Website Builder, TechCrunch
- Google's Adam Lasnik & Optimizing For Google Classes - $30, V7N
- Whupped by Microsoft, Corel takes on Google, Globe & Mail
- Who Edits Wikipedia?, Google Blogoscoped
- Peggy Li on Jewelry and Pimp My Site, Chris Pirillo
- Yahoo! engine sparks Browzar backlash, Silicon.com
- Is Browzar Just An Adware Machine?, TechCrunch
- Google Maps Package Tracking, InsideGoogle
- 10 Dumb Approaches To Search Marketing, V7N
- Bryan and Jeffrey Eisenberg on Waiting for your Cat to Bark, Chris Pirillo
- New Search at Netscape, Jason Calacanis
- Google Flags Sites That Add Too Many New URLs, Search Engine Roundtable
- Microsoft adCenter Allows 100,000 Keywords Per Account, Search Engine Roundtable
- Google Archive Search, Googling Google
- Google CEO declines Apple options grant, Mac NN
- AOL Research has been shut down, Greg Linden

Biography
Chris Sherman
Chris Sherman is a frequent contributor to several information industry journals. He's written several books, including The McGraw-Hill CD ROM Handbook and The Invisible Web: Uncovering Information Sources Search Engines Can't See, co-authored with Gary Price. Chris has written about search and search engines since 1994, when he developed online searching tutorials for several clients. From 1998 to 2001, he was About.com's Web Search Guide.