Google Operations VP Talks Of "Shards" & Other Search Architecture Details

Author

Date published March 13, 2006 Categories

Industry

An InternetNews.com article named Peeking Into Google has Google’s VP of operations and engineering giving us insight into Google’s architecture. The article covers Google’s hardware, the operating system on that hardware, and the auto-healing technology used on Google’s servers. Also, the article describes how Google stores the data on the machines, and the action Google takes when a query is submitted. As you continue reading through the article, you also learn where the snippet of content comes from on the search results page and that search result page is then stored into memory. The Google VP also discusses the “Google File System”, “Map/Reduce Framework” and “Global Work Queue” and each of its respective responsibilities.

Here are some key points from the article;

Commodity servers for about $1,000 each built into interconnected nodes for complete redundancy.
The operating system runs a “stripped-down Linux kernel” with patches bugs “that haven’t been fixed in the original kernel.”
“Google has automated methods of dealing with machine failures, allowing it to build a fast, highly reliable service with cheap hardware.”
Google splits up Web pages into “shards” and then replicates them to several other servers, these servers are named “chunk servers.”
When the query is submitted by the searcher, the query is “split into chunks of service” where Google uses “one complete set of servers” to answer the query.
The snippets of content used under the search results come from “document servers” which contain one copy of the Web page.
The result page is then stored in memory.
The Google File System is partly responsible for storing “two copies that are not physically adjacent — not on same power strip or same switch,” of “chunks”.
Client machines are used for “fault tolerance,” if one fails the “chunks” should move to a different client machine.
This is all managed with Google’s “Map/Reduce Framework” which was designed in 2004.
Google’s Global Work Queue batches queries on machines to run “random computations over tons of data.”

There is a forum thread on this article at Cre8asite Forums.

More about:

Resources

Analytics The 2023 B2B Superpowers Index

The Merkle B2B 2023 Superpowers Index outlines what drives competitive advantage within the business culture and subcultures that are critical to success. It is the indispensable guide for B2B marketers to deliver world-class experiences and keep pace with the dynamic environment. Download Now
Analytics Data Analytics in Marketing

The ClicData survey found that various challenges exist that prevent organizations from achieving such gains. These challenges included inaccessible data formats and limited flexibility in displaying data in dashboards. Download Now
Digital Marketing The Third-Party Data Deprecation Playbook

The need for fraud prevention in the digital world is critical now more than ever. Why? Thinking about your own behavior, consider how you complete transactions and how this has changed over the last 5 years. Download Now
Digital Marketing Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study

The need for fraud prevention in the digital world is critical now more than ever. Why? Thinking about your own behavior, consider how you complete transactions and how this has changed over the last 5 years. Download Now

Industry

SEO

PPC

Analytics

Social

Local

Mobile

Video

Content

Development

Information

Follow us

Google Operations VP Talks Of "Shards" & Other Search Architecture Details

Resources

Analytics The 2023 B2B Superpowers Index

Analytics Data Analytics in Marketing

Digital Marketing The Third-Party Data Deprecation Playbook

Digital Marketing Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study

Resources

The 2023 B2B Superpowers Index

Data Analytics in Marketing

The Third-Party Data Deprecation Playbook

Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study

Related Articles

Analysis and advice: Why recipe sites saw huge fluctuations in visibility

Search trends 2018: what can marketers learn?

SEW Interview: Clark Boyd on visual search

Where we’re going, we won’t need websites

Google's Alphabet: A Welcome Move in Asia

Google Now Adds Quote Cards That Lack Attribution

Gmail, Google+, and Hangouts Suffer Outage Across Europe

Bing Ads Improves Speed, Bidding, and Targeting

Follow us

Google Operations VP Talks Of "Shards" & Other Search Architecture Details

Get the Latestdaily news and insights about search engine marketing, SEO and paid search.

Resources

Resources

The 2023 B2B Superpowers Index

Data Analytics in Marketing

The Third-Party Data Deprecation Playbook

Utilizing Email To Stop Fraud-eCommerce Client Fraud Case Study

Related Articles

Analysis and advice: Why recipe sites saw huge fluctuations in visibility

Search trends 2018: what can marketers learn?

SEW Interview: Clark Boyd on visual search

Where we’re going, we won’t need websites

Google's Alphabet: A Welcome Move in Asia

Google Now Adds Quote Cards That Lack Attribution

Gmail, Google+, and Hangouts Suffer Outage Across Europe

Bing Ads Improves Speed, Bidding, and Targeting

Get the Latest
daily news and insights about search engine marketing, SEO and paid search.