New Search Patent Applications: June 19, 2006 - Autolinking, and Better Advertising through Deletion Predictions

Four patent applications from Google describe fighting spam in emails, providing product review searches, moving large amounts of data, and autolinking. Yahoo matches and raises with five patent filings: one on watching deletions to choose better ads, another on serving dynamic information through an additional browser interface, and three more on multimedia and RSS.

Microsoft goes TV 2.0 with an electronic program guide, and describes a way of matching advertising content with certain search queries before those searches are made. IBM comes up with a unique way of presenting the results of a search from more than one search engine, and a way of reducing the number of irrelevant results in a search: analyze an initial set of results, identify an appropriate additional query term from those results, and search the original results again with the additional query term included.

Go Daddy describes a way of fighting spam in emails. Xerox employs collaborative filtering from previous users’ searches to predict search results. Apostolos Gerasoulis, from Ask.com, with a couple of co-inventors, ranks and displays pages (objects) based upon linkage and textual data, and then defines a way to identify and assign topics to them.

Google

Email Spam

Emails with links in them could be considered spam if the links point to pages that are in a conceptual category considered spammy. This patent application really doesn’t describe the concept categorization part of the process. That’s done in a related patent application mentioned within this document, and the related document lists Georges Harik as one inventor. Dr. Harik’s name is on a very large percentage of the patent applications involving Gmail-type processes.

Method and system to detect e-mail spam using concept categorization of linked content
Invented by Johnny Chen
US Patent Application 20060122957
Published June 8, 2006
Filed December 3, 2004

Abstract

A system and method for detecting undesired electronic messages (e.g., spam) using concept categorization of hyperlinks is disclosed. A server receives an electronic message and retrieves web pages that correspond to hyperlinks in the message. The server performs concept categorization on the retrieved web pages based on semantic relationships in the received information to determine whether the electronic message meets predefined criteria associated with undesired messages.

Searching and Aggregating Product Reviews

If Google wanted to get into the product or services review business, the next patent filing describes a blueprint for a process that might make an effective and innovative system.

Method and system for finding and aggregating reviews for a product
Invented by Jan Matthias Ruhl and Mayur D. Datar
US Patent Application 20060129446
Published June 15, 2006
Filed December 14, 2004

Abstract

The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.

Scaling and Distributing Data

Arvind Jain is the head of Research and Development in Google’s Bangalore office, and has spoken at a number of conferences on infrastructure projects and issues involving such things as Google’s crawl and indexing system, distributed file replication system, and compression techniques for large-scale storage systems. He’s listed as the inventor for this next Google filing.

System and method for scalable data distribution
Invented by Arvind Jain
US Patent Application 20060126201
Published June 15, 2006
Filed December 10, 2004

Abstract

A system having a resource manager, a plurality of masters, and a plurality of slaves, interconnected by a communications network. To distribute data, a master determines that a destination slave of the plurality of slaves requires data. The master then generates a list of slaves from which to transfer the data to the destination slave. The master transmits the list to the resource manager. The resource manager is configured to select a source slave from the list based on available system resources. Once a source is selected by the resource manager, the master receives an instruction from the resource manager to initiate a transfer of the data from the source slave to the destination slave. The master then transmits an instruction to commence the transfer.
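The hand-off in that abstract can be pictured with a small Python sketch. Everything here is illustrative rather than taken from the filing: the class names, the `free_bandwidth` measure of "available system resources," and the rule of picking the candidate with the most spare capacity are all assumptions.

```python
class ResourceManager:
    """Picks a source from a master's candidate list (hypothetical policy)."""

    def __init__(self, free_bandwidth):
        # slave name -> spare transfer capacity, an assumed stand-in for
        # the "available system resources" mentioned in the abstract
        self.free_bandwidth = dict(free_bandwidth)

    def select_source(self, candidates, cost=1):
        usable = [s for s in candidates if self.free_bandwidth.get(s, 0) >= cost]
        if not usable:
            return None
        best = max(usable, key=lambda s: self.free_bandwidth[s])
        self.free_bandwidth[best] -= cost  # reserve capacity for this transfer
        return best


class Master:
    """Tracks which slaves hold which data and requests transfers."""

    def __init__(self, holdings, manager):
        self.holdings = holdings  # slave name -> set of data ids
        self.manager = manager

    def distribute(self, data_id, destination):
        # Step 1: does the destination actually need the data?
        if data_id in self.holdings.get(destination, set()):
            return None
        # Step 2: list the slaves that could act as a source.
        candidates = sorted(
            s for s, items in self.holdings.items()
            if data_id in items and s != destination
        )
        # Step 3: the resource manager chooses a source from the list.
        source = self.manager.select_source(candidates)
        if source is None:
            return None
        # Step 4: the master issues the transfer instruction.
        self.holdings.setdefault(destination, set()).add(data_id)
        return (source, destination, data_id)
```

Keeping the choice of source inside the resource manager means masters never need a global view of load; they only need to know who holds what.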

Autolinking

Google’s AutoLink raised a lot of eyebrows, and brought some negative reactions. A Search Engine Watch Blog post from Danny Sullivan, “Google Toolbar’s AutoLink & The Need For Opt-Out,” defined many of the issues around the toolbar feature. The following patent application explains how such a system might work from the search engine’s perspective.

Providing useful information associated with an item in a document
Invented by Gueorgui Djabarov
US Patent Application 20060129910
Published June 15, 2006
Filed December 14, 2004

Abstract

A method includes recognizing an item within a first document based on a pattern associated with the item but not the exact content of the item. The method further includes identifying a link for the item and providing a second document that includes information associated with the item when the link for the item is selected.
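To make "a pattern associated with the item but not the exact content of the item" concrete, here is a minimal Python sketch that recognizes ISBN-10 numbers by their shape alone and wraps them in a link. The regular expression, the choice of ISBNs as the item type, and the `example.com` lookup URL are all assumptions made for illustration; the filing does not specify them.

```python
import re

# Matches "ISBN" followed by ten digits (last may be X), with optional
# hyphens or spaces between digits. The pattern knows nothing about
# which specific books exist; it only knows what an ISBN looks like.
ISBN_PATTERN = re.compile(r"\bISBN[:\s]*((?:\d[- ]?){9}[\dXx])\b")


def autolink(text, url_template="https://example.com/lookup?isbn={}"):
    """Wrap every ISBN-shaped item in text with a link (hypothetical target)."""

    def to_link(match):
        digits = re.sub(r"[- ]", "", match.group(1))
        return '<a href="{}">{}</a>'.format(
            url_template.format(digits), match.group(0)
        )

    return ISBN_PATTERN.sub(to_link, text)
```

The second document in the abstract would be whatever page the generated link resolves to when the reader selects it.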

Yahoo

Choosing Better Ads through User Behavior

Some queries involve the use of concepts and units, as described in at least five Yahoo patent filings (see the Yahoo sections of two previous patent posts, “Yahoo Units and Microsoft Redundancy Filters” and “More Yahoo Concepts and Google Predictive Searches”).

But sometimes a two-term query isn’t a concept as much as it is a couple of keywords that someone may use to search for something. If that person performs a second search after deleting one of the words, then the record of that deletion and second search might help Yahoo calculate “deletion probability scores” for words being used in these kinds of two-term queries.

This can be helpful when there isn’t a good keyword-based advertising match for that query, but there might be a good match individually for each of the terms that make up the query. The “deletion probability scores” can help determine which of the two terms to show keyword-based advertising for in search results.

System and methods for ranking the relative value of terms in a multi-term search query using deletion prediction
Invented by Rosemary Jones and Daniel C. Fain
US Patent Application 20060129534
Published June 15, 2006
Filed December 14, 2004

Abstract

The likely relevance of each term of a search-engine query of two or more terms is determined by their deletion probability scores. If the deletion probability scores are significantly different, the deletion probability score can be used to return targeted ads related to the more relevant term or terms along with the search results. Deletion probability scores are determined by first gathering historical records of search queries of two or more terms in which a subsequent query was submitted by the same user after one or more of the terms had been deleted. The deletion probability score for a particular term of a search query is calculated as the ratio of the number of times that particular term was itself deleted prior to a subsequent search by the same user divided by the number of times there were subsequent search queries by the same user in which any term or terms including that given term was deleted by the same user prior to the subsequent search. Terms are not limited to individual alphabetic words.
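The ratio in that abstract is easier to follow as code. This Python sketch is one reading of it, not Yahoo's implementation: it treats whitespace-separated words as terms (the filing says terms need not be single words), it only counts follow-up searches that are pure deletions of the original query, and it assumes the term less likely to be deleted is the more relevant one. The function names and the `min_gap` threshold for "significantly different" are invented for illustration.

```python
from collections import Counter


def deletion_scores(history):
    """history: (original_query, follow_up_query) pairs from the same user."""
    deleted = Counter()   # times a term was itself deleted
    eligible = Counter()  # times a term sat in a query that had any deletion
    for original, follow_up in history:
        before = set(original.lower().split())
        after = set(follow_up.lower().split())
        # Keep only follow-ups that are pure deletions of the original query.
        if not after or not after < before:
            continue
        for term in before:
            eligible[term] += 1
        for term in before - after:
            deleted[term] += 1
    return {term: deleted[term] / eligible[term] for term in eligible}


def term_to_advertise(query, scores, min_gap=0.2):
    """Return the term least likely to be deleted, or None if scores are close."""
    terms = query.lower().split()
    if len(terms) < 2 or any(t not in scores for t in terms):
        return None
    ranked = sorted(terms, key=lambda t: scores[t])
    if scores[ranked[1]] - scores[ranked[0]] < min_gap:
        return None  # not "significantly different" under the assumed threshold
    return ranked[0]
```

For example, if users who search "cheap flights" usually drop "cheap" before searching again, "flights" ends up with the lower deletion score and becomes the term to match ads against.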

Browser Interface Helpers

This next document describes some ways to provide additional dynamic information to someone via a toolbar styled interface, while they are browsing pages on the web.

Method of controlling an Internet browser interface and a controllable browser interface
Invented by Thomas J. Shafron
Assigned to Yahoo
US Patent Application 20060129937
Published June 15, 2006
Filed February 2, 2006

Abstract

The present invention is directed to a method of dynamically controlling and displaying an Internet browser interface, and to a dynamically controllable Internet browser interface. In accordance with the present invention, a browser interface may be customized using a controlling software program that may be provided by an Internet content provider, an ISP, or that may reside on an Internet user’s computer. The controlling software program enables the Internet user, the content provider, or the ISP to customize and control the information and/or functionality of a user’s browser and browser interface.

RSS Enhancements

The following three Yahoo filings all list the same inventors, including John Thrall, who is the head of media search engineering for Yahoo Search. They cover different aspects of using RSS with multimedia files.

Syndicating multiple media objects with RSS
Invented by Andrew R. Volk, David D. Hall, and John J. Thrall
US Patent Application 20060129917
Published June 15, 2006
Filed December 1, 2005

Abstract

System and method for syndicating more than one media object in an element using Real Simple Syndication (RSS). In one embodiment, multiple media objects with at least one shared characteristic are syndicated under the same element. For example, a single media object can come in multiple formats and/or compression rates.

Syndicating multimedia information with RSS
Invented by Andrew R. Volk, David D. Hall, John J. Thrall
US Patent Application 20060129907
Published June 15, 2006
Filed December 1, 2005

Abstract

System and method for adding descriptive information to a Real Simple Syndication (RSS) document. The descriptive information describes the content of media objects syndicated through the document. The descriptive information can be used to provide additional information to a subscriber, and can be used in searching for syndicated media content.

RSS rendering via a media player
Invented by Andrew R. Volk, David D. Hall, John J. Thrall
US Patent Application 20060129916
Published June 15, 2006
Filed December 1, 2005

Abstract

System and method for syndicating media objects through a link to a media player using Real Simple Syndication (RSS). A content provider may not want to give direct access to a media object to a subscriber. Instead a content provider can give the subscriber a link to a media player that can access the media object.

Microsoft

Searching electronic program guide data
Invented by Pradhan S. Rao, David Hendler Sloo, Daniel Danker, and George K. Nyako
Assigned to Microsoft
US Patent Application 20060130098
Published June 15, 2006
Filed December 15, 2004

Abstract

Searching electronic program guide (EPG) data is described. The EPG data may be compartmentalized into channel metadata that describes characteristics of one or more channels and content metadata that describes characteristics of one or more content items. In an implementation, a method includes searching channel metadata and content metadata. A result of the searching is formed for output in conjunction with an electronic program guide (EPG).

System and method for indexing and prefiltering
Invented by Brian Burdick, Joshua J. Forman, Kevin P. Kornelson, Murali Vajjiravel, and Rajeev Prasad
Assigned to Microsoft
US Patent Application 20060129555
Published June 15, 2006
Filed December 9, 2004

Abstract

A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.
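The abstract names two pre-filters without saying how they work, so the following Python sketch is only a plausible interpretation. It assumes an advertising phrase can match only if it is no longer than the query (the phrase length filter) and only if every one of its word hashes appears among the query's word hashes (the word hash filter). The function names and the use of CRC32 as the hash are assumptions.

```python
import zlib


def word_hash(word):
    # CRC32 is an arbitrary, deterministic stand-in for the filing's hash.
    return zlib.crc32(word.lower().encode("utf-8"))


def build_index(phrases):
    """Precompute length and word hashes for each advertising phrase."""
    index = []
    for phrase in phrases:
        words = phrase.lower().split()
        index.append((phrase, len(words), frozenset(word_hash(w) for w in words)))
    return index


def prefilter(index, query):
    """Cheaply discard index entries that cannot match the parsed query."""
    words = query.lower().split()
    query_hashes = {word_hash(w) for w in words}
    survivors = []
    for phrase, length, phrase_hashes in index:
        if length > len(words):
            continue  # phrase length pre-filter
        if not phrase_hashes <= query_hashes:
            continue  # word hash pre-filter
        survivors.append(phrase)
    return survivors
```

Because both checks run on precomputed integers, most non-matching entries are dropped before any string comparison; the listing server described in the abstract would then do the more expensive final filtering on the survivors.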

IBM

Ring method, apparatus, and computer program product for managing federated search results in a heterogeneous environment
Invented by Wade Shelby Beavers and David Joseph Borrillo
Assigned to IBM
US Patent Application 20060129530
Published June 15, 2006
Filed December 9, 2004

Abstract

A method, apparatus and computer program product are provided for managing federated search results in a heterogeneous environment. A user enters a search term and the search term is submitted to multiple selected search engines. Search results are gathered from each selected search engine. A search ring is generated including a ring section to represent each of the selected search engines for enabling the user to view search results from one or more of the selected search engines.

Method and system for suggesting search engine keywords
Invented by Cary Lee Bates
Assigned to IBM
US Patent Application 20060129531
Published June 15, 2006
Filed December 9, 2004

Abstract

A search engine receives a search query having one or more keywords. The documents in the result set from that search query are analyzed to identify one or more additional keywords that further segment, or separate, the initial result set. These additional keywords are presented to the user who then selects whether to include or exclude documents matching the additional keywords. In this way, the number of documents in the initial result set is reduced in a relatively quick and effortless manner.
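One simple way to find keywords that "segment" a result set is to look for words that appear in roughly half of the results, since including or excluding them removes the most documents. The Python sketch below takes that approach as an assumption; the filing does not say how keywords are chosen, and the function names are invented here.

```python
from collections import Counter


def suggest_keywords(results, query_terms, limit=3):
    """Suggest words that split the result set most evenly (assumed heuristic)."""
    doc_freq = Counter()
    for text in results:
        for word in set(text.lower().split()):
            doc_freq[word] += 1
    half = len(results) / 2
    candidates = [
        w for w in doc_freq
        if w not in query_terms and doc_freq[w] < len(results)
    ]
    # Closest to an even split first; alphabetical order breaks ties.
    candidates.sort(key=lambda w: (abs(doc_freq[w] - half), w))
    return candidates[:limit]


def refine(results, keyword, include=True):
    """Keep (or drop) the results that contain the chosen keyword."""
    return [t for t in results if (keyword in t.lower().split()) == include]
```

A search for "jaguar" that returns pages about both cars and cats would surface "car" and "cat" as suggestions, and excluding "car" would leave only the animal pages.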

Go Daddy

Email filtering system and method
Invented by Brad Owen and Jason Steiner
US Patent Application 20060129644
Published June 15, 2006
Filed December 14, 2004

Abstract

Systems and methods of the present invention allow filtering out spam and phishing email messages based on the links embedded into the email messages. In a preferred embodiment, an Email Filter extracts links from the email message and obtains desirability values for the links. The Email Filter may route the email message based on desirability values. Such routing includes delivering the email message to a Recipient, delivering the message to a Quarantine Mailbox, or deleting the message.
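A rough Python sketch of that routing step follows. The link-matching regular expression, the lookup table of desirability values, the thresholds, and the rule of judging a message by its worst link are all assumptions; the abstract only says that links are extracted, scored, and used to route the message.

```python
import re

# Deliberately simple link matcher for illustration only.
LINK_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def route(message, desirability, quarantine_below=0.5, delete_below=0.2,
          unknown=0.5):
    """Return 'deliver', 'quarantine', or 'delete' for an email body."""
    links = LINK_PATTERN.findall(message)
    if not links:
        return "deliver"
    # Judge the message by its least desirable link (assumed policy).
    worst = min(desirability.get(link, unknown) for link in links)
    if worst < delete_below:
        return "delete"
    if worst < quarantine_below:
        return "quarantine"
    return "deliver"
```

In a real filter the `desirability` table would presumably be backed by some reputation service rather than a dictionary.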

Xerox

Personalized web search method
Invented by Lisa S. Purvis
Assigned to Xerox Corporation
US Patent Application 20060129533
Published June 15, 2006
Filed December 15, 2004

Abstract

A method for contextualizing search results is disclosed. The method includes performing a traditional web query that returns a set of result pages, using collaborative filtering techniques to generate a set of predicted pages, comparing the set of predicted pages with the set of result pages, and ranking the set of result pages so that result pages that are also included in the set of predicted pages are ranked higher than those that are not. Methods herein also contemplate using the search history of the user or others to refine the results of searches.
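The compare-and-rank step can be sketched in a few lines of Python. The prediction function below is a deliberately crude stand-in for collaborative filtering (it recommends pages visited by users whose histories overlap with yours), and both function names are hypothetical; only the final reranking rule comes from the abstract.

```python
def predict_pages(user_history, other_histories, min_overlap=1):
    """Pages visited by users with similar histories (toy collaborative filter)."""
    mine = set(user_history)
    predicted = set()
    for other in other_histories:
        other = set(other)
        if len(mine & other) >= min_overlap:
            predicted |= other - mine
    return predicted


def rerank(result_pages, predicted_pages):
    """Move results that were also predicted ahead of those that were not."""
    predicted = set(predicted_pages)
    # sorted() is stable, so the engine's original order is kept within
    # each group: predicted pages first (key False), the rest after.
    return sorted(result_pages, key=lambda page: page not in predicted)
```

Relying on a stable sort keeps the search engine's own ordering intact except for the promotion of predicted pages.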

Ask.com

Relevancy-based database retrieval and display techniques
Invented by Tao Yang, Wei Wang, and Apostolos Gerasoulis
US Patent Application 20060129552
Published June 15, 2006
Filed February 2, 2006

Abstract

Techniques to retrieve, rank and display data objects retrieved from a database are described. In particular, methods to assign a global ranking value to a data object based on a combination of that object’s link-based (e.g., vector-space cluster analysis) and text-based (e.g., word frequency) ranks are described. Additional techniques to determine a set of concepts, topics or key words associated with each retrieved data object are described.

My usual reminder about patents: Some of the processes and technology described in patents are created in-house, and some are developed with the assistance of contractors and partners. A percentage are never developed in a tangible manner, but may serve as a way to attempt to exclude others from using the technology, or even to possibly mislead competitors into exploring an area that they might not have an interest in (sometimes skepticism is good).

There are times when a Google or Yahoo acquires a company to gain access to the intellectual property of that company, or the intellectual prowess and expertise of that company’s employees. And sometimes patents are just purchased.
