New Search Patent Filings: August 28, 2006 - Identifying Web Spam and Adult Images
New Microsoft patent applications include one that attempts to identify web spam based upon signals within the content of a page, another looks at ways to search using pattern matching and relevance to answer specific questions, a third describes a method of relating people to each other during a search based upon things such as being co-authors of documents, a fourth defines a process of refining searches based upon previous searches for the same query or by providing additional context to searchers, and a fifth allows users or communities of users to provide reviews of web pages independently of the owners of sites being reviewed.
Yahoo (Overture) was granted a patent on the bidding and ranking of pages through paid search. Yahoo also had published two patent applications which explore social networks, and previewing, inviting and granting authorization for others to view specific pages within that social network.
Ask.com looks at adult images, and a way to identify them as being adult content without performing a visual analysis of those images, but instead by looking a query sessions related to the pictures.
Oracle was granted a patent which mines information about users from query logs and user profiles to retrieve recommendations for pages, expansion of queries based upon that information, and a thematic clustering of those search results.
Microsoft
The following patent application covers some very similar ground as a white paper from Microsoft Research titled Detecting Spam Web Pages through Content Analysis (pdf). It looks at a number of ways that web spam might be identified from the content on a page, though the authors note that the methods involved would likely be used in conjunction with other indications of web spam, perhaps like the ones discussed in Spam, Damn Spam, and Statistics, and in an earlier patent application on Content Evaluation.
Using content analysis to detect spam web pages
Invented by Marc Alexander Najork, Dennis Craig Fetterly, Mark Steven Manasse, and Alexandros Ntoulas
Assigned to Microsoft
US Patent Application 20060184500
Published August 17, 2006
Filed on February 11, 2005
Abstract
Evaluating content includes receiving content, analyzing the content for web spam using a content-based identification technique, and classifying the content according to the analysis. An index of analyzed contents may be created. A system for evaluating content includes a storage device configured to store data and a processor configured to analyze content using content-based identification techniques to determine whether web spam is present.
Search methods and associated systems
Invented by Larry Israel and John Solaro
Assigned to Microsoft
US Patent Application 20060184523
Published August 17, 2006
Filed on February 15, 2005
Abstract
Search methods and associated systems are disclosed. One aspect of the invention is directed toward search methods and associated systems. One aspect of the invention is directed toward a computer-implemented searching method that includes receiving an input having a format. The method further includes finding a pattern that matches the format of the input using a rule set. The method still further includes determining a subject of the input based on the pattern, finding a result record corresponding to the subject, and sending an output based on the result record. In certain embodiments, the method can further include determining at least one qualifier based on the pattern and finding a result record corresponding to the subject and the at least one qualifier. In still other embodiments, the method can further include determining a subject of the input based on the pattern and at least one synonym rule.
Method and system for mining information based on relationships
Invented by Benyu Zhang, Wei-Ying Ma, Gu Xu, Hongbin Gao, Zheng Chen, Randy Hinrichs, Hua-Jun Zeng
Assigned to Microsoft
US Patent Application 20060184481
Published August 17, 2006
Filed on February 11, 2005
Abstract
A method and system for identifying information about people is provided. The information system identifies groups of people that have relationships based on their relationships to documents or more generally to objects. The information system initially is provided with an indication of which people have which relationships to which documents. The information system then identifies clusters of people based on having a relationship to the same objects. The information system may also identify clusters of related objects associated with a cluster of people. When a user wants to identify information about a person, the user can provide the name of that person to the information system. The information system then can retrieve and display the names of the other people who are in the same cluster as the person.
Content searching and configuration of search results
Invented by Greg A. Kohanim, Jonathan L. Wiedemann, Christine A. Jefson, and David Aaron Ward Snelling
Assigned to Microsoft
US Patent Application 20060184512
Published August 17, 2006
Filed on February 17, 2005
Abstract
Content searching and configuration of search results are described. In an implementation, a method includes in response to a search query, selecting a keyword based on heuristic data which describes a plurality of previously performed searched. A search is performed utilizing the search query and the selected keyword to locate content.
Method and system for contextual site rating
Invented by Peter G. Williams, Mark A. Wilson-Thomas, Martin Peck, Robert J. Wilcox, Andrew Burns, Martin Grayson
Assigned to Microsoft
US Patent Application 20060184608
Published August 17, 2006
Filed on February 11, 2005
Abstract
The present invention allows a user or community of users to rate content across a variety of web sites and display contextual sensitive reviews. Rather than the rating information being controlled by the web site owner, the rating information may be owned and controlled by a third party. Users have the ability to rate a web site, review ratings from a web site, or operate a web site rating system.
Yahoo
System and method for influencing a position on a search result list generated by a computer network search engine
Invented by Darren J. Davis, Matthew Derer, Johann Garcia, Larry Greco, Tod E. Kurt, Thomas Kwong, Jonathan C. Lee, Ka Luk Lee, Preston Pfarner, and Steve Skovran
Assigned to Overture
United States Patent 7,092,901
Granted August 15, 2006
Filed on July 24, 2001
Abstract
A system and method for enabling information providers using a computer network such as the Internet to influence a position for a search listing within a search result list generated by an Internet search engine. The system and method of the present invention provides a database having accounts for the network information providers. Each account contains contact and billing information for a network information provider. In addition, each account contains at least one search listing having at least three components: a description, a search term comprising one or more keywords, and a bid amount. The network information provider may add, delete, or modify a search listing after logging into his or her account via an authentication process. The network information provider influences a position for a search listing in the provider’s account by first selecting a search term relevant to the content of the web site or other information source to be listed. The network information provider enters the search term and the description into a search listing. The network information provider influences the position for a search listing through a continuous online competitive bidding process. The bidding process occurs when the network information provider enters a new bid amount, which is preferably a money amount, for a search listing. The system and method of the present invention then compares this bid amount with all other bid amounts for the same search term, and generates a rank value for all search listings having that search term. The rank value generated by the bidding process determines where the network information providers listing will appear on the search results list page that is generated in response to a query of the search term by a searcher located at a client computer on the computer network. A higher bid by a network information provider will result in a higher rank value and a more advantageous placement.
Control for enabling a user to preview display of selected content based on another user’s authorization level
Invented by Michael La Rotonda, Neal Sample, Paul Brody, Ellen Sue Perelman, Ericson DeJesus
Assigned to Yahoo
US Patent Application 20060184578
Published August 17, 2006
Filed on December 20, 2005
Abstract
Enabling a first user to preview content as it would be seen by a second user, if the second user had a selected user relationship with the first user. The selected user relationship may comprise a relationship degree, a relationship category, a relationship rating, and/or other form of relationship. In one embodiment, a user interface enables the first user to assign user relationships to portions of content and to other users. The first user selects a user relationship, which is used to access those portions of content that are associated with the first user and assigned the selected user relationship. The corresponding portions of content are used to generate a preview display for the first user, illustrating the portions of content that would be accessible to other users assigned the same user relationship or assigned a closer user relationship. Preview may be generated by a server or a local client.
Control for inviting an unauthenticated user to gain access to display of content that is otherwise accessible with an authentication mechanism
Invented by Michael La Rotonda, Neal Sample; ; (Santa Cruz, CA) ; F. Randall Farmer, Paul Brody, and Ellen Sue Perelman
Assigned to Yahoo
US Patent Application 20060184997
Published August 17, 2006
Filed on December 20, 2005
Abstract
Enabling an unauthenticated user to access content associated with an authenticated user as though the unauthenticated user had a selected user relationship with the authenticated user. The user relationship may comprise a relationship degree, a relationship category, a relationship rating, and/or the like. An invitation to join an electronic service, such as an online social network, is sent to the unauthenticated user at an address known to the authenticated user. The invitation includes a time-limited token, such as a URL, that includes an invitation identifier, which relates the invitation to the authenticated user content. The token may be encrypted in the invitation. The unauthenticated user returns the token as a request to preview the authenticated user content without first becoming an authenticated user of the electronic service. If the token is still valid, access is granted. The unauthenticated user may also request to establish a connection with the authenticated user.
Ask.com
Methods and apparatuses to determine adult images by query association
Invented by Kaushal Kurapati and Rahul Lahiri
US Patent Application 20060184577
Published August 17, 2006
Filed on May 18, 2005
Abstract
Various methods and apparatuses are described for an adult content detection implementation. In one embodiment, a method detects adult content images by tracked query association to a user’s query for an image search. The set of images returned in response to the user’s query on a search engine are based on whether one or more images in the set are classified as an adult content image.
Oracle
System and method for search and recommendation based on usage mining
Invented by Omar Alonso and Atul Kumar
Assigned to Oracle
United States Patent 7,092,936
Granted August 15, 2006
Filed on August 22, 2001
Abstract
A method, system, and computer program product for performing searching that generates improved queries, retrieves meaningful and relevant information, and presents the retrieved information to the user in a useful and comprehensive manner is described. The method of searching comprises the steps of: receiving from a user a search query requesting information, retrieving at least one recommendation relating to the search query, generating an expanded query based on the received query, performing a search using the expanded query to retrieve documents, and generating themes relating to the retrieved documents. The at least one recommendation relating to the search query is retrieved from a recommendation database. The recommendation database is generated by performing the steps of: performing data mining using users search query logs, user search patterns, and user profile information to generate a plurality of recommendations relating to search query strings, generating a data structure including the recommendations relating to search query strings, and generating a text index based on information in the data structure.