SES Chicago - December 7-11, 2009

August 28, 2006

New Search Patent Filings: August 28, 2006 - Identifying Web Spam and Adult Images

New Microsoft patent applications include one that attempts to identify web spam based upon signals within the content of a page, another looks at ways to search using pattern matching and relevance to answer specific questions, a third describes a method of relating people to each other during a search based upon things such as being co-authors of documents, a fourth defines a process of refining searches based upon previous searches for the same query or by providing additional context to searchers, and a fifth allows users or communities of users to provide reviews of web pages independently of the owners of sites being reviewed.

Yahoo (Overture) was granted a patent on the bidding and ranking of pages through paid search. Yahoo also had published two patent applications which explore social networks, and previewing, inviting and granting authorization for others to view specific pages within that social network.

Ask.com looks at adult images, and a way to identify them as being adult content without performing a visual analysis of those images, but instead by looking a query sessions related to the pictures.

Oracle was granted a patent which mines information about users from query logs and user profiles to retrieve recommendations for pages, expansion of queries based upon that information, and a thematic clustering of those search results.

Microsoft

The following patent application covers some very similar ground as a white paper from Microsoft Research titled Detecting Spam Web Pages through Content Analysis (pdf). It looks at a number of ways that web spam might be identified from the content on a page, though the authors note that the methods involved would likely be used in conjunction with other indications of web spam, perhaps like the ones discussed in Spam, Damn Spam, and Statistics, and in an earlier patent application on Content Evaluation.

Using content analysis to detect spam web pages Invented by Marc Alexander Najork, Dennis Craig Fetterly, Mark Steven Manasse, and Alexandros Ntoulas Assigned to Microsoft US Patent Application 20060184500 Published August 17, 2006 Filed on February 11, 2005

Abstract

Evaluating content includes receiving content, analyzing the content for web spam using a content-based identification technique, and classifying the content according to the analysis. An index of analyzed contents may be created. A system for evaluating content includes a storage device configured to store data and a processor configured to analyze content using content-based identification techniques to determine whether web spam is present.

Search methods and associated systems Invented by Larry Israel and John Solaro Assigned to Microsoft US Patent Application 20060184523 Published August 17, 2006 Filed on February 15, 2005

Abstract

Search methods and associated systems are disclosed. One aspect of the invention is directed toward search methods and associated systems. One aspect of the invention is directed toward a computer-implemented searching method that includes receiving an input having a format. The method further includes finding a pattern that matches the format of the input using a rule set. The method still further includes determining a subject of the input based on the pattern, finding a result record corresponding to the subject, and sending an output based on the result record. In certain embodiments, the method can further include determining at least one qualifier based on the pattern and finding a result record corresponding to the subject and the at least one qualifier. In still other embodiments, the method can further include determining a subject of the input based on the pattern and at least one synonym rule.

Method and system for mining information based on relationships Invented by Benyu Zhang, Wei-Ying Ma, Gu Xu, Hongbin Gao, Zheng Chen, Randy Hinrichs, Hua-Jun Zeng Assigned to Microsoft US Patent Application 20060184481 Published August 17, 2006 Filed on February 11, 2005

Abstract

A method and system for identifying information about people is provided. The information system identifies groups of people that have relationships based on their relationships to documents or more generally to objects. The information system initially is provided with an indication of which people have which relationships to which documents. The information system then identifies clusters of people based on having a relationship to the same objects. The information system may also identify clusters of related objects associated with a cluster of people. When a user wants to identify information about a person, the user can provide the name of that person to the information system. The information system then can retrieve and display the names of the other people who are in the same cluster as the person.

Content searching and configuration of search results Invented by Greg A. Kohanim, Jonathan L. Wiedemann, Christine A. Jefson, and David Aaron Ward Snelling Assigned to Microsoft US Patent Application 20060184512 Published August 17, 2006 Filed on February 17, 2005

Abstract

Content searching and configuration of search results are described. In an implementation, a method includes in response to a search query, selecting a keyword based on heuristic data which describes a plurality of previously performed searched. A search is performed utilizing the search query and the selected keyword to locate content.

Method and system for contextual site rating Invented by Peter G. Williams, Mark A. Wilson-Thomas, Martin Peck, Robert J. Wilcox, Andrew Burns, Martin Grayson Assigned to Microsoft US Patent Application 20060184608 Published August 17, 2006 Filed on February 11, 2005

Abstract

The present invention allows a user or community of users to rate content across a variety of web sites and display contextual sensitive reviews. Rather than the rating information being controlled by the web site owner, the rating information may be owned and controlled by a third party. Users have the ability to rate a web site, review ratings from a web site, or operate a web site rating system.

Yahoo

System and method for influencing a position on a search result list generated by a computer network search engine Invented by Darren J. Davis, Matthew Derer, Johann Garcia, Larry Greco, Tod E. Kurt, Thomas Kwong, Jonathan C. Lee, Ka Luk Lee, Preston Pfarner, and Steve Skovran Assigned to Overture United States Patent 7,092,901 Granted August 15, 2006 Filed on July 24, 2001

Abstract

A system and method for enabling information providers using a computer network such as the Internet to influence a position for a search listing within a search result list generated by an Internet search engine. The system and method of the present invention provides a database having accounts for the network information providers. Each account contains contact and billing information for a network information provider. In addition, each account contains at least one search listing having at least three components: a description, a search term comprising one or more keywords, and a bid amount. The network information provider may add, delete, or modify a search listing after logging into his or her account via an authentication process. The network information provider influences a position for a search listing in the provider's account by first selecting a search term relevant to the content of the web site or other information source to be listed. The network information provider enters the search term and the description into a search listing. The network information provider influences the position for a search listing through a continuous online competitive bidding process. The bidding process occurs when the network information provider enters a new bid amount, which is preferably a money amount, for a search listing. The system and method of the present invention then compares this bid amount with all other bid amounts for the same search term, and generates a rank value for all search listings having that search term. The rank value generated by the bidding process determines where the network information providers listing will appear on the search results list page that is generated in response to a query of the search term by a searcher located at a client computer on the computer network. A higher bid by a network information provider will result in a higher rank value and a more advantageous placement.

Control for enabling a user to preview display of selected content based on another user's authorization level Invented by Michael La Rotonda, Neal Sample, Paul Brody, Ellen Sue Perelman, Ericson DeJesus Assigned to Yahoo US Patent Application 20060184578 Published August 17, 2006 Filed on December 20, 2005

Abstract

Enabling a first user to preview content as it would be seen by a second user, if the second user had a selected user relationship with the first user. The selected user relationship may comprise a relationship degree, a relationship category, a relationship rating, and/or other form of relationship. In one embodiment, a user interface enables the first user to assign user relationships to portions of content and to other users. The first user selects a user relationship, which is used to access those portions of content that are associated with the first user and assigned the selected user relationship. The corresponding portions of content are used to generate a preview display for the first user, illustrating the portions of content that would be accessible to other users assigned the same user relationship or assigned a closer user relationship. Preview may be generated by a server or a local client.

Control for inviting an unauthenticated user to gain access to display of content that is otherwise accessible with an authentication mechanism Invented by Michael La Rotonda, Neal Sample; ; (Santa Cruz, CA) ; F. Randall Farmer, Paul Brody, and Ellen Sue Perelman Assigned to Yahoo US Patent Application 20060184997 Published August 17, 2006 Filed on December 20, 2005

Abstract

Enabling an unauthenticated user to access content associated with an authenticated user as though the unauthenticated user had a selected user relationship with the authenticated user. The user relationship may comprise a relationship degree, a relationship category, a relationship rating, and/or the like. An invitation to join an electronic service, such as an online social network, is sent to the unauthenticated user at an address known to the authenticated user. The invitation includes a time-limited token, such as a URL, that includes an invitation identifier, which relates the invitation to the authenticated user content. The token may be encrypted in the invitation. The unauthenticated user returns the token as a request to preview the authenticated user content without first becoming an authenticated user of the electronic service. If the token is still valid, access is granted. The unauthenticated user may also request to establish a connection with the authenticated user.

Ask.com

Methods and apparatuses to determine adult images by query association Invented by Kaushal Kurapati and Rahul Lahiri US Patent Application 20060184577 Published August 17, 2006 Filed on May 18, 2005

Abstract

Various methods and apparatuses are described for an adult content detection implementation. In one embodiment, a method detects adult content images by tracked query association to a user's query for an image search. The set of images returned in response to the user's query on a search engine are based on whether one or more images in the set are classified as an adult content image.

Oracle

System and method for search and recommendation based on usage mining Invented by Omar Alonso and Atul Kumar Assigned to Oracle United States Patent 7,092,936 Granted August 15, 2006 Filed on August 22, 2001

Abstract

A method, system, and computer program product for performing searching that generates improved queries, retrieves meaningful and relevant information, and presents the retrieved information to the user in a useful and comprehensive manner is described. The method of searching comprises the steps of: receiving from a user a search query requesting information, retrieving at least one recommendation relating to the search query, generating an expanded query based on the received query, performing a search using the expanded query to retrieve documents, and generating themes relating to the retrieved documents. The at least one recommendation relating to the search query is retrieved from a recommendation database. The recommendation database is generated by performing the steps of: performing data mining using users search query logs, user search patterns, and user profile information to generate a plurality of recommendations relating to search query strings, generating a data structure including the recommendations relating to search query strings, and generating a text index based on information in the data structure.

Posted by Bill Slawski at 6:29 PM | Permalink

January 26, 2006

Ask Jeeves' New Image Search vs. the Competition

Image search is offered by all of the major search engines, and people tend to think it's a fairly generic service, with little difference in results between any of the engines. That's now changed with the introduction of a new image search feature by Ask Jeeves, that makes it clearly different from any of the others. I've got an overview of the new service including a comparison of the major image search services in today's SearchDay article, Searching For a Better Image.

Posted by Chris Sherman at 8:17 AM | Permalink

January 24, 2006

Ask Jeeves Launches Their Own Image Database, New Refinement Features Also Available

Ask Jeeves has released their own internally built database of images and is no longer using imagery provided by PicSearch to power Ask Jeeves Picture Search. For the last 10 months, AJ was utilizing their own image retrieval technology on top of content provided by PicSearch.

Last March, we blogged about AJ beginning to use their own image retrieval technology/algorithm and included a link to an AJ blog post where you could compare relevancy with the AJ algorithm vs. the PicSearch algo.

Image Search accounts for 16% of all searches on Ask Jeeves.

In addition to their new internally built image database and retrieval technology, AJ Pictures also now offers:

+ "Zoom" technology to help the searcher focus and refine their picture search. Zoom is also available on web search results pages. More about Zoom in this SearchDay article.

+ More Picture Search Smart Answers Said a different way, more iinline images on web results pages if the query string suggests that the searcher might be looking for imagery. This is a feature that AJ has been offering for almost three years.

Here's an example. I also noticed that some general web searches, like this one for "dogs". Note the Smart Answers box at the top of the page with a pull down menu to go directly to the pictures database and find images of man's best friend organized by breed. The Smart Answer box also contains a pull down menu with direct access to Smart Answers about specific breeds.

Posted by Gary Price at 1:29 PM | Permalink

March 4, 2005

Ask Jeeves Improves Relevancy With Image Search

A e-mail note from Ask Jeeves team along with a post on the AJ blog lets us know that Ask is now using their own relevancy algorithm on top of the image database that's provided by Picsearch. The Picsearch database and algorithm is also used with MSN's Image Search.

The AJ blog post offers a couple of before and after examples.

Posted by Gary Price at 8:58 AM | Permalink

See More Posts From:

This Week | This Month

  var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www."); document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E")); var pageTracker = _gat._getTracker("UA-564586-7"); pageTracker._setDomainName(".searchenginewatch.com"); pageTracker._trackPageview(); window.collarity_appid = "incmedia"; //> //>

Senior Digital Planner
U.S. International Media Los Angeles, United States

Senior Search Analyst
U.S. International Media Los Angeles, United States New York, United States

Webmaster - Marketing
West Virginia School of Osteopathic Medicine Lewisburg, United States

Web Marketing Manager
Harvard Business Publishing Watertown, United States


0