If that sounds exciting, you could've been as excited years ago, when Nutch, which powers Wikia Search, was launched in 2005. Searches in Nutch return results which can be analyzed to see what on-page elements were factored and weighted to give a...
For developers, the project presents a programming challenge that will help both Wikia and the open source communities around search applications like Nutch and Lucene, which Wikia will incorporate into its search engine.
One way to determine how many pages *approximately* exist is to use an open source crawler, such as Nutch or Heritrix, to crawl your pages. In Part I of this article, I began debunking the arguments against having an in-house search engine...
In March 2006, this archive became keyword searchable using Nutch technology. Another week has passed which means it's time for a look at a few new or updated specialty search tools that they have posted about on ResourceShelf.
The search technology is currently using NUTCH and NUTCHWAX, open source solutions. College Basketball Lucene, Hadoop and Nutch. SES: Forward Planning + Top Stories + More From The Search Engine Watch Blog + Daily SearchCast: Search News Via...
The search technology is currently using NUTCH and NUTCHWAX, open source solutions. Gary Price reports on a new search engine named Web Harvest that enables you to search governmental and military Web pages.
Nutch's Doug Today's search podcast covers Google to be ordered to hand over search query data; Google buys 3D software company; Google Payments offers PayPal alternative; ads come to Microsoft Live products; Yahoo employee shifts; Yahoo testing...
Doug will most likely continue working from home on his open source projects: Lucene, Hadoop and Nutch. Jeremy Zawodny notes that Doug Cutting, who has been working at Yahoo for four years as an independent contractor, has now signed on with Yahoo...
Nutch Alexa Web Search Platform that's available to anyone willing to pay a fee. Pay a fee for what? You can create your own search engine by tapping into the billion web pages Alexa has indexed over time.
Btw, the search on the Hurricanes Katrina & Rita Web Archive is powered by Nutch. Brewster and crew at The Internet Archive have just debuted a new specialty collection that contains more than 25 million fully archived web pages that are also full text...
Btw, the Creative Commons site also offers a search engine that's powered by Nutch.org open source technology. Yahoo has partnered with Creative Commons and released a new resource that restricts a web search (using the Yahoo web index) to content...
Cutting's own Nutch project that I've written about is based on Lucene. Got the itch to go head-to-head with Google, Yahoo and all of the other big search players on the web? A new book provides a detailed blueprint for using and customizing Lucene...
The Creative Commons search engine (powered by Nutch, which we've previously covered) makes it easy to find this content. Looking for photos, music, text, books and other content that's free to share or modify for your own purposes?
A news release on the Creative Commons site lets us know that an updated version of their Nutch-based search tool is now online. In This Issue Beta Test Search Topics Area! SES Chicago Happens This Month + Search Engine Watch Articles + More From...
A news release on the Creative Commons site lets us know that an updated version of their Nutch-based search tool is now online. CC Search was "soft launched" in September. Creative Commons also announced that their database is now included in the...
Not since Nutch have people gotten such a good view of the different components in ranking for a major search engine. New MSN Search Goes LIVE in Beta Search Engine Watch Forums So, what do you think of the new MSN Search (beta)?
ObjectsSearch, powered by Nutch technology, now offers clustered results and several "specialty" databases, including image search. The developers of small general-purpose web engines continue to impress with their innovative spirit.
This new search engine, powered by technology from Nutch, lets you scan the web to do just that. In This Issue Search Engine Watch News + Search Engine Strategies Set For Stockholm, Chicago + Search Engine Watch Articles + Search Engine Articles...
Q&A with open source search engine founder and creator Doug Cutting, with some interesting comments on Google wanting to help but fearing it will help their competitors, the idea that Nutch APIs may be coming and comments on dealing with spam.