Go to Google Webmaster Tools and use the built-in robots.txt checker (you can find this in the Crawler Access section under Site Configuration). Google Webmaster Tools can give a quick report on duplicate title tags, missing title tags, short title...
All flash site - Very, very pretty, but not a great experience for a crawler. Just make sure you offer text link navigation options and use a technique like Scalable Inman Flash Replacement (sIFR) to tell the crawler what is in the movie.
This team also works on other custom technologies such as the SiLK Crawler they designed to solve another client problem. These team members are also very skilled, and often specialize in advanced keyword research techniques, Title and META...
Slurp's Change of AddressYahoo's web crawler, affectionately known as Yahoo Slurp, will move from its domain of inktomisearch.com to crawl.yahoo.net. WebFetch, new meta search engine, launches in UKIn addition to web search results from Google...
A query on a crawler-based search engine often turns up thousands or even millions of matching web pages. More about the title tag can be found on the How To Use HTML Meta Tags page. Next: How To Use HTML Meta Tags Previous - Beginning
Submitting And Encouraging Crawlers Explains how to ensure that more of your content with a web site is indexed by crawler-based search engines. Blocking Crawlers With The Meta Robots TagExplains how to use this alternative to the robots.txt file...
ACAP will develop and pilot a system by which the owners of content
published on the World Wide Web can provide permissions information (relating to
access and use of their content) in a form in which it can be recognised and
where necessary...
Those that say they do are merely making
educated guesses at reverse engineering the crawler-based search engines. Occasionally, I get questions about what "numbers" or "rules" should be
followed to construct the perfect page for each crawler...
Microsoft's MSN Search To Build Crawler-Based Search Engine: Hmm. Maybe Google offering both paid and unpaid results
really is an advantage and we should own some technology to produce
crawler-based editorial results.
Gigablast Now Indexing More than 2 Billion Pages May 16, 2005 - Matt Wells over at Gigablast has had his web crawler really cranking lately. Hitwise: Meta Travel Engines Show Gains from MediaPost covers how stats from Hitwise show new meta travel...
When it comes to language, you need to understand that no major crawler-based search engine allows you to specify what language your page is in. So, for those writing in non-English languages, adding charset meta tags possibly might be helpful, but...
Deep Submission Tools: Designed to submit many pages from your site to crawler-based search engines. Multisubmit Tools: Designed to submit your web site to hundreds of crawler-based search engines and human-powered directories at the same time.
There's a meta tag generator, a "Page Primer" analysis tool that offers basic advice for changes that may help with crawler-based search engines, a submission tool to send your URLs to many search engines at once as well as a "deep" submit of many...
Tips on writing copy that pleases crawler-based search engines and humans. Just saying you are Google does nothing to cause your browser to act the way Google's crawler actually does, any more than saying to someone that you are a famous movie star...
AltaVista's Scooter crawler begins trials These previously came from the World Wide Web Worm crawler that GoTo.com acquired in 1997. Dogpile, the meta search engine with the goofy name, opened its virtual door just after Christmas in 1996.
It covers key tips on getting important directory, crawler and paid listings with major search engines. This version of the article for Search Engine Watch members looks at some specific test queries for a rough sense of how the new search...
As for this submit URL that some may remember, it submits your page to the Yahoo crawler, which powers the current MSN Search, MSN says. I've yet to test this, but if so, I think it would make it the only major crawler that can spider frame content...
The former appears to be the internal address those within Microsoft can use to try out its own crawler-based search engine. Lets you search through RDF, RSS, web feed and XML content from across the web and provides the ability to narrow searches...
This leaves the crawler-based search engines entirely dependent on ranking the page in the top results solely off of link data, and I'm guessing not many people are linking over to it using the phrase.
When you search at Yahoo, it will check to see if any of the pages listed from its web crawler also live within its Yahoo Directory. And, of course, Yahoo is using its own crawler-based results now, rather than depending on Google.