A new search engine focuses solely on investment-related information from more than 12,000 companies around the world.
The Scannery currently contains info about more than 12,000 companies in over 50 countries. The focus is on investor-specific targets, organized by country and industry sector. Content for the database is gathered by a custom crawler, which indexes html, pdf, ppt, doc, and other format types.
The service offers numerous search options. For example, you can limit by inclusion in a particular index (SP 500, Global 1000, Euro 400, etc.) country, and/or industry.
Boolean connectors along with a proximity operator (within a certain number of terms) are also available. Wildcards, stemming, natural language, fuzzy match, and synonym searching also add to the database’s potential usefulness.
Joseph Pols, developer of the database, is committed to rapidly improving the service. “A very significant development is the addition of a new ‘sister’ website to The Scannery,” he writes in an email. “We found that some users were getting confused when they searched, for example, FORD but did not find the Ford Motor Company ranking near the top of the results.
“This is because not all companies allow their websites to be [fully” indexed. Ford is one of them, as is IBM and Proctor and Gamble, among others. Therefore, the only results would be those documents on other websites which contain the search phrase.”
In some cases, this may be because a company has used the robots.txt file to exclude search engine crawlers. Other large company web sites use dynamic content management systems which cause problems for crawlers.
To help overcome this problem, Pols has created a new ‘sister’ website called TimBuckOne. This site, which requires free registration, allows a user to first choose a country, then navigate to a specific public company within that country. You can then perform a search restricted to that company’s local country website. In addition, the user can choose to expand the search to all public companies in that country.
Let’s watch The Scannery closely. Complete documentation for using the site is available here.
NOTE: Article links often change. In case of a bad link, use the publication’s search facility, which most have, and search for the headline.
| Ask Jeeves steps up search competition with Excite deal…
Revolution Mar 8 2004 9:12AM GMT
| ICANN approves domain wait list service…
MultiReg.com Mar 8 2004 9:05AM GMT
| Overture to Introduce Local Search Ad Program…
dmnews.com Mar 8 2004 6:17AM GMT
| Spam is 10 years old…
Sydney Morning Herald Mar 8 2004 1:30AM GMT
| Yahoo Move Rekindles Pay-for-Inclusion Debate…
Yahoo Mar 7 2004 11:41AM GMT
| Search: The Trends…
Internet.com Mar 6 2004 0:24AM GMT
| Disputed Web browser patent invalidated…
MSNBC Mar 6 2004 0:08AM GMT
| Europe Drafting Internet Copyright Law…
theWHIR Mar 5 2004 11:23PM GMT
| Microsoft, IBM Top Off Web Services Metadata Spec…
Internet News Mar 5 2004 9:01PM GMT
| InfoSpace Latest Search Supporter of XML Syndication…
Yahoo Mar 5 2004 6:41PM GMT
| Jeeves extends reach in searches…
San Jose Mercury News Mar 5 2004 11:38AM GMT
| European Union pushes to retain ‘free’ Internet…
ZDNet UK Mar 5 2004 11:27AM GMT
| Ask Jeeves doubles market share…
BBC Mar 5 2004 10:05AM GMT