Thunderstone chases Google's search watermark
By Thomas Powell
,
Network World
, 10/31/2005
- Share/Email
- Tweet This
- Print
With the continuing explosion of unstructured Web-based content in the enterprise, a quality search engine is no longer a
luxury, but a necessity. Encouraged by reader feedback after our recent Google Search Appliance Clear Choice Test, we tested a similar product, the Thunderstone Search Appliance.
Archive of Network World testsSubscribe to the Network Product Test Results newsletter
Overall, the Thunderstone Software appliance is a capable, flexible and fast search platform, though at times it is hampered
by its lack of polish in the areas of administration and security.
Immediately upon installation, it is clear the Thunderstone appliance does not hide its implementation details well. Packaged
in a custom blue case is a fairly stock RedHat Linux box equipped with open source Webmin interface for addressing system
tasks.
To configure the search functionality, we had to use the supplied, very rudimentary Web-based interface, which simply does
not do justice to the power of the search provided. While some users may be initially attracted to what appears to be a simple
form-based interface, we found the forms cluttered and confusing, containing little or no field grouping, and rife with little
annoyances, most notably one-line-high scrolling text areas that don't allow you to see a field's contents at once. During
testing, we also found pages occasionally not displaying the requested information.
However, once you get beyond the interface issues, you will see that the system allows for detailed customization of indexing
and search results. When building a search index with the Thunderstone appliance, you first indicate the starting URL(s) and
the particular file types to include or exclude during the site walk.
If you take the time to explore the complete walk settings, you will find many features that may help you handle the special
cases you might encounter during a site walk. For example, it is possible to configure the system to remove the contents of
certain types of tags or even remove commonly found text in page navigation, headers and footers.
However, you may find indexing sites with form-based logons very difficult to do, requiring lots of trial and error if you
want to do more than basic Web authentication.
We were happy to find the Thunderstone crawler (the Texis software the company has offered for years) was able to traverse
our test sites fairly easily because it can be configured to execute JavaScript content, including external .js files, or
examine strings within JavaScript for URLs to traverse. While in practice this helped the program move around sites, there
were situations where the crawler made mistakes with JavaScript content and noted many pages in error. For example, on one
test site that used Google AdSense, the crawler pulled out data that was not a URL to crawl. However, these problems were
forgivable given that many crawlers cannot even index sites that rely too much on JavaScript for navigation.
Partner Content
Blue Stripe Software
www.bluestripe.com/
Improving Application Performance Troubleshooting
Diagnosing why an application is slow is hard, at times taking days or weeks to isolate and resolve. This paper explains the challenges involved using current management tools, provides a 'wish list' for application management and analysis, and explains the need for an application system-wide approach that monitors entire applications, not components.
Download Whitepaper
Virtual Vigilance: Managing Application Performance in Virtual Environments
This paper highlights the impact of virtualization on application performance. "Managing Application Performance in Virtual Environments" states: "Best-in-Class organizations are predominately taking actions around improving visibility across both physical and virtual systems, assessing the business impact of application performance and understanding interdependencies of applications in virtualized environments."
Download Whitepaper
Application Service Requests: The Missing Link for Pragmatic ITSM
Forrester Research analyst Glenn O'Donnell and BlueStripe co-founder Vic Nyman discuss a breakthrough approach to application problem management. Learn the new approach for ITSM problem management, which provides: Rapid isolation of application slow-downs to specific components for quick problem resolution, 24/7 monitoring for proactive notification of potential issues before end users are impacted and much more.
Register for Webcast
Comment