Google searches for an enterprise space

By Thomas Powell, Network World, 08/01/2005

The Google Search Appliance packages up the company's famously accurate technology into an easy-to-use search engine for intranets and public-facing corporate sites. In our Clear Choice test of the GB-1001 model, we found that while the searching and indexing features live up to the Google name, the product lacks polish and advanced management features.


Google Mini - A cheap GSA?
How we did it


The appliance's honeycomb case caught our eye, but the whimsy wore off as we began to notice rough edges. For example, the appliance takes several minutes to start up and run its various system checks. To alert you that it is done, it plays a little tune. In testing in our server room and at a colocation facility, we couldn't hear the tune over the dull roar of such environments and had to probe manually for the system's state.

The GB-1001 has no obvious indicator lights or small LCD screen on the unit. There is no on-off switch either, presumably because the designers intended you to go through the proper shutdown procedure. We experienced an unplanned UPS failure; when power was restored, the box recovered properly, but only after an automated rebuild of its RAID system that lasted several hours. When you do trigger a shutdown through the Web administration system, be careful not to cut power too early; otherwise, you will have a RAID rebuild on your hands.

Polish was lacking elsewhere, too. Within the administration system, confirmations of configuration changes didn't appear in a logical place, form fields were slightly misaligned or oddly arranged, warning messages did not appear reliably, help information was too terse or lacked good examples, result output previews didn't always work, and, in some cases, error messages lacked detail.

There were some bright spots, including clear installation documentation, color-coded cables and a built-in DHCP server that allowed us to plug in a laptop and quickly configure the network settings.

After installation, your first step in the Web-based GUI will likely be to define a search index by specifying the starting URLs, along with the URL patterns and file types that the crawler should record or discard (see "How we did it").
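To make the idea of crawl scoping concrete, here is a minimal Python sketch of how starting URLs, follow patterns and exclude patterns might combine. It is not the appliance's configuration syntax or matching logic; the host names, the patterns and the `should_index` helper are all hypothetical, and shell-style wildcards are assumed purely for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical crawl scope; none of these values come from the appliance.
START_URLS = ["http://intranet.example.com/"]
FOLLOW_PATTERNS = ["http://intranet.example.com/*"]
EXCLUDE_PATTERNS = ["*/private/*", "*.zip", "*.exe"]


def should_index(url):
    """Return True if url matches a follow pattern and no exclude pattern."""
    if not any(fnmatchcase(url, p) for p in FOLLOW_PATTERNS):
        return False  # outside the crawl scope entirely
    # Inside the scope: keep it unless an exclude pattern discards it.
    return not any(fnmatchcase(url, p) for p in EXCLUDE_PATTERNS)


if __name__ == "__main__":
    for candidate in START_URLS + [
        "http://intranet.example.com/docs/guide.pdf",
        "http://intranet.example.com/private/salaries.xls",
        "http://other.example.com/index.html",
    ]:
        print(candidate, should_index(candidate))
```

In this sketch the exclude patterns win over the follow patterns, which is one reasonable convention; a real crawler may resolve conflicts differently.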

According to Google, the crawler is capable of indexing 220 types of content. In our test we saw no limitation in the crawler, and found that the device tended to discover files that we were not aware of in some test data sets.

You will likely want to break up the indexed documents into different collections based upon a URL pattern. The GB-1001 allows for an unlimited number of collections.

The crawler is quite adept at dealing with secured content. It handles Secure-HTTP connections and can negotiate basic authentication, NT LAN Manager authentication, and custom cookie and form-based access. The GB-1001 can crawl content from databases, including Oracle, SQL Server, MySQL, IBM DB2 and Sybase. If you happen upon a data type the crawler cannot access, you can feed it directly to the device in an XML format.

Google does limit its appliances by document count, starting with 500,000 for the base unit (for smaller deployments, use the Google Mini). You can, of course, upgrade your license and associated hardware to build out a search infrastructure that supports millions of documents. When you size your appliance, be aware that if you plan on doing direct database indexing, Google will count each record as a document, so you might chew up a license very quickly.
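The sizing arithmetic is simple but easy to overlook. The Python sketch below adds crawled pages to database records, one document per record, and compares the total with the 500,000-document base limit. The page count and the table names and sizes are invented for illustration, and both helper functions are hypothetical.

```python
BASE_LICENSE_LIMIT = 500_000  # base-unit document limit cited in this review


def documents_needed(web_pages, db_rows_by_table):
    """Count one document per crawled page and one per database record."""
    return web_pages + sum(db_rows_by_table.values())


def fits_license(total_documents, limit=BASE_LICENSE_LIMIT):
    """Return True if the total stays within the licensed document count."""
    return total_documents <= limit


if __name__ == "__main__":
    # Invented figures: a modest intranet plus two database tables.
    total = documents_needed(120_000, {"orders": 300_000, "customers": 150_000})
    print(total, fits_license(total))
```

With these made-up numbers, the two tables alone account for 450,000 documents, so a site with only 120,000 pages would already exceed the base license.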

One aspect of the crawl process that we especially liked was the diagnostics facility, which not only showed us what the crawler was doing but also helped us isolate indexing problems such as broken links, server issues and access-denied errors.

The GB-1001 provides a great deal of flexibility for the search page and result listings. Some administrators may be happy to use the page layout helper to modify the logo and basic aspects of the search page. However, most folks will probably want to modify the results to fully integrate them into the look and feel of the site. If you are familiar with Extensible Stylesheet Language Transformations (XSLT), you can modify a nearly 3,000-line template that controls just about every aspect of the search form and results. If this doesn't suit you, just use the raw XML returned from the appliance and do whatever you like with it, including feeding it into another system.
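For readers who prefer the raw-XML route, the following Python sketch shows the general shape of consuming XML results in another system. The sample document and its element names (`results`, `result`, `url`, `title`) are placeholders invented for this example and are not the appliance's actual output schema; you would substitute the real element names from the product documentation.

```python
import xml.etree.ElementTree as ET

# Placeholder XML; the element and attribute names are assumptions.
SAMPLE_XML = """<results>
  <result rank="1">
    <url>http://intranet.example.com/hr/benefits.html</url>
    <title>Benefits overview</title>
  </result>
  <result rank="2">
    <url>http://intranet.example.com/hr/holidays.html</url>
    <title>Holiday calendar</title>
  </result>
</results>"""


def parse_results(xml_text):
    """Turn the placeholder result XML into a list of plain dictionaries."""
    root = ET.fromstring(xml_text)
    hits = []
    for node in root.findall("result"):
        hits.append({
            "rank": int(node.get("rank")),
            "url": node.findtext("url"),
            "title": node.findtext("title"),
        })
    return hits


if __name__ == "__main__":
    for hit in parse_results(SAMPLE_XML):
        print(hit["rank"], hit["title"], hit["url"])
```

Once the results are ordinary data structures, they can be rendered by an existing page template or passed along to another application.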

Google's approach is to implement searches in an easy-to-use "black box" fashion, which could place constraints on a private search. You turn the appliance loose, and it ranks based upon the Google algorithm. We were pleased that the accuracy of the test search lived up to what we see in everyday use of the Google Internet search. It easily found buried test phrases and correctly identified primary documents.
