Hungry for automation
Bill Randall, director of MIS infrastructure at restaurant chain Red Robin, dishes up the scoop on automated management.
By
Julie Bort
,
Network World
, 08/22/2005
- Share/Email
- Tweet This
- Print
Restaurateur Red Robin is devouring new data center technology. Use of automated management, for example, has led to impressive
performance and financial gains, such as a reduction of network downtime by 50% and a resulting $130,000 saved in 2004. Now
the staff has discovered that security information management (SIM) can greatly cool the sting of Sarbanes-Oxley audits. Bill
Randall, director of MIS infrastructure at the Denver company, shares his insights on the payback of proactive management,
the realities of using today's automation tools and more in an interview with Signature Series Executive Editor Julie Bort.
Red Robin has been hopping with cool, new technology since building your new data center. How are you managing it all?
We started out with NetIQ AppManager Suite two years ago just to help us keep a grip on everything. We have over 40 servers sitting on a rack, and I don't want to pay
an engineer for logging in and checking on them everyday. My engineers are paid to think and work on projects. So it made
sense to have AppManager monitor our systems and check that the processors, temperature, memory and so on look fine. If it
sees that, hey, memory utilization has been over 50% now for over eight hours, it notifies us so we can take the appropriate
action.
Are you using advanced automation - where processes a human used to do by hand are now done automatically?
There are things that we let NetIQ do. For instance, if utilization gets to a certain point on our NAS box and stays there,
we know that it just needs a restart. Somehow memory utilization creeps up on Windows boxes until you reboot. So it monitors
for that and reboots the box at an off-hour and sends us a notification.
I don't know that we're ready to turn the keys entirely over to any automation program. Remember that Matthew Broderick movie
- "WarGames" - where the computer was ready to declare World War III? I don't think that would happen in our network, but
we're still careful. Automation technology lacks is the ability to test before it acts. We need to find out what the impact
is going to be. If you change the way SNMP is done, for example, it may affect the backup server and several other environments.
We look at automation from a standpoint of wanting [the monitoring system] to collect information, make recommendations, and
let us use our human and engineering perspectives to determine whether that recommendation actually applies. Once we've tested
it and know the right course to take, we can script and automate.
You attribute a 50% reduction in downtime, which saved some $130,000 last year, to automation. Can you give me an example?
We looked at what our downtime was a year prior vs. a year after implementing AppManager. Outages that happen with a network
infrastructure are often due to silly things - no one notices the print server is turned on until it finally runs out of disk
space and print services go down because it crashes the operating system. Then you've got to rebuild it from the [backup]
image. We now take a proactive approach and we've alleviated most types of errors like memory or disk space [problems] that
give you page fault errors. With a proactive approach, we'll replace that memory right away, before it can corrupt the application.
Partner Content
Blue Stripe Software
www.bluestripe.com/
Improving Application Performance Troubleshooting
Diagnosing why an application is slow is hard, at times taking days or weeks to isolate and resolve. This paper explains the challenges involved using current management tools, provides a 'wish list' for application management and analysis, and explains the need for an application system-wide approach that monitors entire applications, not components.
Download Whitepaper
Virtual Vigilance: Managing Application Performance in Virtual Environments
This paper highlights the impact of virtualization on application performance. "Managing Application Performance in Virtual Environments" states: "Best-in-Class organizations are predominately taking actions around improving visibility across both physical and virtual systems, assessing the business impact of application performance and understanding interdependencies of applications in virtualized environments."
Download Whitepaper
Application Service Requests: The Missing Link for Pragmatic ITSM
Forrester Research analyst Glenn O'Donnell and BlueStripe co-founder Vic Nyman discuss a breakthrough approach to application problem management. Learn the new approach for ITSM problem management, which provides: Rapid isolation of application slow-downs to specific components for quick problem resolution, 24/7 monitoring for proactive notification of potential issues before end users are impacted and much more.
Register for Webcast
Comment