- How to make new stuff from your piles of obsolete tech
- Why your computer sucks
- 10 recession-proof IT skills
- Juniper execs share network vision
- 9-year-old plots his fifth Microsoft certification
When fire again threatened to char Santa Barbara, Calif., last month, the IT staff at Fielding Graduate University literally raced flames out of town with its network stashed in a cardboard box safely resting on the back seat of the getaway car.
While it might sound like a dramatic rescue, it was more the result of a disaster-recovery plan, a newly virtualized network and a quick-acting network operations team.
Seven steps to disaster recovery planning
And not wanting to settle just for an escape, the team got operations back on line in a day at an off-site location where they kept the online university functioning through a nine-day ordeal.
"We joked that it was our network-in-a-box, but that is the power of virtualization and a blade environment," said Deby DeWeese, director of network services for the university.
For Fielding, the network is the university as graduate students and faculty collaborate over the Internet. The school just months prior to the fire had completed consolidating 30 servers running its Windows network onto a virtualized network based on Microsoft's Hyper-V technology and four HP ProLiant BL460c blades that included an HP MSA 1500cs SAN. In all, the network and a data protection system housed 2.4 terabytes of data.
The network-in-a-box version was born May 5, the day the Jesusita Fire broke out.
DeWeese's boss – Dan Sewell, associate provost for research and chief learning officer – returned to the data center that night when the flames came closer to the city. Over the phone from her rural Santa Barbara County home, DeWeese walked Sewell through pulling the blades and unhooking the school's servers and disk array that runs its Microsoft Data Protection Manager (DPM) system.
The next day they returned, plugged everything back in and started to work.
But the fire continued to rage and by that afternoon Fielding's operations center was a block from the fire's evacuation zone.
"We could see flames from our office windows and ash was falling from the sky," DeWeese said. "We couldn't keep doing network-in-a-box,
we needed to get something up and running somewhere else."
DeWeese and Sewell again did the network-in-a-box drill, while HR made sure staff was safe.
And what happened next was a combination of some nimble negotiating and finding friends in the right places that resulted in the university returning online from a new location in about 25 hours.
DeWeese managed the communications from her home, where the only Internet access is via satellite and where she still had reliable phone lines. She even helped displaced colleagues find hotel rooms around Santa Barbara. Sewell kept communication open with school leadership for decision making.
Ironically, the day the school got back online was the same day it had a scheduled data dump that would have backed up all the university's data to a site in Las Vegas run by provider SBWH, which has roots in Santa Barbara.
That was now on hold, but DeWeese called SBWH, the school's disaster-recovery vendor, and they offered engineer Paul Fisher to help clear rack space the company rents 10 miles from Santa Barbara with Tw Telecom, which just happens to be the provider of Internet lines for the university and of infrastructure for emergency services such as fire fighting.
Partner Content
www.bmc.com
Gartner 2009 Magic Quadrant for Job Scheduling
Gartner has positioned BMC CONTROL-M in the Leaders Quadrant of their "2009 Magic Quadrant for Job Scheduling." The report assesses the ability to execute and completeness of vision of key vendors in the marketplace. Read a full copy today, courtesy of BMC Software.
Download whitepaper
Dell's SMART Approach to Workload Automation
Read a compelling case study by EMA, Inc. to learn how Dell uses BMC CONTROL-M to cut cost and increase productivity with workload automation.
Download whitepaper
Workload Automation Cost Savings 2 Minute Video
A major computer manufacturer uses BMC CONTROL-M and just four people to schedule and run over 85,000 jobs every month. By switching to BMC CONTROL-M, they more than quadrupled the workload without adding a single staff member. See how in this 2-minute video overview.
Go to video
Comments (5)
Virtualization/back-up and recovery as in unplug and run?By Smithwill on June 17, 2009, 7:07 pmOK. This story had a happy ending. However, if the point of the story is to highlight the value of virtualization, I'd also like to introduce another question: What...
Reply | Read entire comment
better solution?By Anon on June 17, 2009, 7:57 pm Why not utilize the rented rack space to mirror traffic from the school to secondary devices? BOOM the college is on fire and with a few DNS changes the mirror...
Reply | Read entire comment
25 hours is something to be proud of? I can fail over my entireBy Anonymous on June 18, 2009, 6:44 pm25 hours is something to be proud of? I can fail over my entire operations to a DR site in about 25 seconds and fail it back 25 seconds later. And a lot of it...
Reply | Read entire comment
remember, cut over to back site had not happenedBy Anon on June 22, 2009, 6:47 pmthe school was days away from having all its data backed up at a new off-site location. If the fire had been a week later, the failover could have been 25 seconds....
Reply | Read entire comment
sheer lack of planningBy Anonymous on July 19, 2009, 11:10 amThis article boggles me. Seems to me that this should be a cautionary tale, not chest thumping, fist buping and high fives! This statement absolutely boggles my...
Reply | Read entire comment
View all comments