Amazon Web Services has another outage
By Nancy Gohring
,
IDG News Service
, 04/07/2008
- Share/Email
- Tweet This
- Print
Amazon's cloud computing service was down on Monday morning for more than an hour, following an outage on its hosted storage
service two months ago.
While Amazon appears to have learned some lessons since the previous outage, the incidents underscore the immaturity of the
services, an analyst said.
"In terms of Amazon, what you need to know is that this is very new," said Phil Shih, an analyst with Tier 1 Research, a division
of The 451 Group. "It's not something they've perfected. Because of this, we don't advise anybody to use this for anything
mission-critical."
Amazon's Elastic Compute Cloud is a Web service that offers hosted computing. Users can quickly scale up or down the amount
of processing power that they need, based on their changing requirements.
On Monday at around 2 a.m. Pacific Time, the first EC2 customer reported problems accessing the service on Amazon's Web services forum. Others quickly chimed in.
Within 15 minutes, an Amazon employee acknowledged reading about the problems and said the company was investigating them.
That note, and subsequent messages at regular intervals, seemed to placate some customers. "Not all doom and gloom," one person
wrote on the forum. "It should be noted that [Amazon Web Services] are keeping us up to date... 10 out of 10 for communication.
Bravo!"
That's a very different type of response than customers had after the S3 outage in mid-February, when some users were quite
angry at a lack of acknowledgement and information from Amazon about the outage, which lasted for as long as three hours.
At 3:21 a.m. Pacific Time on Monday, the first customer posted a note saying that the EC2 service was back up. Others followed.
On the forum, Amazon said it would post more details about what caused the problem, but hadn't by Monday afternoon. An Amazon
spokesman said he was working to get answers to questions about the outage.
Still, improvements in communication don't change the reliability of the services. Shih recommends that companies only consider
using Amazon's Web services for small internal development products, where a company can absorb the risks and potential downtime.
But that recommendation could change in the future. "Do I expect them to raise their game and get better over time? Absolutely,"
Shih said. "They're pouring resources into this, and they're serious about it."
While these types of outages are a black eye for Amazon, they likely don't cost the company in terms of service level agreement
payouts, Shih said. Late last year, Amazon created an SLA that lets companies apply for credits in the event of an outage.
"Most people won't bother to get their money back," Shih said. "It's such a small amount, and it requires more paperwork to
get the credit." But an SLA is something Amazon has to offer in order for companies to consider it a true enterprise-class
service, he said.
The IDG News Service is a Network World affiliate.
Partner Content
www.bmc.com
Gartner 2009 Magic Quadrant for Job Scheduling
Gartner has positioned BMC CONTROL-M in the Leaders Quadrant of their "2009 Magic Quadrant for Job Scheduling." The report assesses the ability to execute and completeness of vision of key vendors in the marketplace. Read a full copy today, courtesy of BMC Software.
Download whitepaper
Dell's SMART Approach to Workload Automation
Read a compelling case study by EMA, Inc. to learn how Dell uses BMC CONTROL-M to cut cost and increase productivity with workload automation.
Download whitepaper
Workload Automation Cost Savings 2 Minute Video
A major computer manufacturer uses BMC CONTROL-M and just four people to schedule and run over 85,000 jobs every month. By switching to BMC CONTROL-M, they more than quadrupled the workload without adding a single staff member. See how in this 2-minute video overview.
Go to video
Comment