Amazon gets 'black eye' from cloud outage

Analysts say downtime hurts Amazon, and cloud computing

For a company that's known as the dominant player in the cloud market, Amazon's troubles on Thursday mean a black eye for the company and for the cloud in general.

Trouble started early Thursday morning when popular websites like Quora, Foursquare and Reddit were left staggering or totally knocked out because of server problems in the Amazon datacenter that handles the company's Web hosting services.

While service was restored by 4 p.m. ET to some sites such as Foursquare, Quora was still disabled and Reddit was still being affected.

"Reddit is in 'emergency read-only mode' right now because Amazon is experiencing a degradation," the company noted on its site. "They are working on it, but we are still waiting for them to get to our volumes. You won't be able to log in. We're sorry and will fix the site as soon as we can."

According to AlertSite, a Web performance management company, between 6 a.m. and 1 p.m. ET Thursday, one portion of the Reddit site took more than 60 seconds to load, only to return an error message. Foursquare's homepage had 84.44% availability between 8:15 a.m. and noon, also returning error messages explaining the downtime and slowness.

At 5:16 a.m. ET Thursday, site administrators reported that they were dealing with connectivity issues impacting Amazon's Relational Database Service, which is used to manage a cloud database, across multiple zones in the Eastern United States.

That means some websites were down or partially disabled for at least 11 hours.

While that's a problem for the downed sites, it's probably going to be tougher on Amazon itself, according to Robert Mahowald, an analyst with IDC. "Amazon is held as a paradigm of operational uptime," he said. "When this kind of thing happens, it definitely sends a chill through the whole cloud and hosted services industry.... It's absolutely a black eye. There's no doubt about it."

Mahowald was quick to point out that this kind of outage happens and that it doesn't point to a specific operational problem at Amazon.

"This shouldn't give Amazon a bad reputation, but this is a very, very visible problem," he said. "I don't think it will turn people's heads away from using Amazon, but it will give companies that have been on the fence a lot of cause for pause. This will live on and on on the Web."

The biggest impact from the outage may be to the cloud itself, said Rob Enderle, an analyst with the Enderle Group.

"What will take a hit is the image of this technology as being one you can depend on, and that image was critically damaged today," he added. "If the outage continues for long, it could set back growth of this service years and permanently kill efforts by many to use this service in the future."

Given how high profile the outage has been, it may be hard to dismiss, especially for people who have to decide whether they're moving their enterprise to the cloud.

"This provides a massive showcase of the risk associated with these kinds of services, which are sold like utilities but don't yet have the reliability we expect of most utilities," said Enderle. "The impression being set today, if the outage continues, may take five to 10 years to fully recover from."


Sharon Gaudin covers the Internet and Web 2.0, emerging technologies, and desktop and laptop chips for Computerworld. Follow Sharon on Twitter at @sgaudin or subscribe to Sharon's RSS feed.
