Once again, Netflix shows how to avoid a cloud meltdown

October 30, 2012 By David

Grazed from GigaOM.  Author: Barb Darrow.

Streaming media powerhouse Netflix says its experience with Amazon Web Services outages led to best practices and technology that can insulate Netflix — and potentially other companies — from the impact of weather-related and other events.

As data centers struggle to fend off or repair the effects of superstorm Sandy, Netflix says lessons it learned from past Amazon Web Services outages helped it dodge a bullet last week when Amazon’s US East data center complex went down again. Other companies that have been impacted by cloud outages might be able to apply these lessons as well…

Netflix first noted issues with AWS US East at 8:30 a.m. EDT last Monday. The problem showed up as a network issue rather than a problem with Amazon’s Elastic Block Store (EBS) service, which caused initial confusion, according to the blog post entitled “A Post-mortem of October 22, 2012 AWS degradation.” According to Netflix Reliability Architect Jeremy Edberg and Director of Cloud Solutions Ariel Tseitlin:

“When we were able to narrow down the network issue to a single zone, Amazon was also able to confirm that the degradation was limited to a single Availability Zone. Once we learned the impact was isolated to one AZ, we began evacuating the affected zone.”…

Read more from the source @ http://gigaom.com/cloud/once-again-netflix-shows-how-to-avoid-a-cloud-meltdown/