Google, Microsoft Weather Cloud Computing Outages

September 10, 2011 Off By David
Grazed from eWeek.  Author: Clint Boulton.

Google and Microsoft both watched their cloud computing systems choke this past week, with Google Docs going dark for an hour and Microsoft Hotmail, Office 365 and SkyDrive knocked offline for three hours.

It was not a banner week for cloud computing. Cloud computing rivals Google (NASDAQ:GOOG) and Microsoft (NASDAQ:MSFT) suffered brief outages that knocked off some of their key messaging and collaboration products…

Google Sept. 7 saw its Google Docs word collaboration application cramp up for one hour, shutting out millions of users from their document lists, documents, drawings and Apps Scripts. Microsoft, meanwhile, watched its online services, including Hotmail, SkyDrive and Office 365 software, go kaput for three hours Sept. 8.

Google’s outage was caused by a memory management bug software engineers triggered in a change designed to "improve real time collaboration within the document list," the company explained in a corporate blog post.

That’s the simple explanation. Google Engineering Director Alan Warren provided more technical detail for the outage in the blog post, noting:

    "Every time a Google Doc is modified, a machine looks up the servers that need to be updated. Due to the memory management bug, the lookup machines didn’t recycle their memory properly after each lookup, causing them to eventually run out of memory and restart. While they restarted, their load was picked up by the remaining lookup machines—making them run out of memory even faster."

Pounded by such heavy usage, the servers couldn’t effectively process most of the requests to access document lists, documents, drawings and scripts. Thus, the outage occurred. Warren said his team has taken steps to eliminate the chance of a similar event in the future.

Microsoft’s outage was more serious. Beginning around 9:30 PDT Sept. 8, the company’s Hotmail, SkyDrive and Office 365 services went down, owing to a Domain Name System (DNS) issue. The DNS converts domain names into numerical IP addresses to route Internet traffic worldwide.

"We believe service restored for all Office365," Microsoft tweeted on its Office 365 Twitter account. "If customers are still having issues, please let us know. Thanks for your patience."

No outages are good, but Microsoft’s was clearly worse and must seem like insult to injury after Office 365 (along with Microsoft CRM) went dark in August attributed to a networking issue.

Office 365—the Web-based version of the on-premises Office suite, only launched in late June, so its user base is limited to early adopters. However, hundreds of millions of users access Hotmail, the No. 2 Webmail app behind Yahoo.

Google, which has 40 million users for its Google Apps suite, of which Docs is an integral component, sees it programs go down a few times a year. However, most of this outages affect its Gmail Webmail service. Gmail last went down in late February for around 30,000 users.