Researchers improve cloud computing reliability

October 18, 2011. By David
Grazed from EE Times India.  Author:  Editorial Staff.

A group of Cornell University researchers, with the support of NSF and Microsoft, is developing methods to improve the reliability of cloud computing and is testing its work on a piece of a real cloud.

Cloud computing is a cheaper alternative for quickly storing and processing vast amounts of data. However, as cloud computing applications become larger, the potential for system errors also grows. A Cornell University research team, composed of Ken Birman, professor of computer science; Hakim Weatherspoon, assistant professor of computer science; and research associate Robbert van Renesse, is working on strengthening cloud reliability in hopes of minimising these errors…

The study is supported by the National Science Foundation (NSF) and Microsoft Corp. NSF is providing the team with a two-year Rs.1.66 crore ($370,000) grant, while Microsoft is giving the researchers access to Windows Azure, its cloud computing platform, which provides on-demand computing and storage through Microsoft data centres. Together, these will enable the team to test its ideas in challenging environments. The group is also working with storage systems maintained by the Internet Archive.

Cloud computing refers to farming out data storage and computing to huge data centres accessed over the Internet. Economies of scale allow cloud providers to offer services at a fraction of what it would cost users to set up and maintain their own server farms. And because data centres can be located near power sources like hydroelectric plants, cloud computing often promises to be eco-friendly.

Cloud systems make many copies of data or applications to give multiple users quick results. But replication can introduce errors, and avoiding those errors slows down the system. If a user is changing a bit of data, others have to be locked out of all copies of that data until the change is completed; and then the change has to be propagated to all the duplicates. So rather than making customers wait, many managers simply accept the risks of stale data in return for speed and low cost.
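The trade-off described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the researchers' software: the class ReplicatedValue and all of its method names are hypothetical, and a single in-process lock stands in for locking every copy across a data centre.

```python
import threading


class ReplicatedValue:
    """Toy model of one data item copied across several replicas (hypothetical)."""

    def __init__(self, initial, copies=3):
        self.replicas = [initial] * copies
        # One lock stands in for locking every copy of the data.
        self._lock = threading.Lock()

    def write_consistent(self, value):
        # Lock everyone out, then propagate the change to all duplicates.
        with self._lock:
            for i in range(len(self.replicas)):
                self.replicas[i] = value

    def read_consistent(self, replica):
        # Readers wait until any in-progress change has reached every copy.
        with self._lock:
            return self.replicas[replica]

    def write_fast(self, value, replica=0):
        # Update a single copy and return at once; other copies go stale.
        self.replicas[replica] = value

    def read_fast(self, replica):
        # No waiting, but the answer may be out of date.
        return self.replicas[replica]

    def sync(self, source=0):
        # Later background propagation from one copy to all the others.
        with self._lock:
            self.replicas = [self.replicas[source]] * len(self.replicas)


item = ReplicatedValue("v1")
item.write_fast("v2")        # fast path: only replica 0 sees the change
stale = item.read_fast(2)    # another user still reads the old value
item.sync()                  # copies converge only after propagation
fresh = item.read_fast(2)
```

In this sketch the fast path never blocks but lets a reader see the old value until sync runs, while the consistent path makes every caller wait for the lock. That is the speed-versus-staleness choice the paragraph above describes.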

The result, the Cornell researchers say, is a system that is "inconsistent by design." But consistency is vital in managing medical records, the electric power grid or air traffic control.

The new research will be more about engineering than discovery. "It’s not that we don’t have a pretty good idea how to do it," Birman said. But testing a system on a large scale will demonstrate to the industry that such an approach is practical, he said.

One way to speed replication is multi-casting, in which a computer sends a data change to many server addresses at once, rather than reeling off the addresses one after another. Cloud services have avoided multi-casting because it can confuse routers, which may send the same message to every computer in the system instead of just those meant to receive it. Birman gets around this by consolidating addresses into groups, somewhat the way an email programme combines several addresses into a single alias. To avoid errors, Birman locks replicas holding out-of-date data out of the working group, to be restarted and reinitialised later.
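The grouping idea can be sketched in a few lines. This is an illustration only, under loud assumptions: Replica, ReplicaGroup, multicast and rejoin are hypothetical names, not Birman's API, and the loop merely simulates what on a real network would be a single send to a group address.

```python
class Replica:
    """One copy of the data on one server (hypothetical model)."""

    def __init__(self, name):
        self.name = name
        self.version = 0
        self.data = None
        self.reachable = True

    def deliver(self, version, data):
        # Returns False when the update cannot be applied.
        if not self.reachable:
            return False
        self.version, self.data = version, data
        return True


class ReplicaGroup:
    """A single alias standing for many replica addresses."""

    def __init__(self, alias, members):
        self.alias = alias
        self.members = list(members)
        self.suspended = []
        self.version = 0
        self.data = None

    def multicast(self, data):
        # The sender addresses the alias once; this loop simulates delivery.
        self.version += 1
        self.data = data
        current = []
        for replica in self.members:
            if replica.deliver(self.version, data):
                current.append(replica)
            else:
                # Out-of-date replicas are locked out of the working group.
                self.suspended.append(replica)
        self.members = current
        return len(current)

    def rejoin(self, replica):
        # Restart and reinitialise a suspended replica from the latest state.
        self.suspended.remove(replica)
        replica.reachable = True
        replica.deliver(self.version, self.data)
        self.members.append(replica)


a, b, c = Replica("a"), Replica("b"), Replica("c")
group = ReplicaGroup("records", [a, b, c])
group.multicast("x=1")       # all three replicas apply the change
b.reachable = False          # replica b misses the next update
group.multicast("x=2")       # b is suspended rather than left serving stale data
group.rejoin(b)              # b comes back with the current state
```

The design choice shown here is that a replica that misses an update is removed from the group instead of being allowed to answer with old data. It returns only after being brought up to the latest version.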

Meanwhile, Weatherspoon is working to speed data transmission over the fibre-optic lines that connect servers, and to design better ways to organise stored data. Since using the cloud is partly about saving energy, he hopes to distribute data in ways that make it possible to spin down large groups of disk storage units when they’re not needed, saving both the power to drive the disks and the air conditioning to cool them.

Van Renesse focuses on security. He believes that so-called Byzantine security systems that are absolutely impenetrable can be scaled up to cloud size and still work—perhaps not as fast as insecure systems, but fast enough to be useful.

"We can offer much stronger guarantees with pretty comparable speed," Birman concluded. "Not a radical breakthrough, just good engineering. We’ll be giving out free software."