Tag: fault tolerance

Toward a Fault-Tolerant Cloud

Jun 23, 2011 |

With the proliferation of public cloud infrastructures, our dependability on them has increased. Many of our vital services pertaining to the research, industry or even lifestyle domain have been massively moved onto the cloud. Then, what happens when the cloud services we are depending on go down? Dr. Jose Luis Vazquez-Poletti shares some key aspects on how the scientific community can provide answers to this problem.

Read more…

Looking to Fault-Tolerant Software

Nov 9, 2010 |

Achieving workable software-based fault tolerance will require a fresh approach for developers.

Read more…

The Other Exascale Challenge

Jun 10, 2010 |

Supercomputing apps may have to ditch the checkpoint-restart model.

Read more…

Embrace Failure!

Apr 22, 2009 |

Can smart checkpoints and fault-resilient applications avert a Malthusian Catastrophe?

Read more…