Friday, March 20, 2009

Patching the cloud - Azure failure

Hoff posted some nice comments on the Azure's failure regarding patching the infrastructure used by cloud services. An interesting conclusion about it is that future patching mechanisms will have to be integrated to VMotion-like features, in a way that when you apply an OS patch to the infrastructure it can dynamically deal with that without disrupting the service. It would be something like this:

  1. Move the virtualized hosts from one server to the others

  2. Patch it the "idle" server

  3. Check if it comes back properly

  4. Gradually puts back the load on that server and checks if there is any impact from the patch

  5. If everything is ok, go back to step #1 for the next server - repeat until all servers are patched
I wonder if the guys from Microsoft Update are talking with the Azure team - big challenge for team integration ahead, and business opportunity for patch management companies.