Friday, June 29, 2012

Chaos Monkey

This is old, but I hadn't heard of the technique before: randomly shutting down parts of a service to check that it can actually recover from failures.
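To make the idea concrete, here is a minimal toy sketch in Python. It is not the real tool: the `Service` class, the instance names, and the `heal` step are all hypothetical stand-ins for whatever recovery mechanism a real system would have (restarts, auto-scaling and so on). It only shows the loop itself: pick a running instance at random, kill it, then check that the service gets back to a healthy state.

```python
import random


class Service:
    """Toy service made of named instances (all names are hypothetical)."""

    def __init__(self, names, desired):
        self.desired = desired          # how many instances should be running
        self.running = set(names)       # instances currently alive
        self._next_id = len(names)      # counter used to name replacements

    def kill(self, name):
        # Simulate an instance failure.
        self.running.discard(name)

    def heal(self):
        # Stand-in for real recovery: start replacements until the
        # desired instance count is reached again.
        while len(self.running) < self.desired:
            self.running.add(f"instance-{self._next_id}")
            self._next_id += 1

    def healthy(self):
        return len(self.running) >= self.desired


def chaos_round(service, rng):
    """Terminate one randomly chosen running instance and return its name."""
    victim = rng.choice(sorted(service.running))
    service.kill(victim)
    return victim


rng = random.Random(2012)  # fixed seed so the run is repeatable
svc = Service(["instance-0", "instance-1", "instance-2"], desired=3)

for _ in range(5):
    victim = chaos_round(svc, rng)
    assert victim not in svc.running   # the failure really happened
    svc.heal()
    assert svc.healthy()               # the service recovered
```

In a real deployment the interesting part is that final assertion: if the service does not come back on its own, you have found a weakness on your own schedule rather than during an actual outage.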