Category Archives: Stuff just breaks

‘Cos the world is a nasty place


Steve Vinoski talking about reliability (in particular my favourite: Erlang)

To achieve reliability, you have to accept the fact that failure will occur. Once you accept that, then other things fall into place: you need to be able to restart things quickly, and to do that, processes need to be cheap. If something fails, you don’t want it taking everything else with it, so you need to at least minimize, if not eliminate, sharing, which leads you to message passing. You also need monitoring capabilities that can detect failed processes and restart them.

Amen to that!
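
For anyone who hasn't seen the pattern, here is a rough sketch of the three ideas in that quote: a worker that shares nothing and only talks through messages, a monitor that notices when it dies, and a restart. It is in Python rather than Erlang, so treat it as an illustration only: Python threads are neither as cheap nor as isolated as Erlang processes, and the names (`worker`, `run_monitored`, `supervise`, `crash_on`, `max_restarts`) are made up for this example, not taken from Vinoski or from OTP.

```python
import queue
import threading


def worker(inbox, outbox, crash_on):
    """Double each message from inbox; communicate only via queues."""
    while True:
        msg = inbox.get()
        if msg is None:  # sentinel: no more work, exit normally
            return
        if msg == crash_on:  # simulated failure (hypothetical trigger)
            raise RuntimeError(f"worker crashed on {msg!r}")
        outbox.put(msg * 2)


def run_monitored(inbox, outbox, events, crash_on):
    """Run the worker and report how it exited as a message."""
    try:
        worker(inbox, outbox, crash_on)
        events.put("normal")
    except Exception as exc:
        events.put(f"crashed: {exc}")


def supervise(jobs, crash_on=None, max_restarts=3):
    """Start a worker, restart it whenever it crashes, collect results."""
    inbox, outbox, events = queue.Queue(), queue.Queue(), queue.Queue()
    for job in jobs:
        inbox.put(job)
    inbox.put(None)

    restarts = 0
    while True:
        thread = threading.Thread(
            target=run_monitored, args=(inbox, outbox, events, crash_on)
        )
        thread.start()
        thread.join()  # wait for the worker to exit, one way or another
        if events.get() == "normal":
            break
        restarts += 1  # the worker died: start a fresh one
        if restarts > max_restarts:
            raise RuntimeError("too many restarts, giving up")

    results = []
    while not outbox.empty():
        results.append(outbox.get())
    return results, restarts
```

Calling `supervise([1, 2, 3, 4], crash_on=3)` returns `([2, 4, 8], 1)`: the worker dies on the third message, a replacement picks up the rest of the queue, and nothing else is taken down with it. The message that caused the crash is simply dropped here; whether to retry, log, or discard it is a design decision this sketch does not try to settle.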