- “And nobody expressed any concern that this might not be right?”
- “Bill, you are building a system, not a collection of parts.”
Steve Vinoski talking about
To achieve reliability, you have to accept the fact that failure will occur. Once you accept that, then other things fall into place: you need to be able to restart things quickly, and to do that, processes need to be cheap. If something fails, you don’t want it taking everything else with it, so you need to at least minimize, if not eliminate, sharing, which leads you to message passing. You also need monitoring capabilities that can detect failed processes and restart them.
Amen to that!
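
To make the chain of reasoning in that quote concrete, here is a minimal sketch in Python. It is only an illustration under stated assumptions: Python threads stand in for cheap processes, `queue.Queue` stands in for message passing, and the names `worker` and `supervise` (and the rule that negative numbers crash the worker) are hypothetical, not taken from the talk or from any real supervision library.

```python
import queue
import threading


def worker(inbox, outbox):
    """Handle messages until a None sentinel arrives.

    Hypothetical failure rule: a negative number crashes the worker.
    """
    while True:
        msg = inbox.get()
        if msg is None:
            return
        if msg < 0:
            raise ValueError(f"cannot handle {msg}")
        outbox.put(msg * 2)


def supervise(messages, max_restarts=5):
    """Run a worker, detect crashes, and restart it.

    The worker shares no state with the supervisor; the only contact
    points are the two queues. Returns (results, restart_count).
    """
    inbox, outbox = queue.Queue(), queue.Queue()
    for message in messages:
        inbox.put(message)
    inbox.put(None)  # sentinel: no more work

    restarts = 0
    while True:
        crashes = []

        def run():
            # Catch the failure inside the thread so the supervisor can see it.
            try:
                worker(inbox, outbox)
            except Exception as exc:
                crashes.append(exc)

        thread = threading.Thread(target=run)
        thread.start()
        thread.join()

        if not crashes:
            break  # worker finished normally
        restarts += 1
        if restarts > max_restarts:
            raise RuntimeError("worker keeps failing, giving up")

    results = []
    while not outbox.empty():
        results.append(outbox.get())
    return results, restarts


if __name__ == "__main__":
    print(supervise([1, -1, 2, -3, 4]))
```

The message that triggers a crash is dropped rather than retried, so one bad input cannot take the rest of the work down with it. A real system would likely also log the failure and cap the restart rate over time rather than by a simple count.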