- “And nobody expressed any concern that this might not be right?”
- “Bill, you are building a system, not a collection of parts.”
Steve Vinoski talking about reliability:
“To achieve reliability, you have to accept the fact that failure will occur. Once you accept that, other things fall into place: you need to be able to restart things quickly, and to do that, processes need to be cheap. If something fails, you don’t want it taking everything else with it, so you need to at least minimize, if not eliminate, sharing, which leads you to message passing. You also need monitoring capabilities that can detect failed processes and restart them.”
Amen to that!
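Vinoski’s recipe (cheap isolated processes, message passing instead of sharing, and a monitor that restarts whatever fails) can be sketched in a few lines of Go. This is my own toy illustration of the pattern, not code from any of the linked posts; the `runIsolated` and `supervise` names are invented for the example:

```go
package main

import "fmt"

// runIsolated runs work in its own goroutine so a crash (panic) there
// cannot take the rest of the program down; the outcome comes back as
// a message on a channel rather than through shared state.
func runIsolated(work func()) (failed bool) {
	done := make(chan bool)
	go func() {
		defer func() { done <- recover() != nil }()
		work()
	}()
	return <-done
}

// supervise plays the monitor: it restarts work each time it crashes
// and returns how many attempts were needed before it succeeded.
func supervise(work func(attempt int)) int {
	attempts := 0
	for {
		attempts++
		n := attempts
		if !runIsolated(func() { work(n) }) {
			return attempts
		}
		fmt.Println("worker crashed, restarting")
	}
}

func main() {
	// A worker that crashes on its first two runs, then succeeds.
	n := supervise(func(attempt int) {
		if attempt < 3 {
			panic("simulated crash")
		}
	})
	fmt.Println("succeeded after", n, "attempts")
}
```

The point is exactly the one in the quote: because the worker is cheap and shares nothing, the supervisor can treat failure as routine and just respawn it.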
- Towards Robust Distributed Systems – Eric Brewer’s slide deck that kicked off the whole CAP Theorem debate at PODC in 2000.
- Erlang Meeting – “How do you test large systems written in Erlang?” on Monday (I presume) Jan 21.
- Map Reduce a major step back – Michael Stonebraker and David J. DeWitt let forth on why they dislike Google’s MapReduce infrastructure and how it is a poor relation to standard relational database technology. Relational Database Experts Jump The MapReduce Shark – a reply to Stonebraker and DeWitt that I pretty broadly agree with.
- MapReducing 20 petabytes per day – Greg Linden commenting on the recent paper “MapReduce: Simplified Data Processing on Large Clusters” by Jeff Dean and Sanjay Ghemawat.
Werner has a new piece on
I am keen to see how the DB heads respond to this. I also want to see how long it takes for people to get their heads around the
- Interesting job spec from
My favourite line: “You have at least once tried to understand Paxos by reading the original paper.” It all sounds like ancient Greek to me!
It’s always nice to get an eBay perspective on things – especially when you totally agree with what is being said. For a long time I have felt that if we really want to embrace the power of large scale distributed systems we have to accept that we just do not know what is going on – life is non-deterministic, messy, chaotic and generally random.

Most tech orientated people are used to thinking in a single linear fashion: first the processor does this, then it does that. Nice – but not the way the world works. Things actually happen at the same time, just not necessarily on your processor or in your address space. For many people dealing with that is a problem and something that should be somehow hidden away or, at the very least, forced into a two-phase commit (2PC) harness. That might work in a small system but not when you reach internet-scale systems. Forget all about a linear world and move to thinking about multiple agents doing multiple things at the same time.
This is a thought process that is going to increasingly dominate the way we build systems.