Nailing down bugs in distributed systems

Speaker: Kamil Szymański

Finding bugs in distributed systems is challenging.

Finding bugs in production distributed systems is even harder because of time pressure, especially if the bug is on the critical path.

At times like that you can't afford to guess what's wrong; you have to make decisions based on real data.

In this slides-free session we will see how we can use live data to discover bugs more quickly and find their root causes more easily.

Moreover, we will discuss how to deal with production outages and recover from them quickly.