recentpopularlog in

jabley : queue   39

How to trade off server utilization and tail latency
When running large scale systems, we strive to deliver both low tail latency and high
utilization of servers. However, these two dimenions are at odds: increasing the
average utilization of a system will have a detrimental impact on the tail latency.
This talk provides a light-weight walkthrough of the important basics of queueing
theory (avoiding unnecessary formalism), illustrates graphically several typical
outcomes of this analysis, and closes with a few basic rules on how to think about
utilization and tail latency.
filetype:pdf  slides  srecon  queue  model  code  server  scalability 
3 days ago by jabley

Copy this bookmark:





to read