This comprehensive treatment of fluid diffusion and manyserver scaling applies queueing networks and largescale asymptotics to model and solve core problems such as scheduling in semiconductor wafer fabs matching Uber drivers to passengers routing patient flow in emergency rooms and load balancing in cloud computing. Includes 330 exercises.