Load balancing with random job arrivals

Cluster administration techniques, akin to Google’s Borg, run lots of of 1000’s of jobs throughout tens of 1000’s of machines with the aim of attaining excessive utilization by way of efficient load balancing, environment friendly process placement, and machine sharing. Load balancing is the method of distributing community visitors or computational workloads throughout a number of servers or computing sources, and it is likely one of the most crucial elements of a contemporary cluster administration system. Efficient load balancing is crucial to bettering the efficiency, robustness and scalability of the system.

Within the classical formulation of the net load balancing downside, computational jobs arrive one-by-one and, as quickly as a job arrives, it should be assigned to one in every of a number of machines. Every job might impose totally different processing hundreds on totally different machines, and the load incurred by a machine is dependent upon the roles which can be assigned to it. The aim of a load balancing algorithm is to reduce the utmost load on any machine. On-line algorithms are these designed for conditions the place the enter to the system is revealed to the algorithm piece by piece.

On-line issues are frequent in decision-making eventualities which have uncertainty, together with the ski-rental downside, secretary downside, caching and scheduling issues, and plenty of others. Scheduling and cargo balancing questions are prevalent in useful resource administration for large-scale techniques resulting in analysis into many real-world scheduling issues, together with sustaining constant allocation of purchasers to servers and, extra just lately, platforms for AI workloads. Historically, on-line algorithms for scheduling and cargo balancing are studied by means of the lens of aggressive evaluation. The aggressive ratio of an internet algorithm quantifies the worst-case efficiency of the algorithm relative to an optimum offline algorithm that is aware of future jobs, particularly by figuring out the worst-case ratio of the price incurred by the 2 algorithms over all doable sequences of jobs.

In “On-line Load and Graph Balancing for Random Order Inputs”, introduced at SPAA 2024, we research the aggressive ratio of on-line load balancing issues when jobs arrive in uniformly random order (i.e., when every doable permutation of job arrival sequences is equally probably). We present new limitations on how nicely deterministic on-line algorithms can carry out on this setting.