Load balancing can be defined as disseminating workloads among multiple computer systems so that more work gets done in the given amount of time.Such computer resources can be a computer cluster, network links, central processing units or disk drives.With application load balancing across multiple servers, it prevents any application server from becoming a single point of failure, thereby augmenting overall availability and responsiveness.For instance, when one application server becomes unavailable due to any problems, a load balancer simply puts all the application requests to another server available in the pool.
Load balancing technique improves the server utilization and maximizes availability. It is the most straightforward method of distributing the traffic load among multiple computing devices and scaling out an application server infrastructure. As the new application demand increases, new resources can be added to the resource pool and the load balancer will immediately start sending traffic to the newly available server. Load balancing is highly focused on optimizing resource use, minimizing resource-time, maximize throughput, and eliminating the overloading of any single resource. On the internet, companies whose websites are generating a great amount of web traffic, generally, use load balancing to improve availability and maximizing server utilization.