This paper studies load balancing algorithms in cloud computing, with the aim of improving response times at both the physical and virtual server levels. It proposes a load balancing algorithm at the virtual machine level that is designed to reduce average response time and average processing time, and compares it with existing algorithms such as Max-Min and Throttled; in that comparison the proposed algorithm performs better. The paper covers the theoretical foundation, the algorithm design, and the simulation results used to validate the proposed method.