Adaptive and Learning-Driven Resource Optimization Framework for Scalable Cloud Computing Systems
Main Article Content
Abstract
With the rapid development of cloud computing technologies in 2024, large-scale distributed systems face increasing challenges in resource allocation, latency control, and energy efficiency. This paper proposes a scalable and adaptive resource optimization framework that integrates deep learning-based prediction models with dynamic scheduling strategies. By leveraging workload forecasting and real-time feedback mechanisms, the proposed framework improves resource utilization and system stability. Experimental results demonstrate that the framework achieves significant improvements in latency reduction and energy efficiency compared with conventional approaches. The proposed method provides a robust solution for modern cloud environments characterized by high variability and heterogeneous workloads.
Article Details
Issue
Section
References
[1]Z. Zhu, Y. Yan, R. Xu, Y. Zi, and J. Wang, "Attention-Unet: A deep learning approach for fast and accurate segmentation in medical imaging," Journal of Computer Science and Software Applications, vol. 2, no. 4, pp. 24-31, 2022.
[2]Y. Deng, "Transfer methods for large language models in low-resource text generation tasks," 2024.
[3]R. Buyya, C. S. Yeo, S. Venugopal, J. Broberg, and I. Brandic, "Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility," Future Generation Computer Systems, vol. 25, no. 6, pp. 599-616, 2009.
[4]X. Sun, Y. Yao, X. Wang, P. Li, and X. Li, "AI-driven health monitoring of distributed computing architecture: Insights from XGBoost and SHAP," in Proc. 4th Int. Conf. Communication Technology and Information Technology (ICCTIT), 2024, pp. 480-484.
[5]Z. Qiu, "A multi-scale deep learning and uncertainty estimation framework for comprehensive anomaly detection in cloud environments," 2023.
[6]B. Barlocker and X. Yan, "Contrastive representation learning for anomaly detection in cloud-based backend services," Artificial Intelligence and Computing Innovations, vol. 1, no. 2, 2021.
[7]M. Armbrust et al., "A view of cloud computing," Communications of the ACM, vol. 53, no. 4, pp. 50-58, 2010.
[8]L. A. Barroso and U. Hölzle, "The case for energy-proportional computing," Computer, vol. 40, no. 12, pp. 33-37, 2007.
[9]Y. Ma, "Anomaly detection in microservice environments via conditional multiscale GANs and adaptive temporal autoencoders," 2024.
[10]Y. Xing, "Enhancing advertising recommendation performance via integrated causal inference and exposure bias correction," 2023.
[11]T. Wood, P. J. Shenoy, A. Venkataramani, and M. S. Yousif, "Black-box and gray-box strategies for virtual machine migration," in Proc. USENIX NSDI, 2007.
[12]S. Pan, T. Hu, S. Sun, J. Yuan, and J. Luo, "Help oneself in helping the others: The ecology of online support groups," in Proc. IEEE Big Data, 2019, pp. 2418-2427.
[13]A. M. Jones et al., "USC-DCT: A collection of diverse classification tasks," Data, vol. 8, no. 10, p. 153, 2023.
[14]J. Li, "LocateNet: Large multimodal models for text-guided object localization," 2024.
[15]A. Verma, P. Ahuja, and A. Neogi, "pMapper: Power and migration cost aware application placement in virtualized systems," in Proc. Middleware, 2008, pp. 243-264.
[16]R. N. Calheiros, R. Ranjan, A. Beloglazov, C. A. De Rose, and R. Buyya, "CloudSim: A toolkit for modeling and simulation of cloud computing environments," Software: Practice and Experience, vol. 41, no. 1, pp. 23-50, 2011.
[17]Y. Hu, "Autonomous agent architecture for complex tasks via hierarchical planning and language model reasoning," 2024.
[18]F. Chen, "AI-augmented anomaly detection via generative distribution modeling and uncertainty quantification in cloud systems," 2024.
[19]A. Beloglazov and R. Buyya, "Energy efficient resource management in virtualized cloud data centers," in Proc. IEEE/ACM CCGrid, 2010, pp. 826-831.
[20]F. Chen, "AI-augmented anomaly detection via generative distribution modeling and uncertainty quantification in cloud systems," Transactions on Computational and Scientific Methods, vol. 4, no. 11, 2024.
[21]A. Beloglazov, J. Abawajy, and R. Buyya, "Energy-aware resource allocation heuristics for efficient management of data centers," Future Generation Computer Systems, vol. 28, no. 5, pp. 755-768, 2012.
[22]B. Hindman et al., "Mesos: A platform for fine-grained resource sharing in the data center," in Proc. USENIX NSDI, 2011.
[23]C. Nie, "Representation learning with multi-task self-supervision for spatiotemporal time series forecasting," 2024.
[24]J. Lai, "Attention alignment under logical constraints for reliable financial statement reasoning," 2024.
[25]M. Schwarzkopf et al., "Omega: Flexible, scalable schedulers for large compute clusters," in Proc. EuroSys, 2013, pp. 351-364.
[26]C. Delimitrou and C. Kozyrakis, "Paragon: QoS-aware scheduling for heterogeneous datacenters," ACM SIGPLAN Notices, vol. 48, no. 4, pp. 77-88, 2013.
[27]C. Delimitrou and C. Kozyrakis, "Quasar: Resource-efficient and QoS-aware cluster management," ACM SIGPLAN Notices, vol. 49, no. 4, pp. 127-144, 2014.
[28]Y. Luan, "Long text classification with large language models via dynamic memory," 2024.
[29]J. Guo, "Balancing performance and efficiency in large language model fine-tuning," 2024.
[30]A. Verma et al., "Large-scale cluster management at Google with Borg," in Proc. EuroSys, 2015.
[31]B. Burns, B. Grant, D. Oppenheimer, E. Brewer, and J. Wilkes, "Borg, Omega, and Kubernetes," Communications of the ACM, vol. 59, no. 5, pp. 50-57, 2016.
[32]Z. Qiu, "Time series and graph structure fusion for anomaly detection in microservices," 2024.
[33]D. Wu, "Federated deep learning with contrastive representation for node state identification," 2024.
[34]D. Lo et al., "Heracles: Improving resource efficiency at scale," in Proc. ISCA, 2015, pp. 450-462.
[35]I. Gog et al., "Firmament: Fast, centralized cluster scheduling at scale," in Proc. USENIX OSDI, 2016, pp. 99-115.
[36]Z. Gong, X. Gu, and J. Wilkes, "PRESS: Predictive elastic resource scaling for cloud systems," in Proc. ICNSM, 2010, pp. 9-16.
[37]N. Roy, A. Dubey, and A. Gokhale, "Efficient autoscaling using predictive models," in Proc. IEEE Cloud, 2011, pp. 500-507.
[38]A. Gandhi, P. Dube, A. Karve, A. Kochut, and L. Zhang, "Adaptive, model-driven autoscaling for cloud applications," in Proc. ICAC, 2014, pp. 57-64.
[39]A. V. Papadopoulos et al., "PEAS: A performance evaluation framework for autoscaling strategies," ACM TOMPECS, vol. 1, no. 4, pp. 1-31, 2016.