Skip to main content

Reinforcement Learning for Dynamic Cloud Resource BI Optimization

Abstract

Dynamic cloud environments: These environments have a varying volume of data, query workload, number of users and/or business continuity objectives (SLOs) for Business Intelligence (BI) workloads. Due to these uncertainties, it will be impossible to balance the three important factors of performance, energy efficiency and cost with fixed rule and threshold based autoscaling approaches. In this research article we want to propose a framework for dynamically optimizing cloud resource BI based on Reinforcement Learning (RL) model who will continually interact with the cloud environment and learn a set of optimal policies for cloud resource allocation. This framework consists of five layers such as BI workload monitoring, state space construction, RL decision engine, resource orchestration and evaluation of performance feedback. Every system metric (such as CPU usage, memory usage, query response time, queue length, storage I/O, workload priority and cost constraints) is important. Those can be the scaling of virtual machines, moving containers around and changing memory sizes, prioritizing BI queries, moving workloads from node to node, etc., and are decided by the RL agent. The rewards can be specified as a multi-objective function, which include reduction of response time, reduction of cloud cost, resource utilization gain and SLA and energy consumption satisfaction. The proposed model is adaptive – it adapts itself based on the trends of the historical data or real-time data of BI workload. Algorithms like Q-learning, Deep Q-Networks and Proximal Policy Optimization could be potentially applied for an intelligent decision making in cloud based BI platforms, the study mentions. This framework can enhance the analytical responsiveness, minimise resources wastage and enable cost-aware business decision systems. It also helps businesses in planning for capacity with a pro-active assessment and hand over their real-time dashboards and predictive analytics services. Lastly, reinforcement learning can be leveraged for scaling and adapting for optimal BI workloads in a complex, dynamic and data rich cloud environment.

References

[1] P. Mishra, M. Putrevu, G. R. Ganger, and A. Slominski, “Optimizing Cloud Workloads: Autoscaling with Reinforcement Learning,” IBM Research, 2024. [Online]. Available: https://research.ibm.com/publications/optimizing-cloud-workloads-autoscaling-with-reinforcement-learning
[2] Amazon Web Services, “PERF02-BP05: Scale your compute resources dynamically,” AWS Well-Architected Framework. [Online]. Available: https://docs.aws.amazon.com/wellarchitected/latest/performance-efficiency-pillar/perf_compute_hardware_scale_compute_resources_dynamically.html
[3] T. Cui, R. Yang, C. Fang, and S. Yu, “Deep reinforcement learning-based resource allocation for content distribution in IoT-edge-cloud computing environments,” Symmetry, vol. 15, no. 1, 217, 2023.
[4] K. Saidi and D. Bardou, “Task scheduling and VM placement to resource allocation in cloud computing: challenges and opportunities,” Cluster Computing, 2023.
[5] Y. Xu and A. H. Mohammed, “An energy-aware resource management method in cloud-based internet of things using a multi-objective algorithm and crowding distance,” Telecommunication Systems, vol. 34, no. 1, e4673, 2023.
[6] M. I. Khaleel, “Hybrid cloud-fog computing workflow application placement: joint consideration of reliability and time credibility,” Multimedia Tools and Applications, 2023.
[7] K. K. Gola et al., “Multi-objective hybrid capuchin search with genetic algorithm based hierarchical resource allocation scheme with clustering model in cloud computing environment,” Concurrency and Computation: Practice and Experience, vol. 35, no. 7, e7606, 2023.
[8] A. Amini Motlagh, A. Movaghar, and A. M. Rahmani, “A new reliability-based task scheduling algorithm in cloud computing,” International Journal of Communication Systems, vol. 35, no. 3, e5022, 2022.
[9] D. Parfenov, I. Bolodurina, L. Kuznetsova, L. Zabrodina, and N. Yanishevskaya, “Application of bioinspired methods for solving the problem of resource allocation in cloud platforms,” in 2021 International Conference on Information Technology and Nanotechnology (ITNT), pp. 1–7, IEEE, 2021.
[10] P. Gupta et al., “Hybrid Whale optimization algorithm for resource optimization in cloud e-healthcare applications,” Computers, Materials & Continua, vol. 71, no. 3, pp. 5381–5396, 2022.
[11] P. Durgadevi and S. Srinivasan, “Resource allocation in cloud computing using SFLA and cuckoo search hybridization,” International Journal of Parallel Programming, vol. 48, pp. 549–565, 2020.
[12] S. K. Kayalvili and M. Selvam, “Hybrid SFLA-GA algorithm for an optimal resource allocation in cloud,” Cluster Computing, vol. 22, Suppl. 2, pp. 3165–3173, 2019.