Serverless Computing Optimization Strategies Using ML-Based Auto-Scaling and Event-Stream Intelligence for Low-Latency Enterprise Workloads

Authors

  • Parameswara Reddy Nangi Independent Researcher, USA. Author
  • Chaithanya Kumar Reddy Nala Obannagari Independent Researcher, USA. Author
  • Sailaja Settipi Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V5I3P113

Keywords:

Serverless Computing, Function-As-A-Service (Faas), Predictive Auto-Scaling, Event-Stream Intelligence, Event-Driven Architectures

Abstract

This paper presents a framework of optimization of serverless computing as a combination of machine learning-driven predictive auto-scaling and event-stream intelligence, to serve low-latency enterprise workloads. Conventional function-as-a-service (FaaS) systems tend to exhibit cold starts, bursty usage, and obscure resource assignment and all these come in the form of tail-latency violations to business-critical workloads like real-time analytics, digital payments, and IoT backends. To overcome these difficulties, the framework uses the past trace of workload patterns and the current telemetry to predict demand and preemptive warm instances and the best configuration parameters including concurrency limits and memory footprints. The architecture incorporates an event-stream intelligence layer, which is constantly scanning message queues, logs and domain events in order to spot anomalies, micro-bursts, and seasonality in traffic. These insights are combined with the predictions of the ML to adjust dynamically the scaling policies of serverless functions and supporting services with the cost and SLO constraints to a closed-loop controller. Synthetic benchmarks and enterprise-inspired case studies show that the experimental evaluation of p95 and p99 latency, a lower cold-start frequency and a better cost performance ratio are much better than reactive and threshold-based scaling baselines. These findings, to practitioners, underscore the effectiveness of utilizing predictive modeling alongside event-based observability to turn serverless platforms into robust substrates to support mission-critical, latency-sensitive applications in any modern enterprise as well as provide them with a blueprint to be systematically reused

Downloads

Download data is not yet available.

References

[1] Li, Z., Guo, L., Cheng, J., Chen, Q., He, B., & Guo, M. (2022). The serverless computing survey: A technical primer for design architecture. ACM Computing Surveys (CSUR), 54(10s), 1-34.

[2] Shafiei, H., Khonsari, A., & Mousavi, P. (2022). Serverless computing: a survey of opportunities, challenges, and applications. ACM Computing Surveys, 54(11s), 1-32.

[3] Anju Bhole, 2023. "Serverless Computing - Benefits And Challenges", ESP International Journal of Advancements in Computational Technology (ESP-IJACT) Volume 1, Issue 3: 149-156.

[4] Wen, J., Chen, Z., Jin, X., & Liu, X. (2023). Rise of the planet of serverless computing: A systematic review. ACM Transactions on Software Engineering and Methodology, 32(5), 1-61.

[5] Thatikonda, V. K. (2023). Serverless computing: advantages, limitations and use cases. European Journal of Theoretical and Applied Sciences, 1(5), 341-347.

[6] Das, S. S. (2023). Enterprise Event Hub: The Rise of Event Stream-Oriented Systems for Real-Time Business Decisions. JOURNAL OF ADVANCE AND FUTURE RESEARCH, 1(10), 1-7.

[7] Ai-Powered Cloud Automation: Enhancing AutoScaling Mechanisms Through Predictive Analytics And Machine Learning, International Journal of Creative Research Thoughts (IJCRT), Volume 10, Issue 6 June 2022. https://ijcrt.org/papers/IJCRT22A6978.pdf

[8] Sudharsanam, S. R., Namperumal, G., & Selvaraj, A. (2022). Integrating AI/ML workloads with serverless cloud computing: Optimizing cost and performance for dynamic, event-driven applications. Journal of Science & Technology, 3(3), 286-325.

[9] Singla, N. (2023). Assessing the Performance and Cost-Efficiency of Serverless Computing for Deploying and Scaling AI and ML Workloads in the Cloud. International Journal of Intelligent Systems and Applications in Engineering, 11, 618-630.

[10] Madupati, B. (2023). Serverless Architectures and Function-As-A-Service (Faas): Scalability, Cost Efficiency, And Security Challenges. Cost Efficiency, And Security Challenges (April 05, 2023).

[11] Bebortta, S., Das, S. K., Kandpal, M., Barik, R. K., & Dubey, H. (2020). Geospatial serverless computing: Architectures, tools and future directions. ISPRS International Journal of Geo-Information, 9(5), 311.

[12] Rajan, R. A. P. (2018, December). Serverless architecture-a revolution in cloud computing. In 2018 Tenth International Conference on Advanced Computing (ICoAC) (pp. 88-93). IEEE.

[13] Lorido-Botran, T., Miguel-Alonso, J., & Lozano, J. A. (2014). A review of auto-scaling techniques for elastic applications in cloud environments. Journal of grid computing, 12(4), 559-592.

[14] Verma, S., & Bala, A. (2021). Auto-scaling techniques for IoT-based cloud applications: a review. Cluster Computing, 24(3), 2425-2459.

[15] Stopford, B. (2018). Designing event-driven systems. O'Reilly Media, Incorporated.

[16] Imdoukh, M., Ahmad, I., & Alfailakawi, M. G. (2020). Machine learning-based auto-scaling for containerized applications. Neural Computing and Applications, 32(13), 9745-9760.

[17] Majid, A. Y., & Marin, E. (2023). A review of deep reinforcement learning in serverless computing: function scheduling and resource auto-scaling. arXiv preprint arXiv:2311.12839.

[18] Jiang, L., Rahman, P., & Nandi, A. (2018, May). Evaluating interactive data systems: Workloads, metrics, and guidelines. In Proceedings of the 2018 International Conference on Management of Data (pp. 1637-1644).

[19] Tirumala, D., Lampe, T., Chen, J. E., Haarnoja, T., Huang, S., Lever, G., ... & Wulfmeier, M. (2023). Replay across experiments: A natural extension of off-policy rl. arXiv preprint arXiv:2311.15951.

[20] Jonas, E., Schleier-Smith, J., Sreekanti, V., Tsai, C. C., Khandelwal, A., Pu, Q., ... & Patterson, D. A. (2019). Cloud programming simplified: A berkeley view on serverless computing. arXiv preprint arXiv:1902.03383.

[21] Adegboyega, A. (2017, May). Time-series models for cloud workload prediction: A comparison. In 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM) (pp. 298-307). IEEE.

[22] Bhat, J. (2022). The Role of Intelligent Data Engineering in Enterprise Digital Transformation. International Journal of AI, BigData, Computational and Management Studies, 3(4), 106–114. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I4P111

[23] Sundar, D. (2022). Architectural Advancements for AI/ML-Driven TV Audience Analytics and Intelligent Viewership Characterization. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(1), 124–132. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P113

[24] Jayaram, Y., & Sundar, D. (2022). Enhanced Predictive Decision Models for Academia and Operations through Advanced Analytical Methodologies. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(4), 113–122. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I4P113

[25] Bhat, J., & Sundar, D. (2022). Building a Secure API-Driven Enterprise: A Blueprint for Modern Integrations in Higher Education. International Journal of Emerging Research in Engineering and Technology, 3(2), 123–134. https://doi.org/10.63282/3050-922X.IJERET-V3I2P113

[26] Jayaram, Y., & Bhat, J. (2022). Intelligent Forms Automation for Higher Ed: Streamlining Student Onboarding and Administrative Workflows. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 100–111. https://doi.org/10.63282/3050-9246.IJETCSIT-V3I4P110

[27] Sundar, D., Jayaram, Y., & Bhat, J. (2022). A Comprehensive Cloud Data Lakehouse Adoption Strategy for Scalable Enterprise Analytics. International Journal of Emerging Research in Engineering and Technology, 3(4), 92–103. https://doi.org/10.63282/3050-922X.IJERET-V3I4P111

[28] Bhat, J., Sundar, D., & Jayaram, Y. (2022). Modernizing Legacy ERP Systems with AI and Machine Learning in the Public Sector. International Journal of Emerging Research in Engineering and Technology, 3(4), 104–114. https://doi.org/10.63282/3050-922X.IJERET-V3I4P112

[29] Sundar, D., & Jayaram, Y. (2022). Composable Digital Experience: Unifying ECM, WCM, and DXP through Headless Architecture. International Journal of Emerging Research in Engineering and Technology, 3(1), 127–135. https://doi.org/10.63282/3050-922X.IJERET-V3I1P113

[30] Jayaram, Y., Sundar, D., & Bhat, J. (2022). AI-Driven Content Intelligence in Higher Education: Transforming Institutional Knowledge Management. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(2), 132–142. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I2P115

[31] Bhat, J. (2023). Automating Higher Education Administrative Processes with AI-Powered Workflows. International Journal of Emerging Trends in Computer Science and Information Technology, 4(4), 147–157. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I4P116

[32] Jayaram, Y. (2023). Cloud-First Content Modernization: Migrating Legacy ECM to Secure, Scalable Cloud Platforms. International Journal of Emerging Research in Engineering and Technology, 4(3), 130–139. https://doi.org/10.63282/3050-922X.IJERET-V4I3P114

[33] Sundar, D. (2023). Serverless Cloud Engineering Methodologies for Scalable and Efficient Data Pipeline Architectures. International Journal of Emerging Trends in Computer Science and Information Technology, 4(2), 182–192. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I2P118

[34] Bhat, J., & Jayaram, Y. (2023). Predictive Analytics for Student Retention and Success Using AI/ML. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 121–131. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I4P114

[35] Jayaram, Y., & Sundar, D. (2023). AI-Powered Student Success Ecosystems: Integrating ECM, DXP, and Predictive Analytics. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(1), 109–119. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I1P113

[36] Bhat, J. (2023). Strengthening ERP Security with AI-Driven Threat Detection and Zero-Trust Principles. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 154–163. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I3P116

[37] Sundar, D., & Bhat, J. (2023). AI-Based Fraud Detection Employing Graph Structures and Advanced Anomaly Modeling Techniques. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(3), 103–111. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I3P112

[38] Jayaram, Y. (2023). Data Governance and Content Lifecycle Automation in the Cloud for Secure, Compliance-Oriented Data Operations. International Journal of AI, BigData, Computational and Management Studies, 4(3), 124–133. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V4I3P113

[39] Sundar, D. (2023). Machine Learning Frameworks for Media Consumption Intelligence across OTT and Television Ecosystems. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(2), 124–134. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I2P114

Published

2024-10-30

Issue

Section

Articles

How to Cite

1.
Reddy Nangi P, Reddy Nala Obannagari CK, Settipi S. Serverless Computing Optimization Strategies Using ML-Based Auto-Scaling and Event-Stream Intelligence for Low-Latency Enterprise Workloads. IJETCSIT [Internet]. 2024 Oct. 30 [cited 2025 Dec. 24];5(3):131-42. Available from: https://www.ijetcsit.org/index.php/ijetcsit/article/view/510

Similar Articles

51-60 of 344

You may also start an advanced similarity search for this article.