ACC

Article http://dx.doi.org/10.26855/acc.2025.10.012

Research on Elastic Service Scaling Methods for Generative AI Applications Under Cloud-native Architectures

TOTAL VIEWS: 516

Hongyan Qu

King Graduate School, Monroe University, New Rochelle, NY 10801, USA.

*Corresponding author: Hongyan Qu

Published: October 28,2025

Abstract

This study focuses on the operational characteristics of generative AI applications under cloud-native architectures, analyzing their computational intensity and fluctuating load patterns, and exploring the adaptability and necessity of elastic service scaling. By reviewing the theoretical foundations, the research further investigates key techniques such as auto-scaling and resource scheduling, constructing an elastic scaling framework suited for cloud-native environments. Combined with load-aware dynamic scaling methods and incorporating containerization, microservices, and service mesh mechanisms, feasible strategic pathways are proposed. Experiments conducted in a representative environment evaluate and compare different scaling schemes, showing that the proposed strategies enhance resource utilization and system stability. The findings indicate that integrating cloud-native architectures with generative AI applications offers significant advantages and practical value in elastic service scaling.

Keywords

Cloud-native architecture; Generative AI; Elastic services; Auto-scaling

References

[1] Kotama DNI, Funabiki N, Panduman FYY, et al. Implementation of Sensor Input Setup Assistance Service Using Generative AI for SEMAR IoT Application Server Platform. Information. 2025;16(2):108.

[2] Panduman FYY, Husna R, Noprianto, et al. An Application of SEMAR IoT Application Server Platform to Drone-Based Wall Inspection System Using AI Model. Information. 2025;16(2):91.

[3] MinIO and F5 to Enhance AI Workloads with High-Performance Object Storage and Distributed Application Services. M2 Press-wire. 2024.

[4] Skandali D, Magoutas A, Tsourvakas G. Consumer Behaviour on AI Applications for Services: Measuring the Impact of Value-Based Adoption Model on Luxurious AI Resorts’ Applications. Rev Mark Sci. 2024;22(1):57-85.

[5] Kumar Y, Lin M, Paredes C, et al. A Comprehensive Review of AI Advancement Using testFAILS and testFAILS-2 for the Pursuit of AGI. Electronics. 2024;13(24):4991.

[6] Sela L, Sowby BR, Salomons E, et al. Making waves: The potential of generative AI in water utility operations. Water Res. 2025;272:122935.

[7] Elastic Releases Its Elastic Cloud Serverless Powered by Search AI Lake. Manuf Close-Up. 2024.

[8] Plevris V, Papazafeiropoulos G. AI in Structural Health Monitoring for Infrastructure Maintenance and Safety. Infrastructures. 2024;9(12):225.

How to cite this paper

Research on Elastic Service Scaling Methods for Generative AI Applications Under Cloud-native Architectures

How to cite this paper: Hongyan Qu. (2025) Research on Elastic Service Scaling Methods for Generative AI Applications Under Cloud-native Architectures. Advances in Computer and Communication6(4), 224-229.

DOI: http://dx.doi.org/10.26855/acc.2025.10.012