CODEC: Cost-Effective Duration Prediction System for Deadline Scheduling in the Cloud
- Haozhe Li ,
- Ming-Jie Ma ,
- Yudong Liu ,
- Si Qin ,
- Bo Qiao ,
- Randolph Yao ,
- Harshwardhan Chaturvedi ,
- Tri Tran ,
- Murali Chintalapati ,
- S. Rajmohan ,
- Qingwei Lin 林庆维 ,
- Dongmei Zhang
2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE) | , pp. 298-308
Modern cloud platforms allow customers to flexibly allocate or release computing resources. One crucial scenario is how to drive existing VMs to a specific state by a given deadline in a reliable and cost-effective manner. These state transition requests could involve starting or reallocating VMs. Performing millions of these requests per day can be challenging because the throughput of cloud platforms is not consistently deterministic. To meet customer service level agreements, cloud providers often trade-off cost for reliability and use oversized estimates to ensure that they initiate state transitions against VMs well ahead of time. In this paper, we propose a COst-effective Duration prEdiCtion system (CODEC) that solves the deadline scheduling problem for cloud providers by building intelligent automation to discover and execute optimal strategies when performing concurrent requests. In the CODEC, the core is to categorize durations of requests into buckets in real-time and the buffer time of each bucket is then predicted by extreme value theory. Extensive experiments show the proposed approach can guarantee the success rate of requests, while significantly saving cost with 38.71% and 86.97% compared to the baseline approach and the static buffer time, respectively. The results obtained show that CODEC can effectively and efficiently solve the deadline scheduling problem, and lead a worldwide cloud provider to deploy its prototype.