Visible to the public A Trace-Based Performance Study of Autoscaling Workloads of Workflows in Datacenters

TitleA Trace-Based Performance Study of Autoscaling Workloads of Workflows in Datacenters
Publication TypeConference Paper
Year of Publication2018
AuthorsVersluis, L., Neacsu, M., Iosup, A.
Conference Name2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)
Keywordsallocation policies, Analytical models, application domains, autoscaler, autoscaling, cloud computing, computational complexity, computer centres, customer experience, Data models, datacenter operators, datacenters, elasticity metrics, human factors, Measurement, memory related autoscaling complexity metrics, Monitoring, operational behavior, provisioning policies, pubcrawl, real world traces, resilience, Resiliency, resource allocation, Resource management, resources, Scalability, scheduling, simulation, Task Analysis, time related autoscaling complexity metrics, trace based simulation, work factor metrics
AbstractTo improve customer experience, datacenter operators offer support for simplifying application and resource management. For example, running workloads of workflows on behalf of customers is desirable, but requires increasingly more sophisticated autoscaling policies, that is, policies that dynamically provision resources for the customer. Although selecting and tuning autoscaling policies is a challenging task for datacenter operators, so far relatively few studies investigate the performance of autoscaling for workloads of workflows. Complementing previous knowledge, in this work we propose the first comprehensive performance study in the field. Using trace-based simulation, we compare state-of-the-art autoscaling policies across multiple application domains, workload arrival patterns (e.g., burstiness), and system utilization levels. We further investigate the interplay between autoscaling and regular allocation policies, and the complexity cost of autoscaling. Our quantitative study focuses not only on traditional performance metrics and on state-of-the-art elasticity metrics, but also on time-and memory-related autoscaling-complexity metrics. Our main results give strong and quantitative evidence about previously unreported operational behavior, for example, that autoscaling policies perform differently across application domains and allocation and provisioning policies should be co-designed.
DOI10.1109/CCGRID.2018.00037
Citation Keyversluis_trace-based_2018