IEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. XX,

2y ago

32 Views

2 Downloads

1.82 MB

11 Pages

Last View : 14d ago

Last Download : 3m ago

Upload by : Axel Lin

Report this link

Download PDF

Transcription

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI10.1109/TCC.2014.2350475, IEEE Transactions on Cloud ComputingIEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. XX, NO. YY, MONTH 20141Workload Prediction Using ARIMA Model and ItsImpact on Cloud Applications’ QoSRodrigo N. Calheiros, Enayat Masoumi, Rajiv Ranjan, Rajkumar BuyyaAbstract—As companies shift from desktop applications to Cloud-based Software as a Service (SaaS) applications deployed onpublic Clouds, the competition for end-users by Cloud providers offering similar services grows. In order to survive in such acompetitive market, Cloud-based companies must achieve good Quality of Service (QoS) for their users, or risk losing their customersto competitors. However, meeting the QoS with a cost-effective amount of resources is challenging because workloads experiencevariation over time. This problem can be solved with proactive dynamic provisioning of resources, which can estimate the future needof applications in terms of resources and allocate them in advance, releasing them once they are not required. In this paper, we presentthe realization of a Cloud workload prediction module for SaaS providers based on the Autoregressive Integrated Moving Average(ARIMA) model. We introduce the prediction based on the ARIMA model and evaluate its accuracy of future workload prediction usingreal traces of requests to web servers. We also evaluate the impact of the achieved accuracy in terms of efficiency in resource utilizationand QoS. Simulation results show that our model is able to achieve an average accuracy of up to 91%, which leads to efficiency inresource utilization with minimal impact on the QoS.Index Terms—Cloud Computing; Workload Prediction; ARIMA.F1I NTRODUCTIONCLOUD computing [1] has evolved from a set ofpromising virtualization and data center technologies to a consolidated paradigm for delivery of computing as a service to end customers, which pay for suchservices according to its use, likewise utilities such aselectricity, gas, and water. Adoption of the technologyby enterprises is growing fast, and so is the number ofCloud-based companies offering Cloud-based solutionsfor end users.The shift from desktop applications to public Cloudhosted Software as a Service (SaaS) business model hasintensified the competition for Cloud providers. Thisis due to the presence of multiple providers in thecurrent Cloud computing landscape that offer servicesunder heterogeneous configurations. Selecting particularCloud service configuration (e.g., VM type, VM cores,VM speed, cost, and location) translates to a certainlevel of Quality of Service (QoS) in terms of responsetime, acceptance rate, reliability, etc. In order to survivein such a competitive market, Cloud providers mustdeliver acceptable QoS to end-users of the hosted SaaSapplications, or risk losing them.However, one issue that arises from the transition toa SaaS model is the fact that the pattern of access to theapplication varies according to the time of the day, day R. N. Calheiros and R. Buyya are with the Cloud Computing andDistributed Systems (CLOUDS) Laboratory, Department of Computingand Information Systems, The University of Melbourne, Australia R. Ranjan is with the Commonwealth Scientific and Industrial ResearchOrganisation (CSIRO), Information and Communication Technologies(ICT) Centre, Acton, ACT, Australiaof the week, and part of the year. It means that in someperiods there are many users trying to use the serviceat the same time, whereas in others only a few usersare concurrently accessing the servers. This makes staticallocation of resources to the SaaS application ineffective,as during a period of low demand there will be excessof resources available, incurring unnecessary cost for theapplication provider, whereas during high utilization periods the available resources may be insufficient, leadingto poor QoS and loss of costumers and revenue.Clouds can circumvent the above problem by enablingdynamic provisioning of resources to applications basedon workload behavior patterns such as request arrivalrate and service time distributions. This means that extraresources can be allocated for peak periods and canbe released during the low demand periods, increasingutilization of deployed resources and minimizing theinvestment in Cloud resources without loss of QoS toend users [2].The challenge of dynamic provisioning is the determination of the correct amount of resources to be deployedin a given time in order to meet QoS expectations inthe presence of variable workloads like what is observedby Cloud applications. This challenge has been tackledmainly via reactive approaches [3], [4], [5]—which increase or decrease resources when predefined thresholdsare reached—or via proactive approaches [6], [7], [8]—which react to future load variations before their occurrence. The latter is typically achieved with techniquesthat can monitor, predict (e.g. estimating QoS parametersin advance), adapt according to these prediction models,and capture the relationship between application QoStargets, current Cloud resource allocation, and changesin workload patterns, to adjust resource allocation con-2168-7161 (c) 2013 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. Seehttp://www.ieee.org/publications standards/publications/rights/index.html for more information.

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI10.1109/TCC.2014.2350475, IEEE Transactions on Cloud Computing2IEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. XX, NO. YY, MONTH 2014figuration on-the-fly.In previous work [9], we introduced an architecture for proactive dynamic provisioning via workloadprediction—which determines how many requests persecond are expected in the near future—combined withanalytical models to determine the optimal number ofresources in the presence of the predicted load. Althoughthe proposed architecture recognized the need for workload prediction, it did not propose a concrete method forworkload prediction. Thus, in this paper we present thedesign and evaluation of a realization of its workloadprediction model using the Autoregressive IntegratedMoving Average (ARIMA) model [10]. ARIMA is amethod for non-stationary time series prediction that iscomposed of an autoregressive and a moving averagemodel, and was successfully utilized for time seriesprediction in different domains such as finance. The keycontributions of this paper are: We propose, design, and develop a workload prediction module using the ARIMA model. Our workapplies feedback from latest observed loads to update the model on the run. The predicted load isused to dynamically provision VMs in an elasticCloud environment for serving the predicted requests taking into consideration QoS parameterssuch as response time and rejection rate; We conduct an evaluation of the impact of theachieved accuracy in terms of efficiency in resourceutilization and QoS of user requests.Results show that our module achieves accuracy of upto 91%, which leads to efficiency in resource utilizationwith minimal impact in QoS for users.The rest of this paper is organized as follows. Section 2presents related work. Section 3 introduces the application and system models that support our workload prediction architecture, which is detailed in Section 4. Section 5 contains experiments evaluating the accuracy ofour proposed prediction architecture. Section 6 presentsthe simulation experiments evaluating the impact ofthe prediction in the efficiency of utilization of Cloudresources. Finally, Section 7 presents the conclusions andfuture work.2R ELATED W ORKThe approaches for workload prediction in Clouds canbe classified as reactive methods and proactive methods.Among reactive methods, Zhu and Agrawal [3] proposea method based on control theory to vertically scaleresource configurations such as VM types, VM cores, VMspeed, and VM memory. Vertical scaling is the processof increasing the resources available to each VM, ratherthan increasing the number of VMs (which is known ashorizontal scaling). Their approach also addresses thebudget constraints related to the workload execution.They apply the ARMAX model to predict CPU cycleand memory configurations required for hosting an application component. In contrast to this approach, weapply the ARIMA model to predict the future applicationworkload behavior, which is fed into the queueing modelfor calculating the required VM configuration.Bonvin et al. [4] propose a reactive method that scalesservers based on the expected performance and profitgenerated by changes in the provisioning. This methodis able to perform both horizontal and vertical scaling.Similar to Bonvin et al., Yang et al. [5] propose areactive method for changing the resource configurationof cluster resources driven by the load incurred by thehosted application. It is based on user-defined threshold conditions and scaling rules that are automaticallyenacted over a virtualized cluster.Zhang et al. [11] propose a reactive workload factoringarchitecture for hybrid Clouds that decomposes incoming workload in base workload and trespassing workload. The first one is derived from ARIMA-based prediction and handled by the local infrastructure, whereasthe second is handled by a public Cloud.The limitation of reactive platforms is that they react tochanges in workload only after the change in utilizationand throughput is observed in the system. Therefore, ifthe change is quicker than the reconfiguration time, endusers will observe poor QoS until the extra resourcesare available. Considering that changes in the workloadtypically follow patterns that are time-dependent, prediction techniques can avoid the above problem by triggering the reconfiguration before the expected increaseof demand, so when the situation arises, the system isalready prepared to handle it. Caron et al. [6] proposea method based on pattern matching for prediction ofgrid-like workloads in public Clouds. Gong et al. [12]propose a method for predicting resource demand ofVMs based on predicted application workload. Islamet al. [8] apply Artificial Neural Networks (ANN) andlinear regression for prediction of resources required forapplications. Sladescu et al. [7] presents a system basedon ANN to predict the workload to be experienced byan online auction in terms of intensity and location ofthe peaks.Although techniques such as linear regression cangenerate predictions quicker than ARIMA, they also demand workloads that have simpler behavior than thosethat time series and ANN-based methods can accuratelypredict. Furthermore, studies [13], [14] show that weband data center workloads tend to present behavior thatcan be effectively captured by time series-based models. Thus, to increase the applicability of the proposedarchitecture, we adopt ARIMA-based prediction for ourproposed architecture.Tran et al. [14] applied the ARIMA model for prediction of server workloads. It targets long-time prediction(up to 168 hours), whereas we target short timespans toachieve timely reaction to workload changes. Our prediction, which is designed to be short-term and thereforequicker to be performed, is suitable for Clouds becauseCloud platforms can quickly react to requests for more orless VMs. Our work also goes further ahead by applying2168-7161 (c) 2013 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. Seehttp://www.ieee.org/publications standards/publications/rights/index.html for more information.

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI10.1109/TCC.2014.2350475, IEEE Transactions on Cloud ComputingCALHEIROS ET AL.: WORKLOAD PREDICTION USING ARIMA MODEL AND ITS IMPACT ON CLOUD APPLICATIONS’ QOSfeedback from latest observed loads to update the modelon the run. Furthermore, in our work the predicted loadis used for dynamically provisioning VMs for servingthe predicted requests, and the impact of the predictionand provisioning is evaluated in regards to their effecton the QoS observed by end users.Other domain-specific proactive approaches that arerelated to Clouds include the approach by Nae et al.for Massively Multiplayer Online Games [15]. PachecoSanchez et al. [16] apply a Markovian model to predictserver performance in Clouds. Roy et al. [13] apply theARMA model for workload prediction in Clouds withthe goal of minimizing cost, whereas the main objectiveof our approach is meeting QoS target of applicationssuch as minimizing the request rejection rate, or maximizing resource utilization.3S YSTEMANDA PPLICATION M ODELSThe target system model of the architecture proposedin this paper consists of a public Cloud provider thatoffers to end users SaaS services backed by a PaaSlayer (Figure 1) [9]. The PaaS in turn interacts with anIaaS provider that can be a third party provider. Thetarget SaaS provider receives web requests, which areprocessed by the machines that are located at the IaaSlayer.For scaling up the infrastructure, the target providerdeploys a number of Virtual Machines (VM) that process end user requests. To simplify the management ofthe infrastructure and to take advantage of profilinginformation, a single VM configuration, consisting ofCPU power, amount of memory, and amount of storageis utilized by the SaaS provider. We also assume thatthe application has been profiled in the chosen VMconfiguration, so the provider has information about theVM’s expected performance.A single application instance executes on each VM,and since current Cloud providers do not support dynamic changes in the VM’s specifications without downtime, increasing and decreasing the total number of VMsrunning the application is the most suitable option forutilization of elastic computing infrastructures1 , as itbrings additional benefits such as higher fault toleranceand higher resilience to performance degradation causedby VM failures (as the crash of one of the VMs will notaffect the others, enabling the application to continueserving customers using the VMs that are running).The target application is web applications. Client requests consist of http requests that are processed by aweb server running on the VMs. QoS targets of relevance to the system are response time Ts , defined asthe maximum negotiated time in the SLA for serving a1. Although CloudSigma claims to support dynamic changes inthe hardware specifications of running VMs, such alterations requirethe VM to be stopped and restarted again, which imposes the samenot-negligible setup time as starting new VMs in conventional IaaSproviders.3user’s request and rejection rate Rej(Gs ), which is theproportion of incoming requests that cannot be servedwithout violating Ts [9].4S YSTEM A RCHITECTUREOne of the key characteristics of Clouds is elasticity,which enables the infrastructure to be scaled or down tomeet the demand of applications. However, instantiationof new VMs is not an immediate operation. Dependingon Cloud providers’ infrastructure architecture and theirhypervisor policies, launching a new VM involves anon-negligible start-up time. The start-up time is longenough to be noticed by the clients and dramaticallydecreases the users’ experience, which may result inabandonment of the application. Apart from potentialfinancial losses due to decline in the number of users, thesoftware provider may also be liable for not deliveringthe minimum required QoS.Although standby VM instances may be helpful fortolerating sudden increases in number of requests, thosestandby VMs are more likely to be idle most of the timesreducing the overall system utilization while increasingthe operational cost. Furthermore, if the increase in thenumber of requests exceeds the load that standby VMscan handle, the problem of poor QoS arises again. Thus,a different approach must be sought for the Cloudprovisioning problem.One approach that has been explored is based onworkload prediction: accurate predictions of the numberof end-users’ future service requests enable SLA’s QoStargets to be met with reduced utilization of Cloudresources. As requests pattern vary depending on theapplication type, this paper focuses on request patternsthat exhibit seasonal behavior, such as requests to Webor online gaming servers [15] [17]. To overcome theuncertainty in workload patterns in Cloud environmentsand minimize the estimation error in predicting futurerequests while maintaining optimal system utilization, inprevious work [9] we proposed an adaptive provisioningmechanism in order to achieve the following QoS targets: Automation: The whole process of provisioningshould be transparent to users; Adaptation: The provisioner should be aware ofdynamic and uncertain changes in the workload andreact to them accordingly; Performance assurance: In order to meet QoS targets, resource allocation in the system must bedynamic.The key components of the proposed provisioningsystem, depicted in Figure 1 are [9]:1) Application Provisioner: Receives accepted requestsfrom the Admission Control module and forwardsthem to VMs that have enough capacity to processthem. It also keeps track of the performance ofthe VMs. This information is passed to the LoadPredictor and Performance Modeler. The ApplicationProvisioner also receives from such module the2168-7161 (c) 2013 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. Seehttp://www.ieee.org/publications standards/publications/rights/index.html for more information.

10.1109/TCC.2014.2350475, IEEE Transactions on Cloud Computing IEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. XX, NO. YY, MONTH 2014 1 Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications’ QoS Rodrigo N. Calheiros, Enayat Masoumi, Rajiv Ranjan, Rajkumar Buyya Abstract—As companies shift from d

IEEE TRANSACTIONS ON CLOUD COMPUTING, VOL. XX,

It looks like you're using an ad-blocker